Skip to content

Notes, scripts, and exercises for data science workshops put on through ECOSCOPE (http://ecoscope.ubc.ca/)

License

Notifications You must be signed in to change notification settings

lmcewen/workshops_data_science

 
 

Repository files navigation

Workshops

Notes, scripts, and exercises for data science workshops through ECOSCOPE (http://ecoscope.ubc.ca/)

Introduction to R (2hr)

This is a truly introductory workshop for beginners with no experience in R. In this workshop, we introduce you to R and RStudio at the beginner level. This condensed 2-hour workshop is meant to get you started in R and acts as a pre-requisite for our more advanced workshops.

In it, we cover:

  • R and RStudio
  • RStudio projects
  • R scripts
  • Installing packages
  • Reading in data as a data frame
  • Vectors, single values, and data types
  • Basic data visualization
  • The help function

We recommend this workshop be given across a single 2 hour session.

Introduction to the tidyverse

In this workshop, we provide a brief introduction to RStudio, then delve into data manipulation and graphics in the tidyverse including the packages dplyr, tidyr, and ggplot2. We teach different ways to manipulate data in tabular and text forms as well as the critical concepts underlying the grammar of graphics and how they are implemented in ggplot. We will use RStudio, a powerful but user-friendly R environment, and teach you how to use it effectively.

You will learn how to:

  • create an R project and import data from a file into R,
  • create subsets of rows or columns from data frames using dplyr,
  • select pieces of an object by indexing using element names or position,
  • change your data frames between wide and narrow formats,
  • create various types of graphics,
  • modify the various features of a graphic, and
  • save your graphic in various formats

We recommend this workshop be given across 6 hours as 3 x 2 hour sessions. Participants with little to no prior experience should consider taking or reviewing our "Introduction to R" workshop prior to this tidyverse series.

Reproducible research

In this workshop, we introduce computational reproducibility and its importance to modern research. We will teach the general principles for reproducible computer-based analyses, along with specific methods and tools for reproducibility and version control with RStudio and GitHub.

You will learn how to:

  • Construct reproducible, automatable workflows in R with scripts and Make
  • Create reproducible documents using Rmarkdown to include underlying code / computations with relevant graphical and statistical results in several different formats (reports, presentation slides, handouts, notes)
  • Use Git version control
  • Integrate version control with GitHub for both personal and group projects

We recommend this workshop be given across 6 hours as 3 x 2 hour sessions. Participants with little to no prior experience should consider taking or reviewing our "Introduction to R" workshop prior to this series.

Statistical models in R

Under development

In this workshop, we introduce various types of regression models and how they are implemented in R. We cover linear regression, ANOVA, ANCOVA and mixed effects models for continuous response data, logistic regression binary response data, and Poisson and Negative Binomial regression for count response data.

You will learn:

  • the assumptions behind the different models
  • how to interpret the main effects and interaction terms in a model
  • various experimental design concepts that help maximize the power

In R, you will learn how to:

  • build a statistical model
  • define and manipulate model terms
  • use the lsmeans package to answer specific research questions

We recommend this workshop be given across 6 hours as 3 x 2 hour sessions. Participants with little to no prior experience should consider taking or reviewing our "Introduction to R" workshop prior to this series.

Intermediate R programming

Under development

In this workshop, we teach you to use R as a programming environment, allowing you to write more complex, yet clearer data analysis code. We will teach you three fundamental concepts of R programming: functions, classes, and packages. 

You will learn how to:

  • Define objects, classes, and attributes in data and built-in functions
  • Write functions including for loops
  • Output large result tables to your hard drive
  • Write and publish an R package

We recommend this workshop be given across 6 hours as 3 x 2 hour sessions. Participants with little to no prior experience should consider taking or reviewing our "Introduction to R" workshop prior to this series.

About

Notes, scripts, and exercises for data science workshops put on through ECOSCOPE (http://ecoscope.ubc.ca/)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 99.9%
  • Other 0.1%