Outline

To navigate this material, you can move through the topics in the order listed below, or focus on only the conceptual modules or only the R modules. The total number of video minutes is shown for each module, but expect a module to take longer than that: we encourage you to think carefully about the questions between the videos and to pause to run the R code shown in the videos. Note that you can change the playback speed in the YouTube settings at the bottom right of each embedded video.

Unit 1

Sampling and Bias (69 minutes)

Beginning with R (11 minutes)

R Basics 1 (47 minutes)

R Basics 2 (27 minutes)

Intro to Hypothesis Testing (19 minutes)

Intro to Non-Parametric Tests (47 minutes)

Rank Sum Tests (20 minutes)

More Non-Parametric Tests (38 minutes)

Non-Parametric Tests for Pairs (30 minutes)

Unit 2

Sampling Variability and Measures of Dispersion (44 minutes)

Central Limit Theorem (17 minutes)

R Part 2 (33 minutes)

Data Ethics 1

Parametric Hypothesis Tests, Part 1: Z-test and t-test (42 minutes)

Parametric Hypothesis Tests, Part 2: Example, Paired Data, Confidence Intervals (48 minutes)

Robustness and Power (38 minutes)

Log transformations for t-tests (22 minutes)

Optional derivations (36 minutes)

Unit 3

Data Cleaning and Management (70 minutes)

R Part 3 (12 minutes)

Reshaping and Manipulating Data (tidyverse) (42 minutes)

Data Ethics 2

Practical vs. Statistical Significance (8 minutes)

Unit 4

Splines (6 minutes)

Regression trees (10 minutes)

Connecting splines, trees, and linear regression (74 minutes)

R Part 4a (22 minutes)

Intro to linear regression (31 minutes)

Estimating regression coefficients (75 minutes)

R Part 4b (9 minutes)

Optional proof that slope coefficient is unbiased (15 minutes)

Missing Data (75 minutes)

Unit 5

Data Ethics 3

Inferences about Regression (21 minutes)

Intervals for Regression (38 minutes)

Log Transformations for Linear Regression (30 minutes)

R Part 5 (43 minutes)

Git tutorial (this is a non-R technical tool) (30 minutes)

SQL (15 minutes)

Web scraping (18 minutes)

ggplot2 (19 minutes)

Mapping (15 minutes)

Unit 6

Sums of Squares (63 minutes)

ANOVA (19 minutes)

Regression and ANOVA are different ways of formatting output from the same model! (72 minutes)

Model selection 1 (54 minutes)

Model selection 2 (58 minutes)

Multiple Comparisons (45 minutes)