Visualizations to help understand statistics
(a very intimidating formula the first few times you see it!)
Try thinking of it visually: we want the average area of all the rectangles formed when we move from the point of means out to each data point.
Wait! That sounds even harder to understand! But once you see the visuals it will make more sense.
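If you want to play with the idea in Stata before (or after) watching, here is a minimal sketch. The data values are made up for illustration and this is not the code behind the videos: each observation forms a rectangle with the point of means, and averaging the signed areas gets you (essentially) the covariance.

* Made-up data, just for illustration
clear
input x y
1 2
2 4
3 5
4 4
5 7
end

* The point of means
quietly summarize x
scalar xbar = r(mean)
quietly summarize y
scalar ybar = r(mean)

* Each point forms a rectangle with sides (x - xbar) and (y - ybar);
* the sign says whether the point sits in a "positive" or "negative" quadrant
gen double area = (x - scalar(xbar)) * (y - scalar(ybar))
list x y area

* Average area (divide by n) versus Stata's covariance (which divides by n - 1)
quietly summarize area
display "average rectangle area (divide by n):   " r(sum)/_N
display "sample covariance (divide by n - 1):    " r(sum)/(_N - 1)
correlate x y, covariance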
Simple, fake data
Watch it a few times through to see how it works.
Lung Cancer Mortality and Percent with High School Education
Points scored and total yards of offense
Special thank you to Aimee Hong (MCAS '26) for coding the Covariance items above in Stata.
Aimee has also created a template do file for others (students or instructors) who use Stata, so you can create similar videos with any two variables you like. You will need to modify some items in the code, but she leaves notes so you can see where to make the changes. (You will also need to download ffmpeg.)
That template code is found here.
These videos assume a general understanding of the Power of a Test.
Here you can see what happens to the power of a test when the sample size increases and when the distance between the hypothesized mean and the true mean increases.
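If you have Stata 13 or newer, you can see the same two patterns with the built-in power command. The numbers below are assumed values, not the ones used in the videos: a hypothesized mean of 0, a standard deviation of 1, and a two-sided test at the 5% level.

* Baseline: true mean 0.5, n = 25
power onemean 0 0.5, sd(1) n(25)

* Larger sample size -> higher power
power onemean 0 0.5, sd(1) n(100)

* Hypothesized and true means farther apart -> higher power
power onemean 0 1, sd(1) n(25)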
Credit to Luke DeMartin (MCAS '23) for creating videos in this section.
What does this really mean?
Imagine three possible lines that we might choose as our prediction line. Which will we select?
Caution: Some texts call this sum of squared residuals (SSR)!
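As a concrete sketch (using Stata's shipped auto dataset purely for illustration, with three made-up candidate lines rather than the ones in the videos), you can compute each line's sum of squared residuals directly and see which is smallest, then compare with what regress chooses.

sysuse auto, clear

* Three hypothetical prediction lines for price as a function of weight
gen double yhat1 = 0    + 2.0*weight
gen double yhat2 = 1000 + 1.5*weight
gen double yhat3 = -500 + 2.1*weight

* Sum of squared residuals for each candidate line
foreach line in yhat1 yhat2 yhat3 {
    gen double sq_`line' = (price - `line')^2
    quietly summarize sq_`line'
    display "`line': sum of squared residuals = " %16.0fc r(sum)
}

* The least-squares line makes this sum as small as possible
regress price weight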
Luckily there is an analytical solution to finding the line that minimizes the sum of squared residuals!
Therefore, Stata doesn't really have to create all the possible lines and check to see which one has the lowest sum of squared residuals.
Instead, we use calculus: take the first-order conditions and set them equal to zero. The algebra will not be presented here, but the resulting outcomes for simple regression are that the slope coefficient = Cov(X,Y)/Var(X) and that every regression line goes through the point of means.
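You can convince yourself of both results with a quick check (again the auto dataset, used only as an assumed example): compute Cov(X,Y)/Var(X) by hand, compare it with the slope from regress, and plug the mean of X into the fitted line to see that it returns the mean of Y.

sysuse auto, clear

* Pieces of the analytical solution, computed by hand
quietly summarize weight
scalar xbar = r(mean)
scalar varx = r(Var)
quietly summarize price
scalar ybar = r(mean)

gen double dev_xy = (weight - scalar(xbar)) * (price - scalar(ybar))
quietly summarize dev_xy
scalar covxy = r(sum) / (r(N) - 1)

display "slope from Cov(X,Y)/Var(X): " scalar(covxy) / scalar(varx)

* Compare with Stata's regression, then check the point of means
regress price weight
display "fitted value at the mean of X: " _b[_cons] + _b[weight]*scalar(xbar)
display "mean of Y:                     " scalar(ybar)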
Credit to Aimee Hong (MCAS '26) for creating videos in this section.
Below we present two of the three components of R-squared. NOTE: the missing component, the sum of squared errors (SSE), is shown in the previous section!
We then have an overview video of how to construct R-squared.
The total sum of squares (SST): the total variation around the mean of the dependent variable.
Also, conceptually, this is the sum of squared errors you would get if you couldn't use the information on X to predict Y
(i.e. if you simply predicted the mean of Y).
The sum of squares due to regression.
The variation in Y that can be accounted for by using X to predict Y.
Be careful!
Some texts call this SSE or sum of squares explained!
R^2 = 1 - SSE/SST
(the SSE video is shown in the section above)
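Putting the pieces together in Stata (the auto dataset again, assumed purely for illustration): build SST from squared deviations around the mean of Y, build SSE from the regression residuals, and confirm that 1 - SSE/SST matches the R-squared that regress reports.

sysuse auto, clear
quietly regress price weight

* SSE: squared residuals from the regression
predict double resid, residuals
gen double res2 = resid^2
quietly summarize res2
scalar SSE = r(sum)

* SST: squared deviations of Y around its own mean
quietly summarize price
gen double dev2 = (price - r(mean))^2
quietly summarize dev2
scalar SST = r(sum)

* SSR (sum of squares due to regression) is whatever is left over
scalar SSR = scalar(SST) - scalar(SSE)

* R-squared three ways
display "1 - SSE/SST:          " 1 - scalar(SSE)/scalar(SST)
display "SSR/SST:              " scalar(SSR)/scalar(SST)
display "R-squared from Stata: " e(r2)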
Credit to Aimee Hong (MCAS '26) for creating videos in this section.