Teacher notes here
Take 5 samples, and use them to answer your investigative question
Instructions (also on student Google Doc)
Make copies of the necessary documents by clicking below....
Slideshow to assist with running the activity....
The variation in a sample statistic from sample to sample.
Suppose a sample is taken and a sample statistic, such as a sample median, is calculated. If a second sample of the same size is taken from the same population, it is almost certain that the sample median calculated from this sample will be different from that calculated from the first sample. If further sample medians are calculated, by repeatedly taking samples of the same size from the same population, then the differences in these sample medians illustrate sampling variation.
The error that arises in a data collection process as a result of taking a sample from a population rather than using the whole population.
Estimates from different random samples (of the same size) will vary from sample to sample, and each estimate is likely to be different from the true value of the population parameter.
The sampling error for a given sample is unknown but when the sampling is random, for some estimates (for example, sample mean, sample proportion) theoretical methods may be used to measure the extent of the variation caused by sampling error.
In a real world situation we would only take one sample. Given our understanding of sampling error, we know the sample median from this single sample will only be an estimate of the population median.
We would suggest that the population median is somewhere around the sample median we have found. A nice way to present this is in the form of an interval, so we could say that we think the population median is somewhere between here and here. So we want to add a little bit on and take a little bit off our sample median to create an interval.......but how much should we add on /take off??
This will all depend on how much Sampling error is present in the situation. More sampling error would mean we need a wider interval to be confident it captures the true population median. Less sampling error would mean we can have a narrower interval. (We also don't want to just make the interval extremely wide so that we know it contains the true population median, the wider the interval becomes the less useful it becomes.)