There is variation everywhere. No two objects are exactly alike.
Often we would like to measure the amount of variation but this can be tricky.
Here we are trying to measure the maths abilities of a group of students.
We want to see this real variation but our view can be obscured by other forms of variation.
Every individual is different.
Natural variation is the variation that occurs from individual to individual.
This is sometimes called individual-to-individual variation or real variation.
This is the variation we want to measure.
For the Year 9 students above, what is the true ability of each student in maths?
What inaccuracies might be introduced if we are not careful about how we measure this ability?
This is the variation introduced by inconsistencies in measuring devices or processes.
This is the difference between the actual value and the recorded value.
This could be due to any of the following:
An inaccurate measuring device.
egs:
A poorly designed test.
A faulty measurement tool
Inaccurate measurement caused by human error which could be:
transcriptional error - eg: writing 89 instead of 98
estimation error - eg: reading 98mm instead of 99mm.
Inaccurate measurement caused by the subject.
eg: An animal might be wriggling while it is being measured.
Examples for the maths ability measuring process:
A poorly designed test that had too many hard questions or questions not about maths.
A marker who did not know how to mark the test.
Multiple markers who marked differently - some were strict (their students got low scores) and some were lenient (their students got high scores).
Marks getting recorded incorrectly.
This is variation introduced when trying to measure variables whose value changes depending on when the measurement is taken.
Examples for the maths ability measuring process:
Students who took their test at the beginning of the day (more energy) did better than students who did their test later in the day (tired).
Students who took their test on a Monday morning did not do as well as students who took their test on other mornings.
This is variation that is caused by the experimental or measurement process.
Examples for the maths ability measuring process:
Some students taking tests get so nervous that they cannot not perform to their best (anxiety issues).
Other examples
Some people will have an increased heart rate when this is being measured in a public setting.
The growth of a plant in a laboratory will be different from that of one growing in the wild.
This is variation caused by the effect of a particular 'situation'.
Examples for the maths ability measuring process:
Students who sit their test just after doing some strenuous activity will be tired and not perform to their best.
Students who sit their test right after lunch will feel tired if they have had a decent meal.
Students who sit their test in a hot/stuffy/dark/loud space will not do as well compared to doing it in a bright/breezy/quiet space.
Students who did not have breakfast that morning will perform lower than normal
Students who did not have enough sleep the night before will not perform as well as they could have.
Other examples
A person who has just run to the classroom for the lesson will have a higher heart rate than normal.
The weight of an animal will be affected by how recently it has eaten.
A student who did not get enough sleep the night before will have a slower reaction time than usual.
This is variation that we see when sampling a population.
Sometimes we will already have access to a selection of data (Secondary data) that we can use in an investigation. If the dataset is large enough we do not have to use all of it to produce a reasonable analysis for our investigation. We can simply take a sample of this and analyze that instead.
For large populations, samples are likely going to be different. In order for us to minimize the effect of sampling variation on our analysis we simply need to take a large enough sample. For most of our purposes the minimum sample size will be 100.
We also need to make sure that our sample isn't biased towards certain outcomes. Our sample needs to be representative of the population we are referring back to. In most cases, we can prevent bias in a sample by taking a simple random sample.
In general, sampling variation is managed by:
Taking a large enough sample size(minimum of 100).
Performing a simple random sample.