Plan
(Relationship)
(Relationship)
In this section:
Name and describe the two variables in your investigation. Including any units.
If you are using secondary data then you will need to state the data source.
Identify which variable is the explanatory variable and which is the response variable.
State the sample size.
Identify and describe at least 2 different sources of variation that could affect the data collection.
You need to discuss two different types.
State how you will manage these sources (or how you assume they have been managed if you are using secondary data).
If you are using primary data, then these will be the names of the things you measured/collected.
egs:
height: height of student in cm
gender: sex of student (male or female)
If you are using secondary data then these will be given in a table.
You will simply need to copy and paste the two variable names and descriptions to your report.
If you are collecting your own data, then you are using Primary Data.
(See below)
If you are using data that someone else collected, then you are using Secondary Data.
(See below)
What are the two variables that you are investigating?
Both these variables need to be quantitative (numeric) variables.
Identify the Explanatory and Response variables
Explanatory Variable
Sometimes called the Independent Variable.
This variable is the x-axis.
Response Variable
Sometimes called the Dependent Variable.
This variable is the y-axis.
What are the units of each variable?
How are you going to take any measurements?
What sources of variation are likely to affect your data?
How are you going to minimize the effects of these sources?
Same as for Primary Data above, except that you should also consider the following questions:
What is the data source of the data you are going to use?
Who collected the data?
Why did they collect that data?
When was the data collected?
How many records are in this dataset?
(You may not be able to answer all of the above questions with the information you are provided. You could try and research these if you have time).
What sources of variation might have affected the data collection?
How would these sources be minimized?
Could there have been some sources of variation that could not have been identified and minimized when the measurements were taken?