These are the questions for your sythesis that is due on Thursday. You may work with other people EXCEPT on question number 1. Number one you need to turn into me separately. We will be sharing these with the class, so be aware of that fact.
1) Your final project focuses on some information you will collect. It might focus on something here at MAUHS, but it might also be something else that interests you out there in the 'real' world. Examples might be: what people eat for lunch each day, can people tell the difference between coke and pepsi, is there a correlation between SAT scores and GPA, what are the political views of people at MAUHS, what can be said about the population of students at MAUHS, what can be said about the teacher population of MAUHS. You can also find data from somewhere else that hasn't already been analyzed thoroughly and use it. There are many other things that can be done, and you should try to make it your own--don't just pick something on my list.
Here are two sites that might have data you can be interested in:
http://www.statsci.org/datasets.html
http://lib.stat.cmu.edu/datasets/
a. What information interests you? In order to collect it, will you need to conduct an experiment? This information can be either from MAUHS or from somewhere else.
b. If you are going to do an experiment, sketch a rough outline of how you will implement. If you are going to write a survey, create an outline of the questions you will ask. If using data from somewhere else, explain how you will acquire this information (from student life, the internet, etc. etc.). If an internet source, name it.
c. Once you have collected your data and have it available, how do you plan to display it to others? Sketch an idea of what you want based upon fake data. This should incorporate all the information that you think you are going to need, all the questions that you are planning to ask, and anything else you feel is important.
The following questions you may work with others on like a regular Data Set.
2) Find table 1.5 on page 76 (you can read about the information in question 1.167).
a. Run a statistical analysis on the biological clock data. Remove any outliers (if any). Make sure to make appropriate graphs. Explain what this analysis tells you about biological clocks for this plant.
b. What do you think might cause variation in the different plant biological clocks?
3) p. 95, question 2.20
4) Many people claim that drinking caffeine makes them work better, have more focus, perform better, and stay awake/stay alert. You are doubtful of this claim and want to create an experiment to test whether or not this is true.
a) Using our format, create: Goal, Hypothesis, Cautions, and Procedure. Make sure to fill these areas out completely. Consider what you may or may not need to do in order to show whether or not it is caffeine that's causing a difference as opposed to something else (time of day, familiarity with a subject, placebo effect, etc. etc.)
b) Now that you've created a possible procedure, what other problems are there that could occur when you would try to do this experiment?
5) p 199, question 3.59. The question only asks for your random sample. I want to know EXACTLY HOW YOU GOT THAT SAMPLE. Walk me through the process in detail, from first looking at the information to the five you have chosen.
6) p 200, question 3.70. The question only asks for your random sample. I want to know EXACTLY HOW YOU GOT THAT SAMPLE. Walk me through the process in detail, from first looking at the information to the five you have chosen.
7) p 201, question 3.77, parts a and b.
8) p 99, table 2.1. Here is a lot of information about NBA teams as a business.
VALUE: how much the team as a whole is worth
REVENUE: how much the team makes simply by playing basketball (this does not include merchandise, etc)
DEBT: how much the team owes (for their stadium and other things, perhaps)
INCOME: takes into consideration new debt, revenue, and all other sources of money coming in (or going out)
a) make an appropriate graph and sufficient explanation about one correlation in these numbers that is 'interesting'.
b) make an appropriate graph and sufficient explanation about a non-correlation (in other words, talk about one relationhip that you thought might have a correlation but really didn't).
9) Pitchers in the Major league have top speeds (in MPH) on their fastballs that correspond roughly to a curve with N(90, 4).
a) The last time Justin Verlander pitched, his top speed was 97 MPH. Find a z-score for this pitch.
b) 99% of all fastballs are below what speed?
c) 95% of all fastballs are between what speeds? (use the 68-95-99.7 rule)