NOTE: on my computer, two of the .csv files at the bottom are showing up as (0k). If something is the matter and the data did not transfer (worked for me, but I'm wary), PLEASE let me know and I'll re-upload them. Thanks and good luck!
As with the midterm, this should be completed on your own. You may use your notes and notes that you can find online, but you should not get help from other people or from stats forums. This is my check to make sure you understand the different concepts we have covered throughout the year.
Treat this as a data set. Give me appropriate graphs, R code, and reasoning. Make sure the formatting on this final is correct (that is, the graphs show up for other people, and it is not all tossed carelessly into an e-mail). The easier it is for me to read and get information, the easier it is for me to grade. Make sure questions are answered in full, with appropriate language.
Question 1: A professor asked their students “How many drinks do you have per session of drinking?” Below is the information for all the students who responded with some value above zero (in this case, we are not concerning ourselves with those who do not drink).
This question was asked of a large, diverse class, and thus can be considered an SRS of the campus. Based on previous experiments, we do know that people tend to exaggerate (which may or may not be relevant).
Do a complete analysis of this information. In your results, make sure to comment on the drinking behavior claimed by women, the drinking behavior claimed by men, and a comparison between those drinking habits.
Female Students:
2.5, 5, 3, 4, 7, 2, 4, 2, 9, 3.5, 6, 5, 7, 5, 5, 1, 1, 5, 5, 3.5, 7, 2.5, 1, 2.5, 3.5, 1, 3, 4, 3.5, 3, 5, 2.5, 2.5, 2, 8, 2, 3, 4.5, 3, 3, 1, 6, 1, 2.5, 9, 4, 1, 7, 6, 5, 10, 5, 10, 3, 3, 3, 5, 5, 4, 7, 3, 7, 6, 3, 4, 4, 3, 3, 4, 8, 3, 9, 3, 4, 3, 4, 3, 6, 8, 4, 4, 2.5, 6.5, 4, 4, 1, 6, 4, 2.5, 4, 7, 2, 6, 7, 4
Male Students:
7, 16, 6, 15, 7, 8, 6, 7.5, 4, 8, 3, 3, 4, 4.5, 8, 8, 8, 10, 7, 2, 5, 15, 5, 4.5, 7, 6, 4, 3, 9, 10.5, 4, 4, 12.5, 4, 7, 8, 6, 5, 3, 1, 7, 6, 5, 2, 15, 5, 3, 10, 2, 5, 2, 11, 5, 1, 10, 5.5, 6, 4.5, 6.5, 9, 7, 9, 3, 18, 6, 1, 8, 9, 10, 4, 4, 12, 7, 5, 10, 3, 10, 4, 8, 8, 4, 10,
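One way to set up the comparison between the two groups is sketched below in Python; the same steps carry over to R's t.test. Treat it as a sketch only: the helper name welch_t is made up for illustration, the two short lists are placeholder values rather than the class data, and Welch's unequal-variance t statistic is one reasonable choice, not the only acceptable analysis.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's two-sample t statistic (hypothetical helper)."""
    # Squared standard error contributed by each group: s^2 / n
    va = variance(a) / len(a)
    vb = variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# Placeholder values, NOT the survey data above
group_a = [1, 2, 3]
group_b = [2, 3, 4]

t_stat, df = welch_t(group_a, group_b)
print(f"t = {t_stat:.3f}, df = {df:.1f}")
```

Before relying on a t procedure, look at a graph of each group (skew and outliers matter), and describe each group on its own before comparing them.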
Question 2:
How long will you tolerate being put on hold? An airline has a toll-free number for reservations, and wants to know what sort of audio to play in the background while you are on hold: an advertisement for the airline, Muzak, or classical music. They wish to keep you on hold as long as possible, to increase the chances that they can eventually sell you a ticket. They selected 15 callers at random, played one of the three types of audio to each, and timed how long it took before each caller hung up (in minutes). The data can be found in the file onhold.csv. The study kept the samples small, to avoid alienating customers. Do an analysis of this experiment, and turn in a concise, clear, and complete report.
Question 3:
Do children tend to learn better when raised by natural parents than when adopted? Susan Farber performed an experiment on this question, in which she found 32 pairs of identical twins, one of whom had been adopted and raised by people other than their natural parents. She tested the IQs of both children in each pair; the results are given in kids.csv. Do an analysis of the experiment and the data, and turn in a concise, clear, and complete report.
Question 4:
Assume IQ scores are N(107, 15).
a) Suzie scored a 133. What is her normalized score (z-score), and what percent of the population did she do better than?
b) Jon scored a 91. What is his normalized score (z-score), and what percent of the population did he do better than?
c) What percent of the population scores between a 95 and a 113 on the IQ test?
d) In order to join Mensa, you must score in the top 2% on the IQ test. What score must you achieve?
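The kind of calculation these four parts call for can be sketched in Python with the standard library's NormalDist (in R, pnorm and qnorm play the same role). This is a sketch under one stated assumption: the 15 in N(107, 15) is read as the standard deviation, not the variance. The variable names and the z_score helper are made up for illustration.

```python
from statistics import NormalDist

# Assumption: N(107, 15) means mean 107 and standard deviation 15
iq = NormalDist(mu=107, sigma=15)


def z_score(x, dist=iq):
    """Standardize a raw score: (x - mean) / sd (hypothetical helper)."""
    return (x - dist.mean) / dist.stdev


z_high = z_score(133)                     # standardized score for 133
below_high = iq.cdf(133)                  # proportion scoring below 133
z_low = z_score(91)                       # standardized score for 91
below_low = iq.cdf(91)                    # proportion scoring below 91
between = iq.cdf(113) - iq.cdf(95)        # proportion between 95 and 113
cutoff_top2 = iq.inv_cdf(0.98)            # score with 98% of people below it

for name, value in [("z(133)", z_high), ("P(X < 133)", below_high),
                    ("z(91)", z_low), ("P(X < 91)", below_low),
                    ("P(95 < X < 113)", between), ("top 2% cutoff", cutoff_top2)]:
    print(f"{name}: {value:.4f}")
```

A sketch of the normal curve with the relevant area shaded is still worth including alongside the numbers, since it shows which tail each calculation refers to.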
Question 5:
Below is the number of reported/verified tornadoes in the United States each year from 1953 until now (NOAA did not start collecting data until then).
The media often claims that there are more tornadoes per year now than there were in the past. Explore this claim, using what you know of statistics. Use the appropriate tests, etc.
According to your regression line, how many tornadoes should we expect this year? How about in 2030?
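The mechanics of predicting from a least-squares line can be sketched as follows, in Python. It is only a sketch: fit_line and predict are hypothetical helper names, and the years and counts are placeholder values, not the tornado data.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Placeholder values, NOT the tornado counts
years = [1, 2, 3]
counts = [3, 5, 7]

slope, intercept = fit_line(years, counts)


def predict(year):
    """Predicted count for a given year from the fitted line."""
    return intercept + slope * year


print(f"slope = {slope:.2f}, intercept = {intercept:.2f}, predict(4) = {predict(4):.2f}")
```

Keep in mind that a prediction for a year beyond the range of the data (such as 2030) is an extrapolation: it assumes the linear trend keeps holding, so treat it with more caution than a prediction inside the observed years.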
A global warming activist is claiming that the rise in tornadoes is due to rising temperatures and a shift in the Gulf Stream. Whether or not you agree with this statement, write a brief paragraph outlining other possible explanations for this data, or problems with it, that might make it difficult to draw such a connection from a correlation.