the best: http://cran.r-project.org/doc/contrib/Verzani-SimpleR.pdf
you want to do something in r? find how to do it with the link above.
Problems for Monday:
1) The average height for an 18-year-old female in America is 64.2 inches. You wonder whether or not the mean height of this year's female graduates from Mount Anthony is different from the national average. You measure an SRS of 78 female graduates and find that sample_mean = 63.1 inches. What are your null and alternate hypotheses?
2) You own a company that puts people on an assembly line. As part of a morale directive, you are trying to see if people have a difference in job satisfaction between either being self-paced or machine-paced. All subjects will work in both settings, in random order. After two weeks in each setting, the workers will take a test of job satisfaction.
3) Change in sweetness for a cola.
We want to see if the storage of a soda causes the soda to lose sweetness over time. We have several different professional tasters come in and drink sodas. We then store the same batch of sodas for four months at a high temperature and have them taste again. Both were rated on a scale of 1-10. We subtract the new from the old and get these values:
sweet<-c(2,0, 0.4, 0.7, 2.0, -0.4, 2.2, -1.3, 1.2, 1.1, 2.3)
[note: the higher the number, the more the loss of sweetness]
sd_pop = 1
4) My Prius is supposed to get 35 mpg (with a sd_pop of 3.5). I'm wondering if this is still true after all these years, so I've recorded the gas mileage of my car over the past 24 tanks of gas.
mileage<-c(28, 30, 32, 33, 27, 36, 40, 28, 27, 32, 36, 33, 39, 28, 39, 28, 28, 34, 32, 30, 37, 33, 31, 26)
What should we do?
HW: read section 6.2. Do problems 6.36, 6.37, 6.50, 6.51
Tuesday:
Two groups take the SAT. The first group is a control, the second group has gone through an intensive battery of trainings trying to raise their scores. The SAT has a sd.pop = 100
Here are the two sets:
sat.con<-c(650,670,530,440, 470,550,600,730,400,500,520,640)
sat.prep<-c(520,590,740,710,540,490,530,650,690,560,650,590)
1) Set Ho and Ha.
2) Check data
3) run analysis
4) conclude
Question 2:
Same as before but:
sat.con average = 558.3333
sat.prep average = 564.122
n= 1000
sd.pop = 100
Is there a statistical difference here?
Question Three: New r Code for us:
Categorical Data: creating a graph with favorite colors
colors<-c("red","red","blue","blue","green","orange","blue","blue","green","red","yellow")
barplot(table(colors))
make it all nice looking.
Wednesday-Friday: data set of everything from the get go.