Question One:
We are curious about Facebook use on Penn State Campus. Here are some numbers collected about two different places to live on campus. Note these are different groups--a person who counts for once a day is not included in the once a week data. In addition, we were only interested in people who used facebook, not non-users.:
1) Add the margin totals and n to this table.
2) Create a joint distribution table for this information
3) What percent of University Park users say they use facebook at least once a week?
4) What percent of people who use facebook once a day are on the Commonwealth campus?
5) Find the expected values for each of these cells.
6) Can we say that the campus has an influence on how often people use facebook? Set up a null and alternate hypothesis, a significance level, and then run the appropriate tests.
7) Create a barplot with all of the information in a useful manner. Make it colorful.
Question Two:
Collect the data from this class about the use of a website, tool or some other statistic (homework completion, GPA, something along those lines). Do the same for another one of your classes. Each person in your groups should do this, so you end up with at least three columns
1) Create a two way table of your data.
2) Create a joint distribution table of your information.
3) Create marginal distribution tables from your data. You should end up with two--one for rows and one for columns.
4) Explain what the percentages in the marginal distribution tables represent. Again, do this with both marginal distribution tables.
5) Can we conclude that there is a statistical difference between the clases? Set up null and alternate hypothesis, a significance level, and run the appropriate tests.
6) Create a barplot with all of the information in a useful manner. Make it colorful.
Question Three:
There is a question as to whether or not smoking and level of school completed are related. We took an SRS of males from France and looked at the data below:
1) Create a two way table of your data.
2) Using the marginal distribution tables, outline the useful information (percentage wise) about these numbers.
3) Get a chi-squared value for each of the different cells. Which one appears to be the furthers off from what would be expected?
4) Test whether or not there appears to be a correlation between smoking habits and school completed.
5) Create some type of plot showing this information accurately and in an interesting way.
Question Four:
A study of people who refused to answer survey questions is shown below:
1) Give a brief analysis of the population of people here using the marginal tables and marginal proportion tables.
2) Are there any possible problems with the way we've broken up this data age wise?
3) At alpha=0.01, can we say that age affects likelihood to respond to a survey?
4) Make a graph that shows this data in both an accurate and interesting way.
Question Five:
Find a data set online that will allow you to make a two way table, analyze the results to see if there is a correlation, and make a neat graph.
this is obscure on purpose. do a good job.