DUE: September 27th by 11:59 PM.
Before we begin, a word on working with others. This Assignment is designed with the expectation that you will be working with other people. Talking through decisions (when to remove an outlier, how to make a nice boxplot, etc.) are things that are, by nature, social choices.
That being said, when you turn something in, you should be able to claim it as your own. I will ask that you show all needed r code. This means that if you create the graph in r, I would like the code you used to make that graph as well. When answering, be sure to give descriptions as well as the graphs, so that I can understand your reasoning.
WHEN EMAILING YOUR FINAL PIECE, PLEASE LABEL THE EMAIL 'ASSIGNMENT 1'. This will help me organize all of the wonderful assignments being sent to me.
Question One:
The data for this question can be found on page 56, table 2.1
a) Search each of the data sets for possible outliers, using the methods we have discussed. Allow me to follow your steps in the write up (this means show me the summaries, explain which method you use to check, and any graphs you might use along the way). If you remove any outliers (or leave any possible outliers in the in data), please explain your choices.
b) Compare and contrast the differences in the data sets.
c) Create some type of graphic that allows us to compare all of the data sets at once.
Question 2:
The data below is a partial list. An experiment was conducted where people were given a loan of $50 and then were asked to pay the money back over time. This list is how many days it took to get all of the money back (reminders were sent to the subjects, though there were no threats of kneecap breakings or any other such thing). We have taken some of the values below. We have also excluded all of the people who either gave the money back immediately or never paid back the money:
43, 45, 53, 56, 56, 57, 58, 66, 67, 73, 74, 79, 80, 80, 81, 81, 81, 82, 83, 83, 84, 84, 88, 89, 91, 91, 92, 92, 97, 99, 99, 100, 101, 102, 102, 102, 103, 104, 107, 108, 109
a) Search each of the data sets for possible outliers, using the methods we have discussed. Allow me to follow your steps in the write up (this means show me the summaries, explain which method you use to check, and any graphs you might use along the way). If you remove any outliers (or leave any possible outliers in the in data), please explain your choices.
b) Create the most appropriate graph/presentation of this data for use when explaining this information to others.
Question 3:
Ask 25 different people on campus what time they go to bed and what time they wake up on a specific day (specify Monday, Tuesday, etc. so the data is all about one evening--there are big differences usually between a Saturday and a Tuesday, so we're trying to avoid graphing that).
a. Using the information collected, create a third set of data of how long each person slept.
b. Create a histogram, 5 point summary, and boxplot for the data created in part a. If there are any influential points, identify them. If you believe them to be outliers, make a case and then remove them from your data.
c. Create ONE GRAPHIC that easily shows when each preson went to sleep, when they woke up, and how long they slept. Try to use r to do this--but it will stretch your abilities. If r does not work, feel free to draw by hand, create something awesome, whatever and attach it to the e-mail you send me.
Question 4:
This is the open ended question.
Below is a .csv file I pulled from the internet. The confusing column names mean the following:
sumlev -- honestly not sure
state -- a numbering system for the states
region -- a numbering system for the regions of the united states
division -- something similar to region
name -- name of the states
popestimate2009 -- state population as of 2009
popest18plus2009 -- state population as of 2009 over 18 years old
PCNT_POPEST18PLUS -- a percent of people in the state over 18 years old (found using the two previous columns)
Your job is to create some presentation of some of this data. Find a piece interesting to you. Use this as an excuse to try something new and different or just make something look really good.
I undersand this question is vague -- it is vague on purpose. Go forth and create.