1. Choose one of the data sets listed above in the Activity section or one that you find on your own and give a brief description of it. What specifically were the types of data (text, sounds, transactions, etc.) included in the data set you chose?
Answer
The data set that I chose deals with the temperatures of a certain location- the record high, record low, and normal ranges of temperatures. It also included annual snowfall in for each month. The data set includes average, high, and low temperatures as well as snowfall and rainfall for each month.
2. What new facts did you learn when exploring the data set? List at least 3 facts.
Answer
Three facts that I learned from this data set include that the record high temperature for september in Annapolis was 94 degrees, record high rain in June was 84 inches, and the overall average temperature is 58.9 degrees.
3. Write a question you have about the data set you chose. Now, convert that question into a hypothesis (a statement) with your prediction about the data.
(Hypotheses take the form of "If __________, then _________." For example, a hypothesis about the student debt data could be, "If the tuition costs are higher at an institution, the student debt will be higher."
Answer
If cold areas have a lower average temperature than warm ones, then Annapolis can be considered cold because it has an average temperature that is .2 degrees less than average.
4. Identify at least one security and/or privacy concern that is associated with the data in the data set you chose?
Answer
This data set raises the concern that people can misuse the data for things like planning crimes for times when it is so cold that nobody will witness them.
5. If your data set included a visualization, explain the purpose of the visualization. How would you change or improve the visualization? If it did not include a visualization, describe one that you think would be useful in understanding the data.
Answer
The visualization is a graph of the data. I would improve it by making it easier to read, so in a more traditional format. This would make it more effective for the readers who consume the data from the graph.