Myles McMillan
mmcmillan@daltonschool.kr
Myles McMillan
mmcmillan@daltonschool.kr
Data fluency is becoming increasingly important for success in the modern world and has also become a major feature in standardized testing with the proportion of data related questions increasing significantly in the SAT over recent years. Data Science introduces students to the main ideas in data science through free tools such as Google Sheets, Python and Data Commons. Students will learn to be data explorers in project-based units, through which they will develop their understanding of data analysis, sampling, correlation/causation, bias and uncertainty, probability, modeling with data, making and evaluating data-based arguments, the power of data in society, and more!
Unit 1: Data Tells a Story
Week 1: Variability: What are variability, data, and models? How can I use data to tell as story? What is data ethics?
Week 2: Data Ethics: What are variability, data and models? How can I use data to tell a story? What is data ethics?
Week 3: Data Representations: How can I use data to tell a story? How do I create a visual representation of my data? How do I model with data?
Week 4: Model, Analyze, and Synthesize Data: How can I use data to tell a story? How do I create a visual representation of my data? How do I model with data?
In this unit students will be introduced to data science through a reflection of their own experiences using self-generated data, an exploration of a larger dataset of people’s media use, and an analysis of business data. Through these activities students will learn about the data science process, begin using data to tell stories, and think about the ethics involved in working with data. Students will make sense of the questions: What part of the story is told by data? What is variation? How is data generated? What data is gathered about themselves? During the unit, students will be learning to use CODAP and Google Sheets as they consider the ways data can be used to model the world. As students learn about data, they will be introduced to many different ways to represent data and will explore univariate, bivariate, and multivariate data. From the data visualizations they will consider what story they can tell from their data.
Unit 2: The Data of Our Community
Week 1: How can univariate data be described and visualized? How can you tell a story with univariate data?
Week 2: How can univariate data be described and visualized? How can you compare data distributions?
Week 3: How can univariate data be described and visualized? How can you compare data distributions?
Learning From Data Distributions Project Criteria Rubric (Part 1)
In Unit 2 students will explore different ways of modeling data, starting with the basic models of measures of center and spread, as well as considering sampling. Students will likely already be familiar with the calculations needed to find measures of center and spread for small data sets, but this unit takes a deeper dive into understanding the concepts, deeper meanings, limitations, and the impact of outliers in the context of data modeling. Students will explore distributions and the role of probability in understanding them. Additionally, students will collect their own data and compare it to a larger data set. During the project, students will consider their sampling choices and those of the larger data set to see how such decisions impact the comparisons drawn between the two data sets.
Unit 3: Water in Your Life
Week 1: What is data and what is it good for? What can you do with data? - Line of Fit
Week 2: What is data and what is it good for? What can you do with data? - Correlation Coefficient
Week 3: What is data and what is it good for? What can you do with data? - Correlation vs. Causation
In this unit, students will learn about bivariate data through discussions and data explorations around the theme of water usage. Students will explore scatter plots as a visual way to represent the relationship between two variables, draw their own lines of best fit, and learn how data scientists determine and analyze lines of best fit . Throughout the unit, students will use the analytic tools of Google Sheets, CODAP and Tableau to make and refine claims about water usage based on both self-collected data and large, publicly available data sets.
Unit 4: Shuffling Songs
Week 1: Do the genres you hear played on shuffle represent the genres of the songs in our playlist? - Probability and Shuffling Songs
Week 2: Do the genres you hear played on shuffle represent the genres of the songs in our playlist? - Programming With Python
Week 3: Do the genres you hear played on shuffle represent the genres of the songs in our playlist? - More Python
Week 4: Do the genres you hear played on shuffle represent the genres of the songs in our playlist? - Using our Model to Communicate results
In this unit, students will again consider the modeling process and the role played by variation, reflecting on the data collected from simulations and the ways data can help answer probabilistic questions and leverage this power for decision-making. In the process of creating powerful simulations, students will learn the basics of programming, which will continue to be a powerful tool for data analysis. During this unit students will use Python in Edu-Blocks and Colab.
Unit 5: Skin Tones and Representation - Categorical Data
Week 1: How can I gather data to analyze skin tone representation in the media and present my findings in a coherent article? - Asking Questions and Gathering Data
Week 2: How can I gather data to analyze skin tone representation in the media and present my findings in a coherent article? - Modelling
Week 3: How can I gather data to analyze skin tone representation in the media and present my findings in a coherent article? - Analyzing and Synthesizing
Week 4: How can I gather data to analyze skin tone representation in the media and present my findings in a coherent article? - Communicating
Skin Tones and Representation Group Slides: Criteria and Feedback Rubric
Skin Tones and Representation Individual Final Project: Criteria and Feedback Rubric
In this unit, students explore the issues around skin tone representation in the media through a data-based exploration of skin tone representation in magazines. Students conduct both a categorical and a numerical analysis and compare the benefits and drawbacks of both. In their categorical analysis students create two-way tables based on their interpretation of the skin tones of the people pictured, and in the numerical analysis they use the RGB values of the images themselves. After both analyses, students chose an audience for whom the information would be relevant and write a data-supported piece to share their findings with that audience. During the unit students will work in Google Sheets and Google Colab (Python).
Unit 6: What's the Best Place for Me?
Week 1: How can I build a prioritization model that will create a ranking of the best places for me to live in? - Asking Questions, Collecting and Organizing Data
Week 2: How can I build a prioritization model that will create a ranking of the best places for me to live in? - Modelling, Analyzing and Synthesizing
Week 3: How can I build a prioritization model that will create a ranking of the best places for me to live in? - Communicating Results
City Ranking Group Final Product: Criteria and Feedback Rubric
City Ranking Individual Analysis & Reflection: Criteria and Feedback Rubric
In this unit students will build a prioritization model to create a ranking. In this process, students will decide what they value, collect variables based on their values, gather and clean data, create functions to combine variables, normalize data, and create a weighting system for prioritizing their data. Students will do a sensitivity analysis on their weighting system. During this process, students will discuss how bias impacts mathematical models. They will use reasoning, justifications, and visualizations to explain their decisions. During this unit students will use Google Sheets, Google Data Commons, and Tableau.
Unit 7: Predicting My Preferences
Week 1: How can I build machine learning algorithms that will make predictions on whether I like a song? - Algorithms and Machine Learning
Week 2: How can I build machine learning algorithms that will make predictions on whether I like a song? - Modelling
Week 3: How can I build machine learning algorithms that will make predictions on whether I like a song? - Conditional Probabilities
Week 4: How can I build machine learning algorithms that will make predictions on whether I like a song? - Analysis and Data Ethics
Week 5: How can I build machine learning algorithms that will make predictions on whether I like a song? - Communication
In this unit, students will be introduced to the big ideas behind machine learning. They will build two different machine learning algorithms to make predictions on whether they will like a song. In this process they will learn about using vectors and matrices as data structures as well as applying conditional probability and exercising their basic programming abilities. Students will also consider how machine learning impacts their lives and others’ lives and will share their newly gained understandings of machine learning with a member of their community. During the unit, students will work in Colab and Edublocks.
Unit 8: Being a Data Scientist
Week 1: How can I use the full cycle of data science to pose and solve a guiding question that interests me? - Asking Questions - Starting the final project
Week 2: How can I use the full cycle of data science to pose and solve a guiding question that interests me? - Gathering Data - Planning the final project
Week 3: How can I use the full cycle of data science to pose and solve a guiding question that interests me? - Modeling and Analyzing - Completing the final project
Week 4: How can I use the full cycle of data science to pose and solve a guiding question that interests me? - Communication - Presenting the final project
Being a Data Scientist Final Project Overview
Being a Data Scientist Final Project Criteria Feedback Rubric
This unit will bring together all that the students have been working on. Students will have an opportunity to work through the full cycle of data science: making their own decisions about the questions they are interested in exploring, finding data to answer that question, cleaning the data, creating and analyzing a model, communicating with the data visually and reflecting on their process. This will be an iterative process mirroring how data scientists work on a project. Students will gather their own data. They will make decisions about how to work with it and describe the choices they have made including what technology tools to use, cleaning moves, visualization selection, univariate or bivariate data choices, combining data, and other content relevant to their project of choice.
Parents can find more information of the standards assessed in each Broad Learning Category on PowerSchool.
All assessed standards can be found here.
Students can expect the following from the teacher concerning the following:
Come to class prepared with all necessary course materials, this includes a graphing calculator (Ti-83/84/89/N-spire).
Check Google Classroom on a daily basis for assignments, documents, etc. Contact teacher if unable to access it.
Master standards relevant to each unit of study.
Complete and submit assessments on time.
Work effectively in collaborative groups.
Contribute positively to the classroom and class discussions.
Show kindness, open-mindedness, and respect to peers.
Seek appropriate help when needed.
Follow Check-in retake procedures. Submit all homework, book appointment with your teacher and wait for it to be accepted.
Leave your space tidy when you exit the room.
Students can expect the following from the teacher concerning the following:
GOOGLE CLASSROOM - All assignments will be posted in Google Classroom. If you are having trouble locating it, contact the teacher.
RUBRICS AND POLICIES - Once posted here they should not change. However, if a change is necessary the teacher will inform students well in advance and will clearly mark the changes.