Data & Analysis with Computer Science

How Do We Better Understand Data?

Data and Analysis is both a Science and Engineering Practice in the California NGSS Science Standards and a branch of the California Computer Science Standards.   

Analyzing and Interpreting Data in Science

Science and Engineering Practice 4 is Analyzing and Interpreting Data.  The California Science Framework, published in 2016, has an appendix 3 about computer science in NGSS which helps to clarify how computer science can enhance science education. While we can engage students in better understanding data through generating models and looking at the data they produce, we can also use computer science as a tool to help us both collect and better understand data gathered from the real world.  

Data and Analysis in Computer Science

Using Computer Science to Gather Data

Coding Circuit Boards

One circuit board that is particular powerful as a sensor and easy to use to gather data is the Circuit Playground Express.  Its built in sensors allow for automated gathering of data.  The breadboard has a sound sensor, light sensor, accelerometer, and more.  Rather than gathering 3 or for data points to draw a fast graph with, students can act like real modern scientists and program the device to gather hundreds of values.  Students can even gather multiple values at once to find the relationship between values.

In the example below temperature values are being gathered and the Circuit Playground Express is adding them into a spreadsheet.

Google Data Studio

Using Computer Science to Process Data

Using Events to Capture Data in Google Sheets

Every Google Doc is actually a website.  More accurately it is a webapp.  That means that it is a webpage with the ability for data to be processed in the cloud.  That is how you and your friend can both be writing  in a doc, and how you can install plugins. We can us a little bit of a language from Google called apps script to run some code every time that  a device adds an entry into a Google Sheet. 

The example here shows a timer running in the background of a spreadsheet and adding an entry when the timer adds t to indicate that a car that a student built has tripped a sensor.  It is worth pointing out that we could have just used block code run on the circuit playground to gather time data from a specific point.  One added value of this implementation is that even if several students had timers that they each started, because the timing is being run on the cloud, there is a single viewpoint and control on the data.  That means one student cannot do any shenanigans with their controller (like waiting a moment before they press start) to get a lower time.

For more ways this could be used, because this is built into the cloud, you could improve automation by having your sensor to email students whenever something outside or inside a specific boundary occurs so that they can check their setup.   You could even have the data being analyzed realtime, with the spreadsheet noting when data was gathered that fit certain criteria.  

Using AI to Gather Data in Google Sheets

Using Computer Science to Visualize Data

Coding in R

Coding in Python with Jupyter Notebook

Jupyter Notebooks are a powerful tool used in data science.  They are often coded in python.  One easy way to get started using a jupyter notebook is to use Google Colab

Coding in Apps Script with Google Sheets