How can we find a good starter word for Wordle?
Time: 2-3 90-minute periods.
This happens the day before and as homework to lead into part 2:
The goal is to collect data from the community about their starter words, and then to judge whether those are good words based on how frequently their letters appear in words. Have students type their data into a Google Sheet.
Begin by explaining Wordle and how it is played, so everyone knows the game. The goal of Wordle is to find the five-letter word. The player has six tries to get the word. After each guess, a letter is highlighted in yellow if it is in the word but in the wrong spot, or in green if it is in the word and in the correct spot.
Students can read about Wordle (Victor, 2022) and watch a video about it (Salie, 2022).
Homework Task: Ask adults and friends you come into contact with:
Do you play Wordle? If yes, what is your starter word? If they have two, write both down. If they say they use several, ask them for at least one or two. Please keep track of this information, as we will use it for data analysis.
As the teacher, collect more data by asking your social media contacts the same questions. From Facebook, this Google Sheet was developed. The sheet also has all the other data. Note that the words in the sheet are typed in different ways on purpose; Google Sheets will clean that up for us.
When students come in, the data needs to be collected into one spreadsheet. Have students copy their data into the master spreadsheet.
Questions to ask:
How many people played? How many starter words did we get? How can we use this information to beat the game more quickly? How can we determine if these are good starter words? What makes a good starter word? What is the most popular starter word? These are things we can answer from the data. We can see if a starter word is a good one by looking at the frequency of its letters. Does this list show we are playing the game correctly? The bolded question is what we are going to try to determine. If we have good starter words, we should be able to win in fewer turns.
Make sure everyone makes a copy of the spreadsheet so that they are working on their own, not the master. Including me!
The data needs to be cleaned and organized better than a list. This is the raw data. How can we represent this data to show the starter words? Through a frequency chart, which will give us the top words in our community. We could sort the data and then count by hand, but let the computer do the work for you: create a pivot table (Insert > Pivot table). Pivot tables help us get numerical data from categorical data like this, and they can also help group large data sets.
Check-in 1: Does the frequency chart tell us if these are good starter words? How can we get that information?
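For teachers who want to check the pivot table result outside the spreadsheet, the same count can be sketched in Python. This is only an illustration: the responses list is made-up sample data, not the class data, and trimming spaces and lowercasing are assumed stand-ins for cleaning up the different typings.

```python
from collections import Counter

# Hypothetical raw responses with inconsistent typing (not the real class data).
responses = ["Adieu", "adieu ", "STARE", "stare", "audio", "Adieu"]

def count_starter_words(raw_words):
    """Normalize each response, then count how often each word appears."""
    cleaned = [w.strip().lower() for w in raw_words if w.strip()]
    return Counter(cleaned)

word_counts = count_starter_words(responses)

# most_common() lists words from most to least popular, like a sorted pivot table.
for word, count in word_counts.most_common():
    print(word, count)
```

Each distinct word appears once with its count, which is the same information the pivot table gives.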
Have students develop a plan to determine this. Can they think about what kind of data we can use to figure it out?
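No, the frequency chart alone does not tell us whether these are good starter words. We need to look at the frequency of the letters in our words and see if it matches the frequency of letters in the most common words, similar to how a cryptologist would break a code. We can find the letter frequency by using formulas in the spreadsheet. The formula page by Prashanth (n.d.) explains how to count a letter. Because we need to count all 26 letters, we can use a row at the top for the letters. The formula then becomes =LEN($A2)-LEN(SUBSTITUTE($A2,B$1,"")). It subtracts the length of the word with the letter removed from the length of the whole word, which leaves the number of times the letter appears. The dollar signs force column A and row 1 to stay the same, since references in a formula shift when it is copied to another cell. This is called an absolute reference; for more information, see Keeler (2013). Note that SUBSTITUTE is case-sensitive, so the words and the letters in the top row must use the same case.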
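To see why the formula works, the same length-difference idea can be written as a short Python sketch. The function name count_letter is made up for this illustration and is not part of Google Sheets.

```python
def count_letter(word, letter):
    """Count a letter the way the sheet formula does:
    length of the word minus its length with that letter removed."""
    return len(word) - len(word.replace(letter, ""))

print(count_letter("eerie", "e"))  # 3
print(count_letter("stare", "z"))  # 0
```

Python's replace is case-sensitive, just like SUBSTITUTE, so "E" and "e" would be counted separately unless the word is lowercased first.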
Steps to get the frequency in Google Sheets:
Copy the words from the pivot table (which has no repeats) into a new sheet in this workbook.
In the top row, put the letters of the alphabet.
Use the formula and copy it across and down. You can click and drag the small square in the lower-right-hand corner of the cell to fill across, and double-click it to fill down.
Freeze the top row of letters.
Scroll to the bottom of the words and add totals using the SUM formula.
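The steps above can be sketched in Python as a check on the spreadsheet totals. This is a minimal sketch under stated assumptions: the three words are a hypothetical sample, and the names letter_grid and column_totals are invented for the example.

```python
import string

# Hypothetical de-duplicated starter words (the first column of the sheet).
words = ["adieu", "stare", "audio"]

# The top row of the sheet: one column per letter.
letters = string.ascii_lowercase

def letter_grid(word_list):
    """One row per word, one count per letter, like the filled-in formula grid."""
    return {w: [w.count(ch) for ch in letters] for w in word_list}

def column_totals(grid):
    """Add up each letter column, like the SUM row at the bottom."""
    return {ch: sum(row[i] for row in grid.values())
            for i, ch in enumerate(letters)}

totals = column_totals(letter_grid(words))
print(totals["a"], totals["e"], totals["z"])
```

The totals row is what the class compares against the letter frequencies of the larger word list.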
Now we need a list of 5-letter words so we can see the frequency of their letters and check whether our starter words match. How can we find this data? How can we find a good list of all five-letter words? What are the most popular letters and how many words contain these letters?
This list of 5-letter words is almost our control data. It will give us the top letter frequencies in 5-letter words (which may differ from those in all English words). We can compare the top letter frequencies in our sample to this. How does the sample data change the top 5 letters?
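One way to picture this comparison is the short Python sketch below. Both word lists are tiny made-up examples, not the class sample or a real 5-letter word list, and ties between letters are broken alphabetically as an arbitrary choice.

```python
from collections import Counter

def top_letters(words, n=5):
    """Return the n most frequent letters, breaking ties alphabetically."""
    counts = Counter("".join(w.lower() for w in words))
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [letter for letter, _ in ranked[:n]]

# Hypothetical data for illustration only.
sample_words = ["adieu", "audio", "stare"]
reference_words = ["stare", "crane", "slate", "trace", "arise"]

sample_top = top_letters(sample_words)
reference_top = top_letters(reference_words)
shared = sorted(set(sample_top) & set(reference_top))

print("Sample top 5:   ", sample_top)
print("Reference top 5:", reference_top)
print("In both lists:  ", shared)
```

Letters that appear in both top-5 lists suggest the community's starter words are using high-value letters.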
First find a data set:
Students will explore available data sets and determine the best one for the question we want to answer. Let students search Google for 5-letter words in English. Talk about the challenges of getting data from websites and why it’s best to have data in a format that is easy to import and manipulate (Excel, CSV, TXT). Provide them with the following data sets and have them choose one to use.
Check-in 2: Which one will help you get to where you want to go? Vote on which data set we should use and explain why.
(Don’t present them in this order.) In the presentation, the top two are the ones that I would use. The other two require more cleaning of the data. This is the time to discuss how data may need to be cleaned. Ask the students what is good or bad about each of these pages and about using them with a spreadsheet. The second sheet has fewer words and won’t take as much time to crunch. Have them repeat the process on the nouns with 5 letters (number 2) and then have a discussion.
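To make the cleaning discussion concrete, here is a small Python sketch of what cleaning a downloaded word list might involve. The raw text is invented for the example and does not come from any of the data sets; the rules (trim spaces, lowercase, keep only 5-letter alphabetic words, drop repeats) are assumptions about what a messy list would need.

```python
import io

# Hypothetical messy download: mixed case, blank lines, stray spaces, wrong lengths.
raw_text = "Apple\nstare\n\nhouse \nbanana\nit's\nSTARE\ncrane\n"

def five_letter_words(lines):
    """Keep unique, lowercase, purely alphabetic 5-letter words in original order."""
    seen = set()
    result = []
    for line in lines:
        word = line.strip().lower()
        if len(word) == 5 and word.isalpha() and word not in seen:
            seen.add(word)
            result.append(word)
    return result

# io.StringIO stands in for an opened .txt or .csv file.
clean_words = five_letter_words(io.StringIO(raw_text))
print(clean_words)
```

A data set that already arrives in this clean form needs none of these steps, which is one argument for choosing it.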
What story does the data tell?
Take a look at the data. Have the students look at the letter frequencies.
Do a think-pair-share about the sample size and how more words gave us different results. The amount of data in a sample can make a difference in the results. Were there any errors in our sampling procedure? (If students asked the same teachers, some of the words may have been repeated, so our popularity counts may be off, but the letter frequency wouldn’t be.) Based on this information, what words from our sample(s) would you use for the “best starter words”? How can we display our frequency charts graphically? (Have students figure out in pairs how to create a chart using Google help.)
Check-in 3: What have you learned about data? What questions do you still have about extracting knowledge from data? How comfortable are you with Google Sheets?
Write a story based on our information for the school newspaper or website. Include an explanation of the procedure that you followed to determine your results. Include your reasoning for your words and provide evidence based on the data we developed. Include some graphical representation as well.
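One possible way to pick “best starter words” from the letter frequencies is sketched below in Python. The scoring rule is an assumption made for this example, not a method prescribed by the lesson: each word earns the reference frequency of each distinct letter it contains, so repeated letters count only once. Both word lists are made up.

```python
from collections import Counter

def letter_frequencies(words):
    """Count every letter across the reference word list."""
    return Counter("".join(words))

def score_word(word, frequencies):
    """Sum the frequency of each distinct letter; repeated letters count once."""
    return sum(frequencies[ch] for ch in set(word))

# Hypothetical data for illustration only.
reference_words = ["stare", "crane", "slate", "trace", "arise"]
starter_words = ["adieu", "stare", "audio"]

freqs = letter_frequencies(reference_words)

# Highest score first; ties are broken alphabetically.
ranked = sorted(starter_words, key=lambda w: (-score_word(w, freqs), w))

for word in ranked:
    print(word, score_word(word, freqs))
```

Students may propose other rules, such as rewarding vowels or letter positions, and comparing the rankings makes a good discussion point.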
College Board. (2020). AP Computer Science Principles: Civic knowledge and action: Voter registration [PDF]. AP Central. https://apcentral.collegeboard.org/pdf/ap-computer-science-principles-voter-registration-lesson-plan.pdf?course=ap-computer-science-principles
Google. (n.d.). Create & use pivot tables. Google Help. https://support.google.com/docs/answer/1272900?hl=en&co=GENIE.Platform%3DDesktop
Keeler, A. (2013, November 18). Google Spreadsheets: Absolute cell referencing. https://alicekeeler.com/2013/11/18/google-spreadsheets-absolute-cell-referencing/
Prashanth. (n.d.). Count a specific character in Google Sheets. Infoinspired. https://infoinspired.com/google-docs/spreadsheet/count-a-specific-character-in-google-sheets/
Salie, F. (2022, January 30). Wordle, the five-letter spelling addiction [Video]. CBS News. https://www.cbsnews.com/video/wordle-the-five-letter-spelling-addiction/
United States Census Bureau. (n.d.). What is a statistical question? [PDF]. https://www2.census.gov/programs-surveys/sis/activities/math/mm-10_teacher.pdf
Victor, D. (2022, January 3). Wordle is a love story. New York Times. https://www.nytimes.com/2022/01/03/technology/wordle-word-game-creator.html
Eye on Tech. (2019, November 14). What is a pivot table and what is it used for? [Video]. YouTube.