STATISTICS AND RESEARCH METHODS
A course in statistics using spreadsheets and resampling, in the context of quantitative research methods and reasoning.
Some statistical techniques and research methods are widely used (in the life sciences, at least). Understanding the most common of them is likely to be useful for students in many fields of science.
As a part of an undergraduate course on statistics and research methods, I have created a set of 22 activities to help guide students through the process of quantitative research. The activities provide a structured course format that can potentially increase engagement and learning of the course material (Eddy and Hogan, 2017). The activities are organized into three main categories:
SECTION 1. WHY do we need probability and statistics to help us make decisions?
SECTION 2. WHAT are scientific models? How can data lead to scientific understanding?
SECTION 3. HOW can we design research to help make robust discoveries?
A syllabus that provides more detail about how the activities were implemented in the context of a course is shown below. An example of one of the activities (on normal distributions) is also provided below.
The activities are consistent with the Guidelines for Assessment and Instruction in Statistics Education (GAISE) recommendations to:
1. Teach statistical thinking.
   a. Teach statistics as an investigative process of problem-solving and decision-making.
   b. Give students experience with multivariable thinking.
2. Focus on conceptual understanding.
3. Integrate real data with a context and purpose.
4. Foster active learning.
5. Use technology to explore concepts and analyze data.
6. Use assessments to improve and evaluate student learning.
Some of the main objectives of my approach to statistics and research methods were:
A) To encourage students to construct knowledge about statistics and research methods based on an understanding of the reasons and principles that have led to research practices. For example:
Understanding why statistics are necessary (because of cognitive biases and of logical and practical constraints).
Understanding how statistics are based on probability and counting.
Understanding the importance of statistical and scientific MODELS for research.
Understanding how general techniques such as “normalization” and “variance accounted for” can contribute to both statistics and research methods.
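To make the two general techniques named above concrete, here is a minimal sketch in Python. It is only an illustration, not the course's own material (the activities use spreadsheet functions). The function names and the small data set are made up for this example. "Normalization" is shown as conversion to z-scores, and "variance accounted for" is shown as the R² of a least-squares line.

```python
from statistics import mean, pstdev

def z_scores(values):
    """Normalize values to mean 0 and standard deviation 1 (population SD)."""
    m = mean(values)
    sd = pstdev(values)
    return [(v - m) / sd for v in values]

def variance_accounted_for(x, y):
    """Fraction of the variance in y accounted for by a least-squares line on x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    intercept = my - slope * mx
    # Total variation in y, and the variation left over after fitting the line
    ss_total = sum((b - my) ** 2 for b in y)
    ss_resid = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    return 1 - ss_resid / ss_total

# Made-up illustrative data (hypothetical, not from the course)
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 70]

z = z_scores(scores)                         # scores on a common, unit-free scale
r2 = variance_accounted_for(hours, scores)   # between 0 and 1
print([round(v, 2) for v in z], round(r2, 3))
```

Both ideas rest on the same ingredients (means and squared deviations), which is one reason they recur in both statistics and research design.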
B) To encourage active learning using real data. A primary goal (particularly for online learning) was to make statistics and research methods a “hands-on” course. I used several methods to encourage active learning:
Encouraging students to learn how to use spreadsheets (Google Sheets, although Excel could be used for most activities) instead of powerful statistical software. In my estimation, knowing how to use spreadsheets is a fundamental college skill. However, many students do not know how to use spreadsheets, and particularly how to use functions to perform calculations. Therefore, the activities are based around problem-solving with spreadsheet functions.
Introducing resampling ("bootstrapping") before introducing parametric statistics. The worksheets (and associated spreadsheets) teach students how to set up their own “experiments” using resampling to test statistical hypotheses. Using resampling allows students to actively construct their own sampling distributions, and visualize the processes that underlie statistical tests. I am indebted to Alan Garfinkel (UCLA) for introducing me to the utility of resampling statistics for understanding.
Illustrating course concepts with real data that are currently relevant. The activities draw many of their examples from the COVID-19 pandemic, and other examples from publicly available datasets such as the 500 Cities Project.
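The resampling idea described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the worksheets' actual procedure (which is built in spreadsheets): the sample values, the function names, and the choice of a percentile interval are all hypothetical choices made for this example.

```python
import random
from statistics import mean

def bootstrap_means(sample, n_resamples=10_000, seed=0):
    """Build a sampling distribution of the mean by resampling with replacement."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    n = len(sample)
    return [mean(rng.choices(sample, k=n)) for _ in range(n_resamples)]

def percentile_interval(values, lower=2.5, upper=97.5):
    """Approximate percentile interval read off the sorted resampled values."""
    ordered = sorted(values)
    lo_idx = int(len(ordered) * lower / 100)
    hi_idx = min(int(len(ordered) * upper / 100), len(ordered) - 1)
    return ordered[lo_idx], ordered[hi_idx]

# Made-up measurements (hypothetical data)
sample = [4.1, 5.3, 3.8, 6.0, 5.5, 4.7, 5.1, 4.9]

boot = bootstrap_means(sample)
ci_low, ci_high = percentile_interval(boot)
print(f"sample mean = {mean(sample):.3f}, 95% interval = ({ci_low:.3f}, {ci_high:.3f})")
```

Each resample is the same size as the original sample and is drawn with replacement, so the list `boot` is a sampling distribution that students construct themselves and can plot as a histogram.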
C) To integrate statistics into a broader context of research methods and scientific reasoning. Statistics is only one link in a chain of scientific reasoning. The activities place statistical methods within the larger context of reasoning and science. For example:
Understanding why science involves so much “negativity” (e.g., why null hypotheses must be rejected), and why logic requires this somewhat counter-intuitive form of reasoning.
Understanding how statistical hypotheses relate to research hypotheses (research hypotheses being both general scientific models and measurable predictions).
Analyzing the mathematics of basic statistics to discover that statistics is based on comprehensible principles of counting and algebra.
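As a small illustration of how a statistical test can reduce to counting, the sketch below computes the probability of an outcome under a null hypothesis by counting equally likely sequences. The coin-flip scenario and the function name are hypothetical, chosen only for this example. The null hypothesis is that a coin is fair, and the question is how often 10 flips would produce at least 9 heads.

```python
from fractions import Fraction
from math import comb

def prob_at_least(k, n):
    """P(at least k heads in n fair flips), by counting equally likely sequences."""
    # comb(n, i) counts the sequences with exactly i heads; 2**n counts all sequences
    favorable = sum(comb(n, i) for i in range(k, n + 1))
    return Fraction(favorable, 2 ** n)

p = prob_at_least(9, 10)
print(p, float(p))  # 11/1024, about 0.0107
```

Only 11 of the 1024 possible sequences contain 9 or more heads, so this result would be rare if the coin were fair. That rarity is the one-sided basis for rejecting the null hypothesis.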
I have tried to make the activities accessible and conversational. I have tried to incorporate extensive repetition of important concepts throughout the activities (in my experience, repetition is essential). I have tried to structure the activities so that students “discover” many of the important concepts through their own problem solving.
Do they work? I have only anecdotal experience. My sense is that the class experience is challenging and intense, but the students gain an understanding of statistical and research concepts, and successfully learn how to set up and solve problems using spreadsheet functions. Of course, everything is a work in progress. In the future, I hope to port the activities to a platform such as Jupyter Notebook, which could allow for direct assessment of effectiveness.
Another disclaimer. I am NOT a statistician. There are probably errors in the activities (hopefully minor ones ;-). Some terminology is inconsistent and needs to be standardized. Moreover, nothing has been copy-edited by anyone but me, so there are formatting inconsistencies and similar issues that still need to be addressed. I will provide MS Word format documents so that people can adapt them to their own needs. I am thankful to Danielle Navarro for providing excellent open-access materials on statistics, which provided inspiration.
Just in case others are interested in using the approach (particularly those who are tasked with leading synchronous/asynchronous online classes and who do not want to record even more lectures for students to watch), I am willing to share the entire course (MS Word format). A sample of the course activities is below. All of the activities can also be found in the "Book Version" of Research Methods/Reasoned Writing.
I am willing to share the entire course with almost anyone. However, I am not willing to share my materials with people at institutions with discriminatory hiring or enrollment practices (e.g., institutions that require religious affirmations or have other prejudiced policies). Your institution wants to be exclusive? You’ve excluded yourself, sorry. If that describes your institution, please do not use any of the materials on this site.
For anyone else interested, send me a direct message (email@example.com) and I can provide a link to all the course materials.