# STATISTICS AND RESEARCH METHODS

A course in statistics using spreadsheets and resampling, in the context of quantitative research methods and reasoning.

Some statistical techniques and research methods are widely used (in the life sciences, at least). Understanding the most common of them could be useful for students in many fields of science.

As a part of an undergraduate course on statistics and research methods, I have created a set of 22 activities to help guide students through the process of quantitative research. The activities provide a structured course format that can potentially increase engagement and learning of the course material (Eddy and Hogan, 2017). The activities are organized into three main categories:

SECTION 1. WHY do we need probability and statistics to help us make decisions?

SECTION 2. WHAT are scientific models? How can data lead to scientific understanding?

SECTION 3. HOW can we design research to help make robust discoveries?

A syllabus that provides more detail about how the activities were implemented in the context of a course is shown below. An example of one of the activities (on normal distributions) is also provided below.

The activities are consistent with the Guidelines for Assessment and Instruction in Statistics Education (GAISE) recommendations to:

1. Teach statistical thinking.

   a) Teach statistics as an investigative process of problem-solving and decision making.

   b) Give students experience with multivariable thinking.

2. Focus on conceptual understanding.

3. Integrate real data with a context and purpose.

4. Foster active learning.

5. Use technology to explore concepts and analyze data.

6. Use assessments to improve and evaluate student learning.

Some of the main objectives of my approach to statistics and research methods are:

A) To encourage students to construct knowledge about statistics and research methods based on an understanding of the reasons and principles that have led to research practices.

B) To encourage active learning using real data. A primary goal (particularly for online learning) is to make statistics and research methods a “hands-on” course. I use several methods to encourage active learning.

C) To integrate statistics into a broader context of research methods and scientific reasoning. Statistics is only one link in a chain of scientific reasoning. The activities place statistical methods within the larger context of reasoning and science.

I have tried to make the activities accessible and conversational. I have tried to incorporate extensive repetition of important concepts throughout the activities (in my experience, repetition is essential for learning). I have tried to structure the activities so that students “discover” many of the important concepts through their own problem solving.

Do they work? I have only anecdotal evidence. My sense is that the class experience is challenging and intense, but that students gain an understanding of statistical and research concepts and successfully learn how to set up and solve problems using spreadsheet functions. However, the course material is very challenging and, in my experience, requires the full 9 hours per week expected for a 3-unit course. Of course, everything is a work in progress. In the future, I hope to port the activities to a platform such as Jupyter Notebook, which could allow for direct assessment of effectiveness.
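For illustration only, a resampling exercise of the kind the course is built around might look roughly like this once ported to a notebook. This is a hypothetical sketch and is not taken from the course activities: the function name `bootstrap_mean_interval`, its parameters, and the measurements are all invented for the example, and the percentile bootstrap is just one simple resampling approach.

```python
import random
import statistics


def bootstrap_mean_interval(sample, n_resamples=10_000, level=0.95, seed=0):
    """Percentile bootstrap interval for the mean of `sample` (illustrative helper)."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    n = len(sample)
    means = []
    for _ in range(n_resamples):
        # Draw n values from the sample WITH replacement, then record the mean.
        resample = rng.choices(sample, k=n)
        means.append(statistics.fmean(resample))
    means.sort()
    # Cut off an equal share of resampled means in each tail.
    tail = (1.0 - level) / 2.0
    lower = means[int(tail * n_resamples)]
    upper = means[int((1.0 - tail) * n_resamples) - 1]
    return lower, upper


# Invented measurements, for illustration only.
heights_cm = [162.0, 171.5, 158.3, 180.2, 167.8, 175.1, 169.4, 164.9, 177.6, 170.3]

low, high = bootstrap_mean_interval(heights_cm)
print(f"sample mean: {statistics.fmean(heights_cm):.1f} cm")
print(f"95% bootstrap interval for the mean: {low:.1f} to {high:.1f} cm")
```

The same steps (resample with replacement, recompute the statistic, repeat many times, read off the percentiles) can be carried out with spreadsheet functions; the notebook form would mainly make each step easier to inspect and to check automatically.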

Another disclaimer: I am NOT a statistician. There may be errors in the activities (hopefully minor ones ;-). Some of the terminology is inconsistent and needs to be standardized. Moreover, nothing has been copy-edited by anyone but me, so there are formatting inconsistencies and similar problems that need to be addressed. I am thankful to other authors, such as Danielle Navarro, whose excellent open-access materials on statistics provided inspiration.

All of the activities can also be found in the "Book Version" of Research Methods/Reasoned Writing (although the book version is not quite as up to date as the activities in the syllabus).

I am willing to share the entire course with almost anyone. However, I am not willing to share my materials with institutions that use discriminatory hiring or enrollment practices (e.g. institutions that require religious affirmations or other prejudiced policies). Your institution wants to be exclusive? You’ve excluded yourself, sorry. If your institution requires religious affirmations for employment or enrollment, or discriminates against other groups, please do NOT use any of the materials on this site.

Most weeks, students complete two activities, which are discussed during synchronous lecture/discussion periods. An example of one of the course activities:

RM_HYPOTHESIS_TESTING_01.pdf

SECTION 1: STATISTICAL RESEARCH METHODS

1) ESTIMATING PROBABILITIES

3) COGNITIVE BIASES

4) POPULATIONS, SAMPLES, AND RESAMPLING

5) PROBABILITY

6) CONDITIONAL PROBABILITY

7) REASONING

8) SCIENTIFIC MODELS AND PREDICTIONS

9) MEASUREMENTS

10) SAMPLES AND POPULATIONS

11) DESCRIPTIVE STATISTICS

12) FREQUENCY AND PROBABILITY DISTRIBUTIONS

13) HYPOTHESIS TESTING

14) CUMULATIVE DISTRIBUTION FUNCTIONS

15) THE NORMAL DISTRIBUTION

16) CONFIDENCE INTERVALS

17) Z TESTS AND T TESTS

18) “GOODNESS OF FIT” AND CHI SQUARE TESTS

19) LOGICAL FALLACIES AND HYPOTHESIS TESTING

20) CORRELATION AND REGRESSION

21) MULTIPLE COMPARISONS

22) RESEARCH DESIGN