Module 09

Correlation

Introduction

  • T-tests and ANOVA allow us to evaluate the relationship between nominal IV(s) and a continuous DV.

  • But they cannot be used when both the IV and DV are continuous (e.g., how happiness score depends on the amount of chocolate one eats every day).

  • How do we measure and evaluate the relationship between two continuous variables?

1. What is Correlation?

Continuous variables in a study can be interrelated with one another, so that they increase/decrease together, or change in opposite directions.

One way to assess the relationship between two continuous variables is correlation. It is a measure of the extent to which two variables are related to each other.

A correlation coefficient is a standardized measure of the relationship between two variables. The most commonly used correlation coefficient is the Pearson’s r. There are other types of correlation coefficients for other purposes, such as Spearman and Kendall's tau-b. In this module, we focus on Pearson’s r (or, most frequently, it's simply known as r). Pearson’s r ranges from -1.0 to +1.0.

Whenever we look at the Pearson’s r value between two variables, we look at two aspects:

  • sign (positive or negative): If r > 0, it means that the two variables tend to increase or decrease together. This is known as a positive correlation, in which the two variables tend to change in the same direction. If r < 0, it means that the two variables tend to change in opposite direction. In other words, when one variable increases, the other tends to decrease, and vice versa. This is known as a negative correlation.

  • magnitude (the size of the number, regardless of the sign): Values of r that are closer to +1 or -1 (i.e., of a larger magnitude) represent stronger correlation between the two variables. Conversely, values of r that are closer to 0 (i.e., a smaller magnitude) represent weaker correlation between the two variables. The strongest possible correlation is either +1 or -1. The weakest possible correlation is 0.

2. Example 1: Correlation between "SNS" and "BSC"

For example, we want to know what might be related to people’s excessive time spent on social media. One of the possible factors might be self control. So we need to test the relationship between the time people spent on social media and their level of self control.

Q: How do we test the relationship between “SNS” and “BSC”?

A: We use the “Correlation Matrix” under “Regression” in jamovi.

Example 9.1_Correlation_SNS_BSC.mp4

Conclusion/ Interpretation (APA format):

  • Results of the correlation analysis showed that there was a negative linear relationship between the number of hours people spent on social media and self control (r = -.476, p < .001).

  • The results suggested that a negative relationship was found between the number of hours people spent on social media and self control (r = -.476, p < .001).

3. Example 2: Correlation between "PerAttract" and "NumRel"

Another example of a correlational study would be the relationship between perception of attractiveness and number of romantic relationships. Some people claimed that the number of romantic relationships may be related to how people perceive themselves, such as attractiveness. We want to know whether such relationship does exist or not.

Q: How do we test the relationship between “PerAttract” and “NumRel"?

A: We use the “Correlation Matrix” under “Regression” in jamovi.

Example 9.3_Correlation_PerAttract_NumRel.mp4

Conclusion/ Interpretation (APA format):

  • Results of the correlation analysis showed that there was a strong positive linear relationship between the perceived attractiveness and the number of romantic relationships (r = .708, p < .001).

  • The results suggested that a positive relationship was found between the perceived attractiveness and the number of romantic relationships (r = .708, p < .001).

4. Effect Size interpretation

Same as t-test and ANOVA, we can measure the effect size in correlation analysis. Indeed, we can take the size of the correlation coefficient (e.g., Pearson correlation coefficient r) as a measurement of effect size. Generally, it is used when both variables are continuous (interval or ratio scales). Theoretically, the range of r is from negative one to positive one (-1 to +1). E.g., if r = .47, it means that the variable X and variable Y have a large positive association.

Below is a table for judging the size of r in correlation analysis in general.

Module Exercise

Complete the exercise!

    • Now, if you think you're ready for the exercise, you can check your email for the link.

    • Remember to submit your answers before the deadline in order to earn the credits!