6 Correlation

One of the core goals in statistics is to calculate the strength of the relationship between two variables. That goal can be applied to a broad diversity of research questions.

Say you did an experiment to test the effect of temperature on plants and say you have plantss growing at different temperatures. In this case, you probably would like to relate temperature to a dependent variable like say plant size.

Instead, you may be interested to see how rain affects the common cold, in which case you may wish to relate the amount of daily rainfall and the daily number of hospitalizations by cold….

The possibilities are endless…but at the end of the day…all of them come down to simply assessing the strength of the relationship between two variables. For this specific purpose, we use the correlation and/or the regression analysis.

.

Expectation for this chapter

At the end of this chapter, it is expected that you:

  1. Can create a publication quality R scatterplot to visualize the relationship between two variables.

  2. Define the type of relationship between two variables (e.g., positive, negative, non-linear, non-existent).

  3. Calculate the coefficient of correlation between two variables and understand what it tells you about the relationship between the two variables.

Alright, a lot to cover, so let’s get started.