In statistics, bivariate data is data on two variables, where each value of one of the variables is paired with a value of the other variable.
We can analyze bivariate data by looking for an association between the two variables.
The analysis of bivariate data should include:
A scatterplot can be used to display bivariate data once the independent and dependent variables are defined.
The correlation coefficient, r, is a statistic that can describe both the strength and direction of a linear association.
It is important to be able to distinguish between causal relationships (when changes in one variable cause changes in the other variable) and correlation where the two variables are related, but one variable does not necessarily influence the other.
Determine whether the following statement is true or false:
"There is a causal relationship between number of cigarettes a person smokes and their life expectancy"
A study was conducted to find the relationship between the age at which a child first speaks and their level of intelligence as teenagers. The following table shows the ages of some teenagers when they first spoke, and their results in an aptitude test:
Age when first spoke (months) | 14 | 27 | 9 | 16 | 21 | 17 | 10 | 7 | 19 | 24 |
---|---|---|---|---|---|---|---|---|---|---|
Aptitude test results | 96 | 69 | 90 | 101 | 87 | 92 | 99 | 104 | 93 | 97 |
Create a scatterplot to model the data.
Estimate the correlation coefficient and describe the association between the variables.
Determine if there is enough evidence to suggest a causal relationship between the age when a child first speaks and their intelligence as teenagers.