Bivariate data is the technical name for numerical data consisting of to variables organized into pairs of values. When we are analyzing bivariate data we are interested in determining whether there is a relationship between the two variables..
We may, for example, conduct an experiment where we measure how much time a person spends lifting weights in a week and how many pull ups they can do in a row. The two variables here are time spent lifting weights and number of pull ups. We can describe these as the independent variable and the dependent variable.
Independent variable - can be changed freely, does not depend on any other variables
Dependent variable - changes as a result of the independent variable, depends on the value of the independent variable
In our example a person can freely choose how much time they will spend lifting weights, so that is the independent variable. The number of pull ups they are able to do depends on how much weight lifting they do so the number of pull ups is the dependent variable. See the table below for some data collected on these variables.
|Time lifting weights (in minutes)||Number of pull ups|
Notice from the table above, it seems like the more time spent lifting weights, the more pull ups a person is able to do, but it is often useful to display data like this graphically to see if that is really what is happening. Each data pair represents a point on the graph. We plot the data points with the value of the independent variable on the horizontal ($x$x) axis and the value of the dependent variable on the vertical ($y$y) axis. In our example the point $\left(45,12\right)$(45,12) would represent a person who lifts weights for $45$45 minutes and is able to do $12$12 pull ups.
An experiment or observational study is typically designed to investigate a relationship between two variables, but just because there appears to be a relationship (or correlation) does not mean that a change in one variable causes a change in the other. For instance, it may seem like a person is able to do more pull ups because they spent more time lifting weights but it could actually be because they are smaller, and therefore weigh less. We can say that there is a correlation but not necessarily a causation between the variables.
Consider the following variables:
Which of the following statements makes sense?
Which is the dependent variable and which is the independent variable?
For the following sets of axes, which have the variables placed in the correct position? Select all the correct options.
The scatter plot shows the relationship between sea temperature and the amount of healthy coral.
Which variable is the dependent variable?
Which variable is the independent variable?