Univariate Data

Hong Kong

Stage 4 - Stage 5

Lesson

So we have already seen how data can be displayed in histograms and in box plots. These two displays are great for being able to identify key features of the shape of the data, as well as the range and in the case of the box plot the inter-quartile range and the median.

We should expect then that the shape of the data would be the same whether it is represented in a polygon, box plot or histogram. Remember that the shape of data can be symmetric, left skewed or right skewed.

Looking at the diagrams above, can you see the similarities in the representations?

We can see the skewed tails, where the bulk of the data sits and general shape. These are some of the features you can use to match histograms and box-and-whisker plots. You can also look at the data range.

Let's see if you can match histograms to their correct box plot representation.

Match the box plots and histograms together.

To identify matching data start by identifying tails (left or right) and symmetric type data.

- I can see, that A and 3 have right tails, and thus are both right skewed. So they are a match.
- C and 2 have left tails, and thus are both left skewed and so are a match.
- Which leaves B and 1, Which are both symmetric data.

Match the column graph shown here to the correct box plot.

- 102030405060708090A102030405060708090B102030405060708090C102030405060708090D

Match the box plot shown to the correct column graph.

10

20

30

40

50

60

70

80

90

- ABCD

Consider the following pairs of histograms and box plots:

Which two of these histograms and box plots are correctly paired?

ABCDIn part (a) we determined that the following histogram/box plot were an incorrect match:

Which two of the options correctly describe why?

The box plot has a long tail to the right which indicates positive skew, while the histogram does not appear to be skewed.

AThe data on the histogram is widely spread, while the box plot indicates that the data is mostly located around the median.

BThe median for the histogram is roughly in the middle, while the median of the box plot is located further to the left.

C