
Connecting box plots and histograms

Lesson

We have already seen how data can be displayed in histograms and in box plots. Both displays are useful for identifying key features of the data: its shape and its range and, in the case of the box plot, the interquartile range and the median.

We should therefore expect the shape of the data to be the same whether it is represented in a frequency polygon, a box plot or a histogram. Remember that the shape of data can be symmetric, left skewed or right skewed.

Symmetric

Positively skewed (also called skewed right)

Negatively skewed (also called skewed left)

Looking at the diagrams above, can you see the similarities in the representations?

We can see the skewed tails, where the bulk of the data sits, and the general shape. These are some of the features you can use to match histograms and box plots. You can also compare the range of the data.
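The idea of reading skew from the tails can be sketched in a few lines of Python. This is only a rough heuristic, not a standard test: the function name `describe_shape`, the `tolerance` value of 0.2 and the three sets of values are all assumptions made for illustration. It compares how far the median sits from each end of the data.

```python
def describe_shape(minimum, median, maximum, tolerance=0.2):
    """Suggest a shape from the minimum, median and maximum of a data set.

    Heuristic only: a longer stretch to the right of the median suggests a
    right tail, a longer stretch to the left suggests a left tail.
    The tolerance is an arbitrary cut-off chosen for this sketch.
    """
    spread = maximum - minimum
    if spread == 0:
        return "symmetric"
    left = median - minimum    # distance from the median down to the minimum
    right = maximum - median   # distance from the median up to the maximum
    imbalance = (right - left) / spread
    if imbalance > tolerance:
        return "right skewed"
    if imbalance < -tolerance:
        return "left skewed"
    return "symmetric"


# Illustrative (made-up) values: minimum, median, maximum
for values in [(10, 30, 90), (10, 70, 90), (30, 50, 70)]:
    print(values, describe_shape(*values))
```

A check like this only supports what the eye already sees in the plot. The position and width of the box still need to be compared with where the tall bars of the histogram sit.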

Let's see if you can match histograms to their correct box plot representation.

 

Example

Match the box plots and histograms together. 

To identify matching data, start by identifying tails (left or right) and symmetric data.

  • A and 3 both have right tails, so both are right skewed and they are a match.
  • C and 2 both have left tails, so both are left skewed and they are a match.
  • This leaves B and 1, which are both symmetric, so they are a match.

Worked Examples

Question 1

Match the bar graph shown here to the correct box plot.

A bar graph is shown with x-axis ranging from 0 to 90 marked in intervals of 10, and y-axis ranging from 0 to 10 marked in major intervals of 5 and minor intervals of 1. At x=10, the height of the bar is 3. At x=20, the height of the bar is 7. At x=30, the height of the bar is 9. At x=40, the height of the bar is 3. At x=50, the height of the bar is 2. At x=60, the height of the bar is 0. At x=70, the height of the bar is 1. At x=80, the height of the bar is 1. At x=90, the height of the bar is 1.

    A. A box plot is shown with horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The left whisker starts at 30, the left side of the box is at 40, the vertical line inside the box is at 50, the right side of the box is at 60, and the right whisker ends at 70.

    B. A box plot is shown with horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The left whisker starts at 10, the left side of the box is at 20, the vertical line inside the box is at 30, the right side of the box is at 40, and the right whisker ends at 90.

    C. A box plot is shown with horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The left whisker starts at 10, the left side of the box is at 60, the vertical line inside the box is at 70, the right side of the box is at 80, and the right whisker ends at 90.

    D. A box plot is shown with horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The left whisker starts at 10, the left side of the box is at 20, the vertical line inside the box is at 50, the right side of the box is at 80, and the right whisker ends at 90.
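One way to check a match like this is to compute the five-number summary directly from the bar heights and compare it with each box plot. The sketch below uses the bar heights and box plot values described in this question. The helper names (`expand`, `five_number_summary`) are hypothetical. The quartiles are taken as the medians of the lower and upper halves with the overall median left out, which is an assumed convention, since other conventions can give slightly different quartiles.

```python
from statistics import median

# Bar heights from the bar graph in this question: score -> frequency
frequencies = {10: 3, 20: 7, 30: 9, 40: 3, 50: 2, 60: 0, 70: 1, 80: 1, 90: 1}

# Five-number summaries read from the four box plots:
# (minimum, lower quartile, median, upper quartile, maximum)
box_plots = {
    "A": (30, 40, 50, 60, 70),
    "B": (10, 20, 30, 40, 90),
    "C": (10, 60, 70, 80, 90),
    "D": (10, 20, 50, 80, 90),
}


def expand(freq):
    """Turn a frequency table into a sorted list of individual scores."""
    values = []
    for score, count in sorted(freq.items()):
        values.extend([score] * count)
    return values


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves method."""
    values = sorted(values)
    n = len(values)
    half = n // 2
    lower = values[:half]             # scores below the middle position
    upper = values[half + (n % 2):]   # scores above it (middle score skipped if n is odd)
    return (values[0], median(lower), median(values), median(upper), values[-1])


summary = five_number_summary(expand(frequencies))
matches = [label for label, plot in box_plots.items() if plot == summary]
print("Five-number summary:", summary)
print("Matching box plot(s):", matches)
```

The same result can be reached by hand: with 27 scores, the median is the 14th score, and the quartiles are the 7th and 21st scores.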

Question 2

Match the box plot shown to the correct bar graph.

[Box plot: horizontal axis marked from 10 to 90 in intervals of 10]

[Options A to D: four bar graphs, each with a horizontal axis marked from 10 to 90 in intervals of 10 and a vertical axis marked at 5 and 10]

Question 3

Consider the following pairs of histograms and box plots:

  1. Which two of these histograms and box plots are correctly paired?

    A
    B
    C
    D
  2. In part 1 we determined that the following histogram and box plot were incorrectly matched:

    Which two of the options correctly describe why?

    A. The box plot has a long tail to the right, which indicates positive skew, while the histogram does not appear to be skewed.

    B. The data in the histogram are widely spread, while the box plot indicates that the data are mostly located around the median.

    C. The median of the histogram is roughly in the middle, while the median of the box plot is located further to the left.

Outcomes

12D.D.1.3

Generate, using technology, the relevant graphical summaries of one-variable data based on the type of data provided

12D.D.1.5

Interpret statistical summaries to describe the characteristics of a one-variable data set and to compare two related one-variable data sets; describe how statistical summaries can be used to misrepresent one-variable data; and make inferences, and make and justify conclusions, from statistical summaries of one-variable data orally and in writing, using convincing arguments
