CanadaON
Grade 12

Connecting boxplots and histograms

Lesson

We have already seen how data can be displayed in histograms and in box plots. Both displays help us identify key features of the data, such as its shape and range. A box plot also shows the median and the interquartile range.

We should therefore expect the shape of the data to be the same whether it is represented in a frequency polygon, a box plot or a histogram. Remember that the shape of data can be symmetric, left skewed or right skewed.

Symmetric 

 

 

 

Positively skewed (also called skewed right)

 

Negatively skewed (also called skewed left)

Looking at the diagrams above, can you see the similarities in the representations?

We can see the skewed tails, where the bulk of the data sits, and the general shape. These are some of the features you can use to match histograms and box-and-whisker plots. You can also compare the range of the data.
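The idea of "looking at the tails" can also be expressed numerically. The following is a rough sketch only: it compares how far the minimum and maximum sit from the median and labels the shape accordingly. The function name `classify_shape`, the `tolerance` value of 0.2 and the sample numbers are all illustrative assumptions, not part of the lesson.

```python
def classify_shape(minimum, median, maximum, tolerance=0.2):
    """Roughly classify the shape of data from three box plot values.

    Assumption: a tail counts as 'long' when the difference between the
    two sides is more than `tolerance` (an arbitrary cut-off) of the range.
    """
    left_tail = median - minimum    # distance from the median to the left end
    right_tail = maximum - median   # distance from the median to the right end
    data_range = maximum - minimum
    if data_range == 0:
        return "symmetric"          # all values equal, so there is no tail
    imbalance = (right_tail - left_tail) / data_range
    if imbalance > tolerance:
        return "right skewed"       # longer tail on the right
    if imbalance < -tolerance:
        return "left skewed"        # longer tail on the left
    return "symmetric"


# Illustrative (made-up) box plot values: minimum, median, maximum.
print(classify_shape(10, 30, 90))
print(classify_shape(10, 70, 90))
print(classify_shape(30, 50, 70))
```

A check like this only supports what you see in the diagrams. The position of the box and the bulk of the histogram should still be compared by eye.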

Let's see if you can match histograms to their correct box plot representation.

 

Example

Match the box plots and histograms together. 

To identify matching displays, start by looking for tails (left or right) and for symmetric data.

  • A and 3 both have tails to the right, so both are right skewed. They are a match.
  • C and 2 both have tails to the left, so both are left skewed. They are a match.
  • That leaves B and 1, which both show symmetric data, so they are a match.

Worked Examples

Question 1

Match the bar graph shown here to the correct box plot.

A bar graph has an x-axis ranging from 0 to 90 marked in intervals of 10, and a y-axis ranging from 0 to 10 marked in major intervals of 5 and minor intervals of 1. The first bar is at $x=10$, and has a height of $3$. The second bar is at $x=20$, and has a height of $7$. The third bar is at $x=30$, and has a height of $9$. The fourth bar is at $x=40$, and has a height of $3$. The fifth bar is at $x=50$, and has a height of $2$. The sixth bar is at $x=60$, and has a height of $0$. The seventh bar is at $x=70$, and has a height of $1$. The eighth bar is at $x=80$, and has a height of $1$. The ninth bar is at $x=90$, and has a height of $1$.
  1. A box plot has a horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The minimum value starts at $30$, the left side of the box is at $40$, the median value is at $50$, the right side of the box is at $60$, and the maximum value ends at $70$.
    A

    A box plot has a horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The minimum value starts at $10$, the left side of the box is at $20$, the median value is at $30$, the right side of the box is at $40$, and the maximum value ends at $90$.
    B

    A box plot has a horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The minimum value starts at $10$, the left side of the box is at $60$, the median value is at $70$, the right side of the box is at $80$, and the maximum value ends at $90$.
    C

    A box plot has a horizontal axis ranging from 10 to 90 marked in major intervals of 10 and minor intervals of 5. The minimum value starts at $10$, the left side of the box is at $20$, the median value is at $50$, the right side of the box is at $80$, and the maximum value ends at $90$.
    D
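One way to check a match numerically is to rebuild the data from the bar heights described in this question and compute its five-number summary. The sketch below makes two assumptions: each bar's x-position is treated as the data value itself (rather than a class interval), and the quartiles are found as the medians of the lower and upper halves of the data. Other quartile conventions can give slightly different values. The name `five_number_summary` is illustrative.

```python
from statistics import median

# Bar heights as given in the description of the bar graph: value -> frequency.
frequencies = {10: 3, 20: 7, 30: 9, 40: 3, 50: 2, 60: 0, 70: 1, 80: 1, 90: 1}


def five_number_summary(freqs):
    """Return (minimum, Q1, median, Q3, maximum) for a frequency table."""
    # Expand the frequency table into a sorted list of individual values.
    values = sorted(v for v, count in freqs.items() for _ in range(count))
    n = len(values)
    half = n // 2
    lower = values[:half]            # values before the median position
    upper = values[half + n % 2:]    # values after the median position
    return (values[0], median(lower), median(values), median(upper), values[-1])


print(five_number_summary(frequencies))  # (10, 20, 30, 40, 90)
```

Under these assumptions there are 27 data values, with a minimum of 10, a lower quartile of 20, a median of 30, an upper quartile of 40 and a maximum of 90. These five values can be compared directly with the four box plots.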

Question 2

Match the box plot shown to the correct bar graph.

[Figure: box plot on a horizontal axis marked from 10 to 90 in intervals of 10]

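  1. [Figure: bar graph with an x-axis marked from 10 to 90 in intervals of 10 and a y-axis marked at 5 and 10]

    A

    [Figure: bar graph with an x-axis marked from 10 to 90 in intervals of 10 and a y-axis marked at 5 and 10]

    B

    [Figure: bar graph with an x-axis marked from 10 to 90 in intervals of 10 and a y-axis marked at 5 and 10]

    C

    [Figure: bar graph with an x-axis marked from 10 to 90 in intervals of 10 and a y-axis marked at 5 and 10]

    D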

Question 3

Consider the following pairs of histograms and box plots:

  1. Which two of these histograms and box plots are correctly paired?

    A
    B
    C
    D
  2. In part (a) we determined that the following histogram and box plot were an incorrect match:

    Which two of the options correctly describe why?

    The box plot has a long tail to the right which indicates positive skew, while the histogram does not appear to be skewed.

    A

    The data in the histogram is widely spread, while the box plot indicates that the data is mostly located around the median.

    B

    The median for the histogram is roughly in the middle, while the median of the box plot is located further to the left.

    C

Outcomes

12D.D.1.3

Generate, using technology, the relevant graphical summaries of one-variable data based on the type of data provided

12D.D.1.5

Interpret statistical summaries to describe the characteristics of a one-variable data set and to compare two related one-variable data sets; describe how statistical summaries can be used to misrepresent one-variable data; and make inferences, and make and justify conclusions, from statistical summaries of one-variable data orally and in writing, using convincing arguments
