topic badge

Investigation: The problem with "average"

Overview
Activity 1
Activity 2
Reflection

Investigate mean and median and decide which is a better measure of center for certain situations.

Objectives
  • To investigate mean and median in real life.
  • To determine the best measures of center based on specific scenarios.
  • To understand when statistics are misleading.
Materials
  • Pen or pencil
  • Paper
  • Calculator (optional)

The problem with average

Understanding averages
Mean scores are often called "averages" in everyday life. Averages are used a lot, both inside and outside the classroom.
For example, when your teacher gives you your test scores back, they often tell you the "average" score so that you can get a sense of how you performed compared to the rest of your class.

Let's look at one example of this. Let's say that your math class has 10 students in it, and these are everyone's test scores:

A table with 2 columns titled Student and Score and with 11 rows. Speak to your teacher for more information.

Let's calculate the average mark for this class, using the mean. To do this, we add all the scores up, and then divide by 10.

\displaystyle \text{Mean}\displaystyle =\displaystyle \dfrac{70+65+60+55+50+61+57+48+72+63}{10}
\displaystyle =\displaystyle 60.1

Since the class average was around 60, and your score is 63, you'd probably be feeling pretty good. You're above average.

However, the next day let's say a new student comes to your class. That student is Terence Tao, an Australian-American mathematician who has been described as the smartest person in the world.

When Terence was 8 years old, he was teaching calculus to high school students, and he started university when he was just 14 years old. In 2014, he won a \$3 million prize for his groundbreaking discoveries in mathematics. Needless to say, Terence would find your test fairly easy.

Let's say everyone gets the same score in the next test, except Terence, who gets 100. Now how would your classes scores look?

A table with 2 columns titled Student and Score and with 12 rows. Speak to your teacher for more information.

Now what is the average? Let's add them up and see.

\displaystyle \text{Mean}\displaystyle =\displaystyle \dfrac{70+65+60+55+50+61+57+48+72+63+100}{11}
\displaystyle =\displaystyle 64.6

Now your score of 63 is below average. Oh no. But really, if we think about it, it doesn't make sense to be disappointed. You did just as well in the test, and just because there is a genius in same class as you, it shouldn't change how you look at your mark.

These kinds of situations, where one abnormally high result changes the mean significantly, are called "outliers". They are one of the biggest problems with using the mean as an average.

Investigate
Consider the following questions once you have completed the above procedure.
1.
What would the median be for your original class of 10 people?
2.
What would the median be for the class with Terence in it?
3.
How does the change in the median when Terence was added compare with the change in the mean?
4.
If a plastic mannequin joined your class scored 0 on the test, what would be the mean for this class of 12?
5.
How many additional mannequins would you need to make all the human student's scores be above the class mean?
6.
How many mannequins do you think you would need for this to work in your real class at school?

The problem with average

Imagine a country where the average income is over \$1000000 (US) per year. Would you want to live there? Sounds pretty good, doesn't it.

However, if this "average" is the mean, you have to be careful with this statistic. For example, let's say we have a country made up of 1000000 people. 999900 of these people are incredibly poor. Since the UN defines poverty as anyone living on less than \$1 per day, which is \$365 per year, we'll say that these people all make \$300 per year.

The other 100 of these people are all as rich as Bill Gates, one of the richest people in the world, having around \$96500000000 (that's 96.5 billion US dollars) each.

Normally, what you expect is a nice smooth distribution where there are a few people with very low incomes, lots of people in the middle and a few people with very high incomes, where the mean lies in the middle, like this:

The graph showing a smooth distribution  with label Number of people in the y-axis and Thing being measured in the x-axis.. Speak to your teacher for more information.

However, in a bimodal distribution, you end up with two "peaks" of people, like this:

The graph showing a bimodal distribution with label Number of people in the y-axis and Thing being measured in the x-axis.. Speak to your teacher for more information.

In this case, the mean lies in-between the two groups. Since in this case no-one actually has the "mean" as their income, it seems silly to call it "average".

Investigate
Consider the following questions once you have completed the above procedure.
1.
What would the mean income for the country above be?
2.
What is the median of this country?
3.
Would you still want to live there?

Discuss your responses to the previous questions with a classmate, then answer the questions below.

Discussion
1.
Would adding mannequins work in Activity 1 if your teacher uses the median as the average?
2.
Is the mean, median or mode the best measure of center? Explain your choice.
3.
Do you think there are any actual countries like the one in Activity 2?
4.
Is there anyone who actually has the mean as their income?

Outcomes

6.11a

Represent the mean of a data set graphically as the balance point

What is Mathspace

About Mathspace