topic badge

9.01 Sampling techniques

Introduction

When collecting data we have to decide if we need to conduct a census, which means that data will be collected for every individual, or a sample . A sample is a group of people or objects that are taken from a larger population for measurement. Let's explore different ways to collect a sample.

Sampling techniques

An important thing when taking a sample is ensuring that the group is representative of the entire population. In other words, we want to make sure there is no bias that may affect our results. There is bias if it includes sampling or selecting based on age, gender, or interests.

A sampling method is biased when one or more members of the population have an increased chance of being selected compared to the rest of the population. A biased sample is one in which some members of the population have a higher or lower sampling probability than others.

When we are convenience sampling we sample a group or set of objects because it is easy to do. For example, you may want to sample a group of people about the type of public transportation they take, and stand outside of a bus station to gather data. Could this lead to any biases in the data?

In a simple random sampling every person or object has the same probability of being chosen. One example would be numbers being drawn out in the lottery. Every number has an equal chance of being chosen. This is an example of unbiased sampling.

Stratification is the process of dividing a group into subgroups with the same characteristics before we draw our random sample. Then we look at the size of each subgroup as a fraction of the total population. The number of items from each subgroup that are included in the sample should be in the same ratio as the amount they represent of the total population.

No person or object should fit into more than one subgroup, and no group of the total population should be excluded.

A pack of jelly beans.

Think of a pack of jelly beans. There are lots of different colors in the pack aren't there? Instead of considering them as a whole group of jellybeans, we could divide them up by color into subgroups.

For example, we decide to sample 50 bags of jelly beans. Here is a list of how many jelly beans we have of each color and how we would calculate the number of jelly beans we would need to collect to create a stratified sample:

ColorNumber jelly beansProportional Number for Sample
\text {red}200\dfrac{200}{1000} \times 50 = 10
\text{yellow}180\dfrac{180}{1000} \times 50 = 9
\text{blue}200\dfrac{200}{1000} \times 50 = 10
\text{green}140\dfrac{140}{1000} \times 50 = 7
\text{black}100\dfrac{100}{1000} \times 50 = 5
\text{purple}180\dfrac{180}{1000} \times 50 = 9
\text{Total}100050

If we use systematic sampling, we are basically picking one in every nth item. From the sample, a starting point is chosen at random, and items are chosen at regular intervals. For example, we may choose every 5th name from an alphabetical list or choose every 10th chocolate at a factory to quality test.

Examples

Example 1

The local mayor wants to determine how people in her town feel about the new construction project. Select which type of sampling each scenario uses.

a

Selecting every 50th name from an alphabetical list of residents.

A
Stratified sampling
B
Systemic sampling
C
Convenience sampling
D
Simple random sampling
Worked Solution
Apply the idea

It is a systematic sampling because there is a process, every 50th name from an alphabetical list of residents.

The scenario uses systematic sampling, and the answer is Option B.

b

Giving each resident a random number between 1 and 10 and then selecting everyone with the number 3.

Worked Solution
Apply the idea

This is not systematic because even though everyone with the number 3 is chosen they are not necessarily distributed evenly. So in this case we have simple random sampling and the answer is Option D.

c

Selecting 10\% of the residents from each suburb.

Worked Solution
Apply the idea

Because we split out the town in to each suburb and selected an equal proprotion from each of the suburbs, we are using stratified sampling and the answer is Option A.

Example 2

Beth is interested in which students from her school catch public transport. Select whether the following sampling methods are likely to be biased or not.

a

Selecting every 10th person on the bus she catches.

Worked Solution
Create a strategy

Use the fact that an unbiased sampling method means everyone in the population has an equal chance of being selected.

Apply the idea

The population in this case is Beth's year group. So not every student have an equal chance to be selected because the sample is limited only to the bus that she catches.

The sampling method is likely to be biased.

b

Selecting every 10th person on the student list.

Worked Solution
Create a strategy

Use the fact that an unbiased sampling method means everyone in the population has an equal chance of being selected.

Apply the idea

The population in this case is students at Beth's school. Every student have an equal chance to be selected because she is doing a systematic sampling choosing every 10th on the student list at her school.

The sampling method is not biased.

c

Selecting the first 50 students that arrive in the morning.

Worked Solution
Create a strategy

Use the fact that an unbiased sampling method means everyone in the population has an equal chance of being selected.

Apply the idea

The population in this case is students at Beth's school. So every student does not have an equal chance to be selected because Beth is getting only the earliest students in the morning. Students that arrive later will not be selected.

The sampling method is likely to be biased.

d

Selecting by having a computer randomly choose student numbers.

Worked Solution
Create a strategy

Use the fact that an unbiased sampling method means everyone in the population has an equal chance of being selected.

Apply the idea

The population in this case is students at Beth's school. Every student have an equal chance to be selected through a computer with their student numbers.

The sampling method is not biased.

Idea summary

The four sampling methods are:

  • Stratified - selects a proportional amount of people from the different strata in a population.
  • Systematic - selects people using a series of rules on an ordered list.
  • Convenience - selects the people that are easy to survey.
  • Simple random - selects people through a purely random selection.

To have an unbiased sampling method, we want everyone in the population to have an equal chance of being selected.

Outcomes

7.SP.A.1

Understand that statistics can be used to gain information about a population by examining a sample of the population; generalizations about a population from a sample are valid only if the sample is representative of that population. Understand that random sampling tends to produce representative samples and support valid inferences

7.SP.A.2

Use data from a random sample to draw inferences about a population with an unknown characteristic of interest. Generate multiple samples (or simulated samples) of the same size to gauge the variation in estimates or predictions.

What is Mathspace

About Mathspace