Use stratified sampling
Let’s say I’m being paid by a company to find out what type of chocolate everyone in the UK prefers. Well, I can’t ask everyone, so I’ll ask a much smaller group of people. That group is called a sample of the population.
Thing is, people are all different. Some are young, some are old, some are men, some women, some are rich, most are not, some have lived here all their lives, some moved here from other countries with different cultures. If I ask only one type of person, 12 year old white boys for example, and then try to pretend that the chocolate they prefer is the same for everyone, my sample would be biased. This means it doesn’t represent reality.
To make sure my sample isn’t biased there are two things I need to do. First, I need to choose a sample size that’s big enough. If we’re talking about the UK population, which is about 6070 million people living all kinds of different lives, then ten people for example will hardly be enough. For a serious professional, a sample of even a hundred people might not be enough, you may need to ask as many as a thousand people.
The second thing I need is a ‘sampling strategy’, i.e. how am I going to choose the people to ask?
Stratified sampling
Stratified sampling is considered the most representative. It involves splitting the population into categories you choose (e.g. age brackets – 010, 1120, 2130 etc.), finding the proportion of people in the whole population who fit into each category, and then ensuring the same proportion of people are selected for your sample.
e.g. if 10% of 60 million people are between 1120 years old, make sure 10% of your 1000 person sample (i.e. 100 people) are also 1120 years old.
Example:
The owner of a health club wants to out how often members use the club. He collects data about the age of the 1000 members.
Under 25  25 40  4150  Over 50  
Population size  340  380  200  80 
The owner decides to take a stratified sample of size 50.
To calculate how many members of each age group he should choose, we need to first divide the size of the sample (50) by the number of members in the health club = 50/1000 then multiply this by the number in each age group. By doing this we get the table below
Under 25  25 40  4150  Over 50  
Population size  340  380  200  80 
Sample Size  17  19  10  4 
But 4 out of 80 (for Over 50s) might be less accurate than the 19 out of 380 (for 2540s), so he might choose 5 Over 50s and 18 2540s. This principle may be useful if a table such as the above contained fractions (such as when an bage group does not contain a number which is a multiple of 20).
Nothing in this section yet. Why not help us get started?
Follow the links below to see how this topic has appeared in past exam papers
Edexcel June 2010 (H)  Page 19, Question 24
Related Topics
Requires a knowledge of…
Related Questions

1Vote4Answers

1Vote3Answers

1Vote2Answers

1Vote3Answers

0Votes4Answers

0Votes1Answer

0Votes1Answer

2Votes4Answers

0Votes2Answers