Lesson 4
The Shape of Distributions
4.1: Which One Doesn’t Belong: Distribution Shape (10 minutes)
Warmup
The mathematical purpose of this warmup is to collect informal terminology students may use to describe shapes of distributions, as well as any ways to describe distributions they may remember from work in earlier grades. This warmup prompts students to compare four distributions. It gives students a reason to begin using language precisely (MP6) and gives you the opportunity to hear how they use terminology and talk about characteristics of the items in comparison to one another. Listen for students who use the statistically correct vocabulary as well as those who use informal language to describe the shapes.
Launch
Arrange students in groups of 2–4. Display the distributions for all to see. Give students 1 minute of quiet think time and then time to share their thinking with their small group. In their small groups, tell each student to share their reasoning why a particular item does not belong and together find at least one reason each item doesn't belong.
Student Facing
Which one doesn’t belong?
Student Response
Student responses to this activity are available at one of our IM Certified Partners
Activity Synthesis
Select the identified groups to share their reason why a particular item does not belong so that those with informal language speak first and those with more precise terminology follow up. Ensure that each group shares one reason why a particular item does not belong. Record and display the responses for all to see. After each response, ask the class if they agree or disagree. Since there is no single correct answer to the question of which one does not belong, attend to students’ explanations and ensure the reasons given are correct. During the discussion, recast any informal language that is used to describe the shape of each distribution. Introduce and define the terms symmetric, skewed, uniform, bimodal, and bellshaped. It is important to note that the bellshaped distribution is also symmetric.
4.2: Matching Distributions (15 minutes)
Activity
The mathematical purpose of this activity is to give students a chance to practice finding data displays that represent the distribution of the same data set and using precise vocabulary for describing the shape of the distributions while taking turns matching cards. Students trade roles explaining their thinking and listening, providing opportunities to explain their reasoning and critique the reasoning of others (MP3).
Launch
Arrange students in groups of 2. Display the images of the dot plot and histogram. Ask students what they notice and wonder.
If it does not come up, help students notice that these two data displays show the same data in different formats. They can also be described as skewed right.
Give each group a set of cutup cards. Explain that a match is two different displays that represent the distribution of the same set of data. Ask students to take turns: the first partner identifies a match, explains why they think it is a match, then describes the distribution while the other student listens and works to understand. Then they switch roles.
Student Facing
Take turns with your partner matching 2 different data displays that represent the distribution of the same set of data.
 For each set that you find, explain to your partner how you know it’s a match.
 For each set that your partner finds, listen carefully to their explanation. If you disagree, discuss your thinking and work to reach an agreement.
 When finished with all ten matches, describe the shape of each distribution.
Student Response
Student responses to this activity are available at one of our IM Certified Partners
Anticipated Misconceptions
For students having trouble with the uniform distribution histograms, remind them that the lower bound for each interval is included and the upper bound is not. Ask them why this might change the last bar in each of these histograms. Some students may not know where to start to match data displays. You can tell them to look at the lowest and highest values as a starting point to finding similarities between two representations.
Activity Synthesis
Once all groups have completed the matching, discuss the following:
 “Which matches were tricky? Explain why.” (The uniform distributions may be difficult.)
 “What vocabulary was useful to describe the shape of the distribution?” (symmetric, skewed, uniform, bimodal, bellshaped)
 “Were there any matches that could be described by more than one of these vocabulary terms?” (Yes, symmetric or skewed can also be used with some of the other terms for some of the distributions.)
If necessary, ask students to revoice less formal descriptions of the shape of the distribution using formal language including:
 Symmetric distribution
 Skewed distribution
 Uniform distribution
 Bimodal distribution
 Bellshaped distribution
Supports accessibility for: Language; Organization
4.3: Where Did The Distribution Come From? (10 minutes)
Activity
The mathematical purpose of this activity is to remind students of the importance of context to statistics. Although some analysis can be done outside of a context, it is often useful to think about the real situations in which the data was collected to engage student intuition and understanding.
Launch
Keep students in the same groups. Assign each pair of students one of the completed matches from the card sort activity. Tell students there are many possible answers for each representation. After 2 minutes of quiet work time, ask students to compare their responses to their partner’s and decide if they are both reasonable. You may need to demonstrate this activity before beginning if you think students may have trouble getting started. After each group finishes with their assigned distribution, assign the group another distribution to consider.
Design Principle(s): Support sensemaking
Supports accessibility for: Language; Organization
Student Facing
Your teacher will assign you some of the matched distributions. Using the information provided in the data displays, make an educated guess about the survey question that produced this data. Be prepared to share your reasoning.
Student Response
Student responses to this activity are available at one of our IM Certified Partners
Student Facing
Are you ready for more?
This distribution shows the length in inches of fish caught and released from a nearby lake.

Describe the shape of the distribution.

Make an educated guess about what could cause the distribution to have this shape.
Student Response
Student responses to this activity are available at one of our IM Certified Partners
Activity Synthesis
Ask each group to share their response for at least one of the distributions they were assigned. After each group shares, ask the class if their context is reasonable. Here are some questions for discussion:
 “How did you use the shape of the data to come up with your question?” (Since the data was bellshaped, I tried to think of situations where most of the data would be similar with a few points a little away from the these values.)
 “Would you always expect your question to result in a [symmetric, skewed, bellshaped, etc.] distribution?” (Not necessarily, but for most cases it would.)
Reveal the actual survey question that produced the distribution. Actual questions by row:
 How many points did Kiran score in each of his 22 games this season?
 What were typical low temperatures in a Siberian town during January?
 On a scale of 1–8, how was the service at the restaurant?
 How many questions did people get correct on the vocabulary test the first week of school?
 How many questions did people get correct on the vocabulary test the second week of school?
 How many feet below the surface were each of the core samples taken?
 How many trees are in my backyard at various temperatures?
 What was the sum when you spun a spinner labeled 0 to 5 twice?
 What was the weight of the crystal you grew in chemistry class?
 How many questions did students get correct on a 10item matching test?
Ask students to share what they have learned about the distribution now that they can think of the data in a real situation.
Lesson Synthesis
Lesson Synthesis
In this lesson students describe the shape of distributions using formal language and invent contexts for distributions with different shapes. Here are some questions for discussion.
 “What does a symmetric data set look like?” (It will have a line of symmetry in the middle and the left side will look like a reflection of the right side.)
 “What does it mean to say that the shape of a distribution is uniform?” (There will by an equal number of each data value and the shape will look rectangular.)
 “Have you heard of a bell curve before? How does this relate to a bellshaped distribution?” (Yes. I have heard of it in science class where a bell curve was used to compare data in an experiment.)
 “What is an example of a context where you would expect to find a bimodal distribution?” (You might find it if you measured the weight of a herd of cows in the springtime. The adult cows would be one peak and the calves would be the other peak.)
 “Can a skewed distribution also be symmetric? Why or why not?” (No, because skewed means that one side of the peak of the data has more data values further away from the peak than the other side. There is no line of symmetry.)
4.4: Cooldown  Distribution Types (5 minutes)
CoolDown
Cooldowns for this lesson are available at one of our IM Certified Partners
Student Lesson Summary
Student Facing
We can describe the shape of distributions as symmetric, skewed, bellshaped, bimodal, or uniform. Here is a dot plot, histogram, and box plot representing the distribution of the same data set. This data set has a symmetric distribution.
In a symmetric distribution, the mean is equal to the median and there is a vertical line of symmetry in the center of the data display. The histogram and the box plot both group data together. Since histograms and box plots do not display each data value individually, they do not provide information about the shape of the distribution to the same level of detail that a dot plot does. This distribution, in particular, can also be called bellshaped. A bellshaped distribution has a dot plot that takes the form of a bell with most of the data clustered near the center and fewer points farther from the center. This makes the measure of center a very good description of the data as a whole. Bellshaped distributions are always symmetric or close to it.
Here is a dot plot, histogram, and box plot representing a skewed distribution.
In a skewed distribution, one side of the distribution has more values farther from the bulk of the data than the other side. This results in the mean and median not being equal. In this skewed distribution, the data is skewed to the right because most of the data is near the 8 to 10 interval, but there are many points to the right. The mean is greater than the median. The large data values to the right cause the mean to shift in that direction while the median remains with the bulk of the data, so the mean is greater than the median for distributions that are skewed to the right. In a data set that is skewed to the left, a similar effect happens but to the other side. Again, the dot plot provides a greater level of detail about the shape of the distribution than either the histogram or the dot plot.
A uniform distribution has the data values evenly distributed throughout the range of the data. This causes the distribution to look like a rectangle.
In a uniform distribution the mean is equal to the median since a uniform distribution is also a symmetric distribution. The box plot does not provide enough information to describe the shape of the distribution as uniform, though the even length of each quarter does suggest that the distribution may be approximately symmetric.
A bimodal distribution has two very common data values seen in a dot plot or histogram as distinct peaks.
Sometimes, a bimodal distribution has most of the data clustered in the middle of the distribution. In these cases the center of the distribution does not describe the data very well. Bimodal distributions are not always symmetric. For example, the peaks may not be equally spaced from the middle of the distribution or other data values may disrupt the symmetry.