instagram

(Using another tool? Please select it now: Pyret.)

Students practice reading and describing histograms, using new vocabulary to describe histogram shape.

Lesson Goals

Students will be able to…​

  • Describe the distribution of data in a histogram by identifying peaks, clusters, gaps, and outliers in histograms.

  • Identify a histogram’s shape as skewed right, skewed left, or symmetric.

  • Understand that bar charts display the frequency of categorical data while histograms show the frequency and distribution of quantitative data.

  • Recognize that changing a histogram’s bin size impacts the histogram’s shape and can affect what we learn about the distribution of the data.

Student-facing Lesson Goals

  • Let’s learn how to describe histogram shape.

Materials

Preparation

  • For the Histograms Card Sort activity in this lesson you will need to print and cut one set of cards for each pair of students in your class. We recommend storing the cards in envelopes.

  • There is an interactive Desmos activity in the lesson.

    • Open the Desmos activity you plan to use and make a code or link to share with students.

    • If you are using our Google Slides, add the appropriate link to the slide deck.

    • If you’re a first-time Desmos user, fear not! Here’s what you need to do.

Assessments

🔗Describing Shape

Overview

This activity focuses on describing shape from a histogram. Students explore shape by considering symmetry, peaks, clusters, and skew.

Launch

When we have a large dataset and we want a more collective overview of the data, histograms shine and dot plots become impractical. Histograms are valuable because they help us to see shape.

The Histograms Card Sort activity students are about to complete requires some teacher preparation. Make sure you’ve printed and cut out a set of cards for each pair of students before proceeding.

If that preparation is unrealistic for you, project the images for students to refer to as they work through this section and modify the directions accordingly. (Viewing all of the images at once, rather than as individual cards, requires a higher cognitive load for students, so we don’t recommend it.)

Your teacher will give you and your partner an envelope containing lettered histogram cards. Lay out the cards in front of you.

  • What do you Notice about the histograms?

  • What do you Wonder about the histograms?

Let’s sort some histograms!

  • With your partner, use the histogram cards to sort the cards into two piles: histograms that are roughly symmetrical, and histograms that are not roughly symmetrical.

  • Be prepared to share your groupings with the class.

If students are not familiar with the concept of "symmetry", spend some time briefly explaining that it exists when the right and left side of the data visualization are mirror images. Ensure that students agree on which histograms are symmetrical and which ones are not before proceeding with the activity.

Investigate

  • With your partner, use the histogram cards to complete Sorting Histograms.

  • We will pause for discussion after you complete the first section.

As you circulate, make note of the different groupings that students use; this will help with facilitation of class discussion and debrief. Encourage students to use appropriate vocabulary (i.e., peak, cluster, gap).

Let’s debrief.

  • How did you group your symmetrical histograms?

  • Responses will vary. Students may have grouped their histograms by the number of peaks, the number of gaps, or the location of the center. There are no wrong answers here.

  • How can you determine what’s typical of a symmetrical histogram?

  • How did you group your asymmetrical histograms?

  • Possible groupings: no peaks, 1 peak, 2 peaks; 1 gap, no gaps; and center at X, center at Y.

  • How can you determine what’s typical of a symmetrical histogram?

  • The center of a symmetrical histogram is the line of symmetry. A common misconception is arguing that the center of the axis is the center of the histogram; students must also consider the range of the histogram.

You probably noticed that some histograms trail off to the left, and others trail off to the right. Statisticians refer to this trailing as "skew". Let’s compare skew-left, skew-right, and symmetric histogram shapes.

Symmetric Skew-left Skew-right

A hill-shaped histogram, with both sides sloping away from the peak equally

A hill-shaped histogram, with a clump of taller bars on the right side, and smaller bars trailing off to the left=

A hill-shaped histogram, with a clump of taller bars on the left side, and smaller bars trailing off to the right side

Values are balanced on either side of the center.

Values are clumped around what’s typical, but trail off to the left.

Values are clumped around what’s typical, but trail off to the right.

Skew-left distributions look like the toes on your left foot, and skew-right distributions look like the toes on your right foot!

Shape is useful because it enables us to quickly identify and describe trends in data. When we recognize a particular shape, we can immediately draw conclusions about a dataset! We’ll dig deeper into that idea in our lesson on Histograms: Interpreting "Shape".)

Shape is one of a few key features that sets histograms apart from bar charts!

Bar Charts Versus Histograms surfaces two common student misconceptions about bar graphs that Whittaker & Jacobbe, 2017Douglas Whitaker, Tim Jacobbe (2017). Students' Understanding of Bar Graphs and Histograms: Results From the LOCUS Assessments. Journal of Statistics Education, 25(2), 90-102. cite in their research.

A bar chart displaying student enrollment by major, with the bars arranged tallest to shortest

  • First, students commonly fail to understand that skew is associated with distributions of quantitative variables. This is why a bar graph with its bars arranged in increasing or decreasing order does not display a skewed distribution.

  • Students also commonly believe that the category with the greatest value represents a majority of the responses. In this dataset, that is not the case: the bar representing the college of science does not represent more than half of the students.

Because a bar chart displays categorical data, we can order the bars anyway that we wish. That is not the case with histograms: since quantitative data must follow a natural order, a histogram’s bars cannot be re-ordered.

  • What are some ways that bar charts and histograms are alike?

  • Both bar charts and histograms display frequency. Both displays have bars!

  • What are some ways that bar charts and histograms are different?

  • Bar charts display categorical data, while histograms display quantitative data. A histogram’s bars touch, and a bar chart’s bars do not. Histograms have shape, so their bars cannot be reordered. Bar chart bars can be reordered.

Synthesize

  • Envision a skew-left histogram. Where do you think its outliers are? Explain how you know.

  • A skew-left histogram trails off to the left. The histogram trails to the left because there are few data points (outliers) to the left, rather than many (causing tall bars).

  • Why do histograms have shape but bar charts do not?

  • Because histograms are quantitative, their bars must appear in numerical order. Bar charts show categorical data, meaning that the bars can be arranged in any way.

Want to check student mastery of the content you’ve just taught? Administer Histograms: Visualizing Shape Formative Assessment 1 to get a snapshot of your students' current level of mastery. Make sure you have created a link or code for your class to the assessment.

Alternatively, we offer a compilation of both Checkpoints in Histograms: Visualizing Shape Cumulative Assessment.

🔗Choosing the Right Bin Size

Overview

Students learn to make histograms from the animals-dataset in CODAP and explore the importance of choosing the right bin size in order for a histogram to show us the shape of the data.

Launch

  • What is a bin on a histogram? And what do we already know about them?

  • The goal here is to hear what students have internalized so far. This is not the moment to teach about bins as we are about to delve into a deep exploration.

  • We would expect that students have some sense that bins are the intervals that data gets grouped into.

Bins that are too small will hide the shape of the data by breaking it into too many short bars. Bins that are too large will hide the shape by squeezing the data into just a few tall bars. So far, the bins were provided for you. But how do you choose a good bin-size?

Make sure you have created a link or code for your class to Exploring Bin Size (Desmos).

  • Open the Desmos link I’ve shared with you. (The file should be called Histogram Bin Size Exploration.)

  • Use the Bin Size slider to explore how changing the bin size impacts the shape of the histogram and what we can learn about the distribution of the data.

  • Record your notices and wonders in the space provided on Slide 1.

  • Before moving on to Slide 2, be sure to click the "New Dataset" button and see if you notice and wonder anything new.

  • When you’re done exploring Slide 1, move on to Slide 2 and answer the questions.

Investigate

Suppose we want to know how long it takes for animals from the shelter to be adopted.

  • What did you Notice?

  • We see most of the histogram’s area under the two bars between 0 and 10 weeks, so we can say it was most common for an animal to be adopted in 10 weeks or less.

  • We see a small amount of the histogram’s area trailing out to unusually high values, so we can say that a couple of animals took an unusually long time to be adopted: one took even more than 30 weeks.

  • More than half of the animals (17 out of 31) took just 5 weeks or less to be adopted. But the few unusually long adoption times pulled the average up to 5.8 weeks.

  • What was a typical adoption time?

  • Almost all of the animals were adopted in 10 weeks or less, but a couple of animals took an unusually long time to be adopted — even more than 20 or 30 weeks!

  • Be sure to draw attention to the fact that it would have been hard to give this summary by reading through the table, but the histogram makes it easy to see!

  • What bin sizes worked best for analyzing adoption?

  • Have students talk about the bin sizes they tried. Encourage open discussion as much as possible here, so that students can make their own meaning about bin sizes before moving on to the next point.

Rule of thumb: a histogram should have between 5–10 bins.

Histograms are a powerful way to display a dataset and assess its shape. Choosing the right bin size for a column has a lot to do with how data is distributed between the smallest and largest values in that column! With the right bin size, we can see the shape of a quantitative column.

But how do we talk about or describe that shape, and what does the shape actually tell us?

Our Histograms: Interpreting "Shape" lesson addresses these questions and explores how histogram shape affects the mean (average).

Synthesize

  • What would the histogram look like if most of the animals took more than 20 weeks to be adopted, but a couple of them were adopted in fewer than 5 weeks?

  • The histogram would be skewed left, with a peak on the right.

  • What would the histogram look like if every animal was adopted in roughly the same length of time?

  • All of the animals would be stacked into one very tall bar.

These materials were developed partly through support of the National Science Foundation, (awards 1042210, 1535276, 1648684, 1738598, 2031479, and 1501927). CCbadge Bootstrap by the Bootstrap Community is licensed under a Creative Commons 4.0 Unported License. This license does not grant permission to run training or professional development. Offering training or professional development with materials substantially derived from Bootstrap must be approved in writing by a Bootstrap Director. Permissions beyond the scope of this license, such as to run training, may be available by contacting contact@BootstrapWorld.org.