For many people the first word that comes to mind when they think about statistical charts is “lie.”
– Edward R. Tufte
When I moved from engineering to management, people expected me to make charts.
After spending some time learning about data visualization, I’ve come to two important conclusions:
- Good data visualization is powerful
- Powerful data visualization is rare
But creating a compelling chart is an underrated superpower for engineers. Here are some ideas that helped me learn how to do that.
Learning from experts
The best books I’ve read on data visualization are:
- “The Visual Display of Quantitative Information” by Tufte, and
- “Storytelling with Data” by Cole Nussbaumer Knaflic
Between those two books, you get a balance: the ideal (from Tufte) and the PowerPoint world (from Knaflic).
Tufte
Edward R. Tufte is professor emeritus of statistics, political science, and computer science at Yale.
And he’s a man upset by pie charts.
the only worse design than a pie chart is several of them.
– Edward R. Tufte
Tufte’s rules:
- 🦑Maximize the data-ink ratio – Most of your chart should be data. Erase the parts that convey nothing.
- 🍫Avoid chart junk – Avoid needless colors, shading, tickmarks, and gridlines: remove distractions from data.
Knaflic
Cole Nussbaumer Knaflic was the former manager of Google’s People Analytics team. She knows her way around a PowerPoint1.
Knaflic’s rules:
- 📈Choose an appropriate visual display – Know your context, then pick your chart
- 🏰Tell a story – Charts ought to communicate something; you should know what that something is.
Tufte’s principles: a case study
Take a look at this chart from the Wikimedia Foundation’s 2023–2024 budget projections:
This is a default Google Sheets chart for this data.
What I’m able to glean from this chart:
- “Building analytics & ML Services”: the biggest, > 25%
- “Features and functionality” and “Supporting volunteers”: smaller, ~20% each
- “Fundraising,” “Protecting access,” and “General & Admin”: smaller still, ~10% each
- The colors seem meaningless
Now consider what we glean from the table used to generate it:
Program | Budget (millions) | Percent |
---|---|---|
Building analytics & ML services | $46.4 | 26.2% |
Features and functionality | $39.7 | 22.4% |
Supporting volunteers | $35.1 | 19.8% |
General & Admin | $21.3 | 12.0% |
Fundraising | $17.9 | 10.1% |
Protecting access | $16.6 | 9.4% |
Total | $177.0 | 100.0% |
This table gives us more information than the chart in a similar amount of space.
From the table, we learn:
- Exact percentages—no need to guess
- Exact dollar amounts
- The total budget as a dollar figure
And we get rid of the meaningless colors.
OK—confession time: there were percentages on the original pie chart. I edited them out.
But I did that to prove a point. The pie chart is doing less work than the numbers. The slices of the pie add almost nothing to the numbers.
Tufte would prefer the table above to the pie chart above because the table:
- Shows the data
- Maximizes the data-ink ratio
- Avoids chartjunk (like the meaningless colors)
And in this case, I agree: I like the table.
But choosing data visualizations is difficult.
How to choose a chart: visual perception accuracy ranked
If you intend to communicate, then people must be able to read your chart.
In 1985, Cleveland & McGill conducted what is still the most cited experiment on what charts are easiest to read. Their purpose was simple: rank standard charts by the number of errors people make while reading them.
Here’s their ranking of charts—from easiest to hardest to read:
Icon | Name | Example |
---|---|---|
Position along a common scale | scatterplots, bar charts, sparklines | |
Positions along nonaligned scales | stacked bar charts | |
Length, direction, angle | pie charts, donut charts | |
Area | bubble charts, treemaps | |
Volume, curvature | 3d charts | |
Shading, color saturation | Heatmaps |
But I love heatmaps! You can use heatmaps. Just understand that only a sophisticated audience can interpret heatmaps correctly.
Know your audience, then use something like the data viz catalog to choose your chart.
Tell a story
Charts are communication.
At their worst, as Tufte said, they’re little more than “devices for showing the obvious to the ignorant.”
But at their best, they’re a powerful way to intuitively communicate a lot of data in a small space.
Tufte will not help you with PowerPoint. In his essay, “The Cognitive Style of PowerPoint” (which is included in the book “Beautiful Evidence”), he says: “bulleted outlines make us stupid” before going on to blame PowerPoint for the space shuttle Columbia disaster.↩︎