# DATA101 Critique Activity: Perception & Color + Reading Charts

Use this notebook to practice *reading* charts and *critiquing* design choices.
Your goal is not to "judge"—your goal is to describe what a chart makes easy/hard, and propose defensible fixes.


## Learning Objectives

By the end, you should be able to:

- Identify the **claim** a chart invites
- Name the **task(s)** a viewer is trying to do
- Diagnose common perception problems (color, grouping, scale, missing data, area encodings)
- Propose redesigns that improve **accuracy, accessibility, and trust**


## Group Information (edit this cell)

Fill this table in before you start.

| Group # | Names |
|---:|---|
| 8 | Colobong, Franz Andrick |
|  | Domanais, Joshua |
|  | Manlapig, Jose Mari |
|  | Rocha, Angelo |
|  | Santos, Ryan Joseph |


## What To Submit

Submit the completed notebook with:

- Your critique notes for **all case studies**
- A completed **summary table**
- A short set of **rules of thumb** you will use in your own work
- A **References** section


## Chart Reading Checklist (Use for Every Case)

Use this checklist to structure your critique.

1. **Title / claim:** What is the chart inviting you to believe?
2. **Data:** What is measured? What is missing?
3. **Units + denominators:** Counts vs rates vs percent? Per-capita?
4. **Axes:** Scale, baseline, truncation, log?
5. **Encoding:** Position vs length vs area vs color — is the channel appropriate for the task?
6. **Grouping / context:** Aggregation choices, categories, labeling, legend hunting.
7. **Color + accessibility:** Sequential/diverging/categorical? CVD-safe? Enough contrast?
8. **Conclusion:** Does the chart support the claim? What redesign would make it more defensible?


# Case Study 1 — Hurricane “Cone of Uncertainty” (Uncertainty Misread)

This National Hurricane Center (NOAA) graphic shows a forecast track and the famous "cone of uncertainty" for a tropical cyclone.
It’s widely shared, but many viewers misread the cone as the *storm size* or as the *area of impact*.
Your job is to describe what it actually represents and what design choices can reduce misinterpretation.

<img src="noaa_hurricane_cone.png" alt="NOAA National Hurricane Center cone of uncertainty forecast graphic" style="width: 100%; height: auto; border-radius: 12px;" />

**Source:**
National Hurricane Center. (n.d.). *Tropical cyclone track and watches/warnings: Cone of uncertainty example* [Image]. National Oceanic and Atmospheric Administration. Retrieved January 24, 2026, from https://www.nhc.noaa.gov/images/cone_5day_with_wind.png


## Your Critique (edit this cell)

- **Claim (1 sentence):** The chart communicates the increasing margin of error in Hurricane Laura’s path over time, but it inadvertently suggests the storm itself is physically expanding.
- **What the cone *does* mean (in your words):** The cone shows where the storm's center is likely to travel; it widens because track forecasts grow more uncertain at longer lead times.
- **What people *often think* it means (the common misread):** People usually view the cone as the size of the storm over time, which may lead those outside the cone to have a false feeling of safety.
- **Where uncertainty is shown well / poorly (be specific):** It is shown well by the widening of the cone, which signals that the margin of error grows for longer-range forecasts. It is shown poorly by the solid black border, which creates a psychological “cutoff” and may imply that no danger exists outside the cone.
- **Color + symbols:** The letters (S, H, M) are categorical (representing storm stages), while the "Current wind extent" and the wind speeds are quantitative. The red/blue coastal lines are hard to decode because the difference between "Watch" (lighter colors) and "Warning" (darker colors) can be difficult to see on different screens.
- **Accessibility:** If the viewer can’t rely on color, the distinction between the “Watches”, “Warnings”, and “Current Wind Extent” is lost in the map/chart. They must rely entirely on the letter icons (H vs S vs M) to understand the intensity of the storm.
- **Redesign proposal:** 
    1. Replace the cone's solid border with a faded or gradient edge to show that the risk doesn't end at a specific line.
    2. Use spaghetti plots inside the cone to show that it is made of many possible paths.
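
To make the "cone is uncertainty, not storm size" point concrete, here is a minimal sketch of how a cone radius can be derived from past forecast errors. It assumes the NHC convention that each circle along the track is sized to enclose roughly two-thirds of historical track errors at that lead time. The error samples are invented for illustration, and `cone_radius` is a hypothetical helper, not NHC code.

```python
import math
from fractions import Fraction

# Hypothetical past track errors (nautical miles) by forecast lead time (hours).
# These numbers are invented for illustration; they are not NHC statistics.
past_errors_nm = {
    24: [18, 25, 31, 40, 44, 52, 60, 75, 90],
    72: [50, 70, 85, 100, 120, 140, 165, 190, 240],
    120: [90, 130, 160, 200, 230, 270, 320, 380, 460],
}

def cone_radius(errors, coverage=Fraction(2, 3)):
    """Smallest radius that encloses at least `coverage` of the past errors."""
    ranked = sorted(errors)
    k = math.ceil(coverage * len(ranked))  # how many errors the circle must contain
    return ranked[k - 1]

radii = {lead: cone_radius(errs) for lead, errs in past_errors_nm.items()}
for lead, r in radii.items():
    print(f"{lead:>3} h: cone radius {r} nm")
```

Under these assumptions the radius depends only on how wrong past forecasts were at each lead time. Nothing about the storm's physical size enters the calculation, and about one in three past errors still fell outside the circle.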



# Case Study 2 — Unemployment Rate Over Time (Context + Annotation)

This is a FRED (Federal Reserve Bank of St. Louis) chart of the U.S. unemployment rate over time.
It includes recession shading (context), which can be helpful—but also easy to over-interpret as "causes" rather than timing cues.
Your critique should focus on reading the axes, the meaning of annotations, and what story the chart supports (and does not support).

<img src="fred_unrate.png" alt="FRED line chart of U.S. unemployment rate (UNRATE)" style="width: 100%; height: auto; border-radius: 12px;" />

**Source:**
Federal Reserve Bank of St. Louis. (n.d.). *Unemployment Rate (UNRATE)* [Chart]. FRED. Retrieved January 24, 2026, from https://fred.stlouisfed.org/series/UNRATE


## Your Critique (edit this cell)

- **Claim (1–2 sentences):** This chart makes it easy to conclude that U.S. unemployment is highly cyclical, consistently spiking during or immediately following economic recessions. It frames the current labor market as relatively stable and historically low following the unprecedented volatility of the 2020 COVID-19 shock.
- **Task(s):** The primary tasks are to summarize long-term historical trends and detect anomalies. This supports high-level policy-making decisions or economic forecasting by identifying where the current economy sits within the boom-bust cycle.
- **Axes + units:** 
    - Y-Axis: Percent (Unemployment Rate), ranging from 0.0 to 15.0.
    - X-Axis: Time in years (1948 to 2026).
    - The reader should verify the frequency of the data as monthly or quarterly and account for the seasonality of the data.

- **Annotations:** The gray shaded regions represent U.S. recessions as defined by the NBER. A viewer might wrongly infer that recessions directly cause each spike, or that unemployment peaks inside the shaded band. In fact, unemployment often begins to rise before a recession is officially declared and continues to rise well after the shaded period has technically ended. The second pattern is often referred to as a jobless recovery.
- **Perception:** The scale is appropriate for trend detection over a period of more than 75 years. However, the compressed X-axis makes it difficult to see small but significant fluctuations of 0.1 to 0.2 percentage points that matter to economists. Perceived structure is highly dependent on scale and resolution.
- **Redesign proposal:**
    - Contextual Callouts: Add text labels to major peaks to explain why a spike occurred, not just when it happened.
    - Benchmark Lines: Add a horizontal reference line representing the natural rate of unemployment to help distinguish low unemployment from unsustainably low unemployment.
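
The lag between the shaded recession window and the unemployment peak can be checked directly rather than eyeballed. The sketch below uses an invented monthly series (not actual UNRATE values) and also shows why a change in a rate should be reported in percentage points, not percent.

```python
# Hypothetical monthly data: (label, unemployment rate in percent, inside shaded recession?).
# Values are invented for illustration; they are not actual UNRATE observations.
series = [
    ("M01", 4.2, False),
    ("M02", 4.3, False),
    ("M03", 4.6, True),
    ("M04", 5.1, True),
    ("M05", 5.5, True),
    ("M06", 5.8, False),
    ("M07", 6.1, False),
    ("M08", 6.3, False),
    ("M09", 6.0, False),
]

# Index of the last shaded month and of the highest unemployment rate.
recession_end = max(i for i, (_, _, shaded) in enumerate(series) if shaded)
peak = max(range(len(series)), key=lambda i: series[i][1])
lag = peak - recession_end  # months between end of shading and the peak

# Percentage points (absolute difference) vs percent (relative change).
point_change = series[peak][1] - series[0][1]
relative_change = point_change / series[0][1] * 100

print(f"Peak occurs {lag} months after the shaded window ends")
print(f"Rise: {point_change:.1f} percentage points ({relative_change:.0f}% relative)")
```

In this toy series the peak falls outside the shading, which is the pattern a contextual callout or reference line would need to explain.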



# Case Study 3 — CO₂ Emissions per Capita (Color + Line Overload)

This Our World in Data chart shows CO₂ emissions per person over time for multiple countries.
Multi-line charts often fail when they rely on too many similar colors, require legend hunting, or hide important context (like per-capita vs total).
Your critique should focus on whether the color and labeling choices support the intended comparisons.

<img src="owid_co_emissions_per_capita.png" alt="Our World in Data multi-line chart of CO2 emissions per capita for selected countries" style="width: 100%; height: auto; border-radius: 12px;" />

**Source:**
Our World in Data. (n.d.). *CO₂ emissions per capita* [Chart]. Retrieved January 24, 2026, from https://ourworldindata.org/grapher/co-emissions-per-capita


## Your Critique (edit this cell)

- **Claim (1–2 sentences):** The viewer will likely compare how CO₂ emissions per person differ across countries over time, noticing that the United States and Russia emit far more per capita than the global average. Another comparison is the long-term decline in per-capita emissions in European countries, while China shows a noticeable increase in recent decades.

- **Data meaning:** “Per capita” implies that CO₂ emissions are adjusted for population size, reflecting the average emissions of an individual in each country rather than total national output. If totals were shown instead, countries with very large populations, such as China and India, would dominate the chart, making smaller but high-emitting countries appear less significant.

- **Color + labeling:** Although each country is color-coded, it becomes difficult to identify them quickly when multiple lines intersect or cluster together. Legend hunting occurs on the right side of the chart, especially near the most recent years where several lines converge, making it harder to match colors to countries with similar hues.

- **Perception:** Countries with consistently high per-capita CO₂ emissions, particularly the United States and Russia, stand out because their lines are clearly separated from the rest. Comparisons among mid-range countries can be misleading due to overlapping lines, which may give the impression that differences are minimal when meaningful gaps actually exist, especially given the relatively large y-axis intervals.

- **Accessibility:** This chart may be difficult for color-vision-deficient viewers because it relies heavily on color alone to differentiate countries, with several lines using similar tones. The lack of patterns, markers, or direct labels makes some series hard to distinguish.

- **Redesign proposal:**
  - **Highlight + de-emphasize:** Emphasizing key countries such as the United States, China, and the global average with bold colors while fading other countries into light gray would reduce visual clutter and help viewers focus on the most important comparisons. This approach guides viewer attention intentionally, preventing less relevant lines from competing visually with the main narrative of the chart.

  - **Small multiples / fewer series:** Splitting the chart into small multiples by grouping countries (e.g., high-income vs. developing nations) or reducing the number of lines shown would make trends easier to compare and eliminate the need for constant legend hunting. By isolating groups, viewers can more clearly observe patterns and differences that may be overlooked in a single crowded chart.
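
A small sketch of two ideas from this critique: how per-capita values can reorder countries relative to totals, and how a highlight/de-emphasize style map might be set up before plotting. The country names, totals, populations, and colors are all hypothetical placeholders, not Our World in Data figures.

```python
# Hypothetical totals (million tonnes CO2) and populations (millions); illustrative only.
countries = {
    "Country A": {"total_mt": 5000.0, "population_m": 330.0},
    "Country B": {"total_mt": 10000.0, "population_m": 1400.0},
    "Country C": {"total_mt": 600.0, "population_m": 40.0},
}

def per_capita_tonnes(total_mt, population_m):
    # million tonnes / million people = tonnes per person
    return total_mt / population_m

per_capita = {name: per_capita_tonnes(**vals) for name, vals in countries.items()}

# The ranking depends on which denominator the chart uses.
by_total = sorted(countries, key=lambda n: countries[n]["total_mt"], reverse=True)
by_per_capita = sorted(per_capita, key=per_capita.get, reverse=True)

# Highlight + de-emphasize: a few series get a strong color, the rest fade to gray.
highlighted = {"Country A", "Country B"}
styles = {
    name: {"color": "#d95f02", "linewidth": 2.5} if name in highlighted
    else {"color": "#bbbbbb", "linewidth": 1.0}
    for name in countries
}

print("By total:     ", by_total)
print("By per capita:", by_per_capita)
```

The `styles` dictionary is meant to be passed to whatever plotting library is in use. Pairing it with direct labels at the line ends would remove the need for a separate legend.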


# Case Study 4 — FluView (Missing Data ≠ Zero + Visual Emphasis)

This CDC FluView chart reports the influenza hospitalization rate among long-term care facility residents for a season.
The gray shading indicates preliminary data—an important detail that many viewers miss when they read the line as fully final.
Your critique should focus on how missing/preliminary data is encoded and whether the chart communicates uncertainty clearly.

<img src="cdc_ltcf_influenza_rate.png" alt="CDC FluView line chart showing long-term care influenza hospitalization rate" style="width: 100%; height: auto; border-radius: 12px;" />

**Source:**
Centers for Disease Control and Prevention. (2026). *Long-term care influenza hospitalization rate among residents, reported to NHSN, national summary, 2025–2026 season* [Chart]. CDC FluView. Retrieved January 24, 2026, from https://www.cdc.gov/fluview/media/images/2026/01/LTCF02.gif


## Your Critique (edit this cell)

- **Claim (1–2 sentences):** The chart invites the viewer to see that influenza hospitalization rates among long-term care residents were very low for most of the season and then peaked during week 53. At the end, it shows the hospitalization rate in sharp decline.

- **Missing vs preliminary:** The uncertainty is indicated by the gray shaded region covering the final 3 weeks of the season, along with a subtitle stating that "Preliminary data are shaded in gray." In terms of discoverability, it is not strongly encoded: the shading is subtle and the explanation is separated from the data points. This forces viewers to "hunt the legend" or read extra details to understand the status of the line, which can lead to interpretation errors.

- **Axes + units:** The unit is the hospitalization rate per 100,000 residents. The population is residents of long-term care facilities nationwide. The chart plots the rate as position along a common y-axis scale, which is the right channel for comparing magnitude.

- **Color critique:** The magenta line creates a strong preattentive cue that draws the viewer's focus to the trend. However, it keeps the same hue and saturation inside the gray shaded region, so the line itself does not differentiate final from uncertain data. While magenta usually provides high contrast against white and gray backgrounds, the chart uses no redundant encoding to help viewers with visual impairments or those who miss the background shading.

- **Interpretation risk:** A rushed reader might experience an interpretation bug: they might see the sharp downturn at the end of the magenta line as a definitive sign that the flu season or wave has passed, and fail to realize that those final points are preliminary and could be revised upward as more complete reports arrive. The chart presents a single line that implies false precision.

- **Redesign proposal:** First, I will change the magenta line from solid to dashed once it enters the gray shaded region. This ensures that the data status is encoded in the marks themselves, not only in the background. Second, I will place the text "Preliminary Data" directly above the shaded area or near the final data point. This reduces the cognitive load of looking back at the subtitle. Lastly, I will add a shaded interval around the line in the gray zone to represent the potential spread of the preliminary data.
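
The solid-to-dashed redesign needs the series split into a final segment and a preliminary segment before plotting. The sketch below shows one way to do that, assuming the preliminary points are contiguous at the end of the season. The weekly rates are invented, and `split_by_status` is a hypothetical helper.

```python
# Hypothetical weekly data: (label, rate per 100,000, preliminary?).
# Values are invented for illustration; they are not CDC FluView numbers.
points = [
    ("wk1", 1.0, False),
    ("wk2", 2.5, False),
    ("wk3", 6.0, False),
    ("wk4", 9.0, False),
    ("wk5", 7.5, True),
    ("wk6", 5.0, True),
    ("wk7", 3.0, True),
]

def split_by_status(points):
    """Return (final, preliminary) segments; assumes preliminary points come last."""
    first_prelim = next((i for i, p in enumerate(points) if p[2]), len(points))
    if first_prelim == len(points):
        return points, []  # nothing is preliminary
    final = points[:first_prelim]
    # Repeat the last final point so the dashed segment joins the solid one.
    prelim = points[max(first_prelim - 1, 0):]
    return final, prelim

final_segment, prelim_segment = split_by_status(points)
print("solid: ", [label for label, _, _ in final_segment])
print("dashed:", [label for label, _, _ in prelim_segment])
```

Each segment can then be drawn with its own line style, so the status is carried by the mark itself and survives even if the background shading is missed.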


# Case Study 5 — Nightingale Polar Area Diagram (Area Encoding in the Wild)

This classic polar area diagram is historically influential—and also a useful critique target.
It encodes values with sector area/angle, which can make comparisons hard (humans are much better at comparing position/length).
Your critique should focus on what tasks are supported well vs poorly, and what a modern redesign would change.

<img src="wellcome_nightingale_polar_area.jpg" alt="Historical polar area chart attributed to Florence Nightingale (Wellcome Collection plate)" style="width: 100%; height: auto; border-radius: 12px;" />

**Source:**
Wellcome Collection. (n.d.). *Army in the East* [Chart image]. In *Mortality of the British army: at home and abroad, and during the Russian war, as compared with the mortality of the civil population in England; illustrated by tables and diagrams* (Wellcome Collection item b20452433, plate b20452433_0050). Retrieved January 24, 2026, from https://wellcomecollection.org/works/gxtkyqp8


## Your Critique (edit this cell)

- **Claim (1–2 sentences):** The chart shows that the annual mortality rate in the army was much higher in the first year (right circle) than in the second year (left circle). It emphasizes the peak death rate in January 1855.
- **What task(s) this supports well:** It supports a before-and-after comparison, showing the reduction in overall mortality between the two years.
- **What task(s) it makes hard (be specific):** It is extremely difficult to read specific values or compare the rates of consecutive months because the design uses irregular areas.
- **Encoding critique:** What is encoded by **angle**, **radius**, and/or **area**? Why can that mislead? The rate is encoded by the radius (the length of the radial line from the center), but the eye perceives it as an area. Because area grows with the square of the radius, months with large values look worse than they actually are.
- **Color critique:** What does color represent here? Is the legend discoverable and unambiguous? The chart uses no color at all, so it has no color legend that could help a viewer pick out a single month.
- **Accessibility:** If a viewer can’t rely on color, what would you add? I would add clear direct labels and thicker borders to separate the segments.
- **Redesign proposal:** How would you show the same story with a more readable encoding? A line chart with months on the x-axis and the mortality rate on the y-axis. This would support trend analysis without relying on the area encoding used in the diagram.
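
The radius-versus-area distortion described above can be worked through with a short calculation. The two values are hypothetical, and the sketch assumes twelve equal-angle sectors (one per month).

```python
import math

def sector_area(radius, n_sectors=12):
    """Area of one equal-angle sector of a circle split into n_sectors wedges."""
    return math.pi * radius ** 2 / n_sectors

low, high = 100.0, 300.0  # hypothetical monthly values, not Nightingale's data
ratio_values = high / low

# Case 1: value mapped to radius -> perceived area ratio is the square of the data ratio.
ratio_area_if_radius = sector_area(high) / sector_area(low)

# Case 2: value mapped to area (radius = sqrt(value)) -> area ratio matches the data ratio.
ratio_area_if_sqrt = sector_area(math.sqrt(high)) / sector_area(math.sqrt(low))

print(f"data ratio:                 {ratio_values:.1f}x")
print(f"area ratio (radius = value): {ratio_area_if_radius:.1f}x")
print(f"area ratio (radius = sqrt):  {ratio_area_if_sqrt:.1f}x")
```

A value three times larger fills nine times the area when it is mapped to radius. A line or bar chart avoids the problem because position and length scale linearly with the data.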


# Synthesis

## Rules of Thumb (edit this cell)

Write concise rules you can reuse when designing and critiquing charts.

### Color (5 rules)
1. Use distinct variations in saturation to ensure the hierarchy remains legible on different types of screens.
2. Define a simple color system with consistent tokens across all charts in a project to build visual cohesion and user trust.
3. Avoid assigning similar colors to multiple lines in a dense chart, as this increases legend hunting and makes overlapping trends difficult to distinguish.
4. Use hue only to distinguish categories and lightness to show magnitude, ensuring you add redundant encoding (like dashed lines) so the chart remains accessible without relying on color alone.
5. Use distinct textures or high-contrast patterns to differentiate categories; this ensures that the chart will remain legible even when rendered in monochrome.
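
As one way to apply rule 4, the sketch below builds a sequential ramp by holding hue fixed and varying only lightness, using the standard-library `colorsys` module. The hue, saturation, and lightness endpoints are arbitrary choices, and HLS lightness is only a rough proxy for perceived lightness, so a real palette should still be checked with a CVD simulator.

```python
import colorsys

def sequential_ramp(hue, steps, light_start=0.9, light_end=0.3, saturation=0.6):
    """Hex colors at a fixed hue, stepping from light (low values) to dark (high values)."""
    ramp = []
    for i in range(steps):
        t = i / (steps - 1) if steps > 1 else 0.0
        lightness = light_start + t * (light_end - light_start)
        r, g, b = colorsys.hls_to_rgb(hue, lightness, saturation)
        ramp.append("#{:02x}{:02x}{:02x}".format(round(r * 255), round(g * 255), round(b * 255)))
    return ramp

ramp = sequential_ramp(hue=0.6, steps=5)  # 0.6 is a blue hue on colorsys's 0-1 hue scale
print(ramp)
```

Categories would instead get different hues at similar lightness, plus a redundant cue such as dash pattern or marker shape.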

### Reading charts (5 rules)
1. Distinguish between physical scale and statistical uncertainty when interpreting geometric shapes, so that the viewers can understand the context of the chart.
2. Evaluate the impact of smoothing or moving averages as they can hide significant spikes or invent trends by shifting the timing of data peaks.
3. When multiple lines overlap or cluster closely together, meaningful differences can be easily overlooked, so the data should be simplified or separated.
4. Audit the visualization pipeline by checking if the axes and encoding support the claim, while specifically looking for preliminary or missing data that could hide the true distribution.
5. When a visualization encodes a value as radius, readers tend to perceive area as the main encoding, and area grows with the square of the radius. This causes large values to appear more extreme than the data warrant.
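
Rule 2 can be demonstrated with a few lines of code. The sketch below applies a trailing three-point mean to an invented series and compares the raw and smoothed peaks. `trailing_mean` is a hypothetical helper written for this example.

```python
def trailing_mean(values, window):
    """Mean of the current value and up to window - 1 values before it."""
    out = []
    for i in range(len(values)):
        chunk = values[max(0, i - window + 1): i + 1]
        out.append(sum(chunk) / len(chunk))
    return out

raw = [1, 2, 4, 8, 4, 2, 1, 1]  # invented series with a single sharp spike
smooth = trailing_mean(raw, window=3)

raw_peak_at = raw.index(max(raw))
smooth_peak_at = smooth.index(max(smooth))

print(f"raw peak:      {max(raw)} at position {raw_peak_at}")
print(f"smoothed peak: {max(smooth):.2f} at position {smooth_peak_at}")
```

In this toy series the smoothed peak is lower than the raw spike and arrives one step later, so the smoothing window should be checked before reading the timing or size of any peak.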

### One workflow you will repeat
(Example format: *state the claim → list tasks → check units/denominators → verify axes → validate encodings → check color/accessibility → revise*)


## Summary Table (edit this cell)

| Case | Main pitfall | Why it misleads | Best fix | Proposed redesign |
|---:|---|---|---|---|
| 1 | Misinterpreting the "Cone of Uncertainty" as the physical size or footprint of the storm. | Widening geometric shape can suggest an increase in storm size as time passes | Use a gradient edge for the cone to indicate that the storm does not end at the line border. | Spaghetti plots in the cone to indicate many possible paths |
| 2 | Lagging indicator misinterpretation | Viewers wrongly infer that unemployment peaks and ends within the shaded recession bars, missing the fact that peaks often occur after the recession technically ends. | Add a rule of thumb and contextual annotations to clarify that unemployment is a lagging indicator. | Add a vertical reference line at the peak and a contextual label explaining the delay in labor market recovery relative to the recession window |
| 3 | Legend hunting and cluttered line chart | Meaningful differences may be overlooked because overlapping lines appear too similar and require constant reference to the legend | Reduce visual clutter by separating the data or simplifying the number of series shown | Use small multiples and highlight key countries to reduce clutter and emphasize important trends |
| 4 | False precision caused by mapping preliminary data and final data to the same solid line | Viewers might see the sharp drop at the end as a confirmed trend, failing to realize those values are uncertain and subject to revision. | Add redundant encoding to the line itself and use direct labels to reduce the load on working memory | Change the solid line to a dashed line in the gray area and add a shaded interval to show the distribution of possible outcomes |
| 5 | Encoding mismatch | Viewers perceive area rather than radius as the main encoding; because area grows with the square of the radius, differences between values are exaggerated | Switch to linear encoding (length/position) and use distinct labels. This ensures the visual magnitude matches the data | A line chart with months on the x-axis and mortality rate on the y-axis |


## Rubric (20 points)

- **Case critiques (15 pts total; 3 pts each × 5 cases):**
  - (1) Correctly identifies claim + task
  - (1) Diagnoses the main perception/encoding issue
  - (1) Proposes concrete redesign fixes
- **Synthesis + summary + references (5 pts):**
  - Rules of thumb are actionable
  - Summary table is complete
  - References are complete and consistently formatted


# References

National Hurricane Center. (n.d.). *Tropical cyclone track and watches/warnings: Cone of uncertainty example* [Image]. National Oceanic and Atmospheric Administration. Retrieved January 24, 2026, from https://www.nhc.noaa.gov/images/cone_5day_with_wind.png

Federal Reserve Bank of St. Louis. (n.d.). *Unemployment Rate (UNRATE)* [Chart]. FRED. Retrieved January 24, 2026, from https://fred.stlouisfed.org/series/UNRATE

Our World in Data. (n.d.). *CO₂ emissions per capita* [Chart]. Retrieved January 24, 2026, from https://ourworldindata.org/grapher/co-emissions-per-capita

Centers for Disease Control and Prevention. (2026). *Long-term care influenza hospitalization rate among residents, reported to NHSN, national summary, 2025–2026 season* [Chart]. CDC FluView. Retrieved January 24, 2026, from https://www.cdc.gov/fluview/media/images/2026/01/LTCF02.gif

Wellcome Collection. (n.d.). *Army in the East* [Chart image]. In *Mortality of the British army: at home and abroad, and during the Russian war, as compared with the mortality of the civil population in England; illustrated by tables and diagrams* (Wellcome Collection item b20452433, plate b20452433_0050). Retrieved January 24, 2026, from https://wellcomecollection.org/works/gxtkyqp8
