-
Notifications
You must be signed in to change notification settings - Fork 0
3.2 Using Data Journalism Techniques to Advance Data Literacies
This guide adapts and expands introductory material on data journalism into a practical resource for educators, learners, and practitioners seeking to develop data literacies through hands-on, inquiry-driven approaches.
Adaptation note: This guide is adapted from ILDA – “¿Qué es el Periodismo de Datos?” and enriched with concepts from the Data Journalism Handbook. https://datajournalism.com/read/handbook/two/working-with-data/experiencing-data/data-methods-in-journalism
Data journalism is best understood as an evolution of traditional journalism in a digital age. Rather than relying solely on interviews, observations, or anecdotal evidence, it uses data as both a source and a lens for storytelling. Data represents real-world phenomena—people, behaviours, institutions—and journalists analyse it to uncover patterns, trends, and hidden issues. Technology plays a key role by enabling journalists to process large datasets, automate analysis, and present findings through compelling visual formats such as maps, charts, and interactive graphics. However, data alone is not enough: meaningful data journalism combines analytical insights with contextual reporting, ensuring that stories remain grounded in human experience and public interest. Ultimately, it is journalism that is evidence-driven, investigative, and enhanced by digital tools. See
- https://www.thebureauinvestigates.com/explainers/what-is-data-journalism
- https://infogram.com/blog/data-journalism-definition-examples/
- Data journalism is journalism that uses data, technology, and analysis to find, understand, and tell stories
- It treats data as information about people, communities, and systems, not just numbers
- It is a form of reporting that uses quantitative data to uncover trends, patterns, and insights that might otherwise be hidden
- It relies on empirical evidence (data) rather than anecdotal reporting, offering a more objective perspective
At its core, it transforms raw data into meaningful narratives that are accessible to wider audiences.
Data journalists act as a bridge between data and the public.
| Skill | Description |
|---|---|
| Critical thinking | Asking questions of data and interpreting meaning |
| Data handling | Cleaning, structuring, and analysing datasets |
| Communication | Presenting findings clearly through stories |
| Visual literacy | Understanding charts, maps, and graphics |
flowchart TD
A["Define Question<br/>Frame the problem"] --> B["Collect Data<br/>Gather relevant datasets"]
B --> C["Clean Data<br/>Prepare and structure data"]
C --> D["Analyse Data<br/>Explore patterns and trends"]
D --> E["Interpret Findings<br/>Develop insights"]
E --> F["Visualise<br/>Represent data visually"]
F --> G["Tell Story<br/>Communicate findings"]
%% Soft gradient-inspired pastel styling
classDef step1 fill:#F3E5F5,stroke:#CE93D8,color:#000;
classDef step2 fill:#E1F5FE,stroke:#81D4FA,color:#000;
classDef step3 fill:#E8F5E9,stroke:#A5D6A7,color:#000;
classDef step4 fill:#FFF3E0,stroke:#FFCC80,color:#000;
classDef step5 fill:#FCE4EC,stroke:#F48FB1,color:#000;
class A step1;
class B,C step2;
class D,E step3;
class F step4;
class G step5;
| Category | Technique | Description |
|---|---|---|
| Core Technique | Data collection | Collecting datasets from sources such as open data portals and FOI requests (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
| Core Technique | Data analysis | Examining data to find patterns, trends, and anomalies (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Core Technique | Data cleaning | Preparing and fixing messy, incomplete, or inconsistent data (see https://ejc.net/resources/doing-journalism-with-data-first-steps-skills-and-tools) |
| Core Technique | Story discovery | Identifying newsworthy insights and angles from datasets (see https://ejc.net/resources/doing-journalism-with-data-first-steps-skills-and-tools) |
| Core Technique | Data visualisation | Presenting data using charts, maps, and dashboards (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Core Technique | Combining data with reporting | Integrating data with interviews, context, and local knowledge (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
| Skill Area | Skill | Description |
|---|---|---|
| Key Skill | Data analysis | Ability to interpret large datasets and identify meaningful patterns (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Key Skill | Visualisation | Presenting data through charts, graphs, maps, and infographics (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Key Skill | Coding (optional) | Automating tasks, scraping data, and cleaning datasets (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Key Skill | Critical thinking | Evaluating data quality, bias, and ethical considerations (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Key Skill | Communication & storytelling | Turning complex data into clear, engaging narratives (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Category | Tool Type | Examples / Description |
|---|---|---|
| Data Analysis & Handling | Spreadsheets | Excel, Google Sheets |
| Data Analysis & Handling | Coding tools | Python, R, scraping scripts (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
| Data Visualisation | Visualisation platforms | Infogram (charts, maps, dashboards) (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Data Visualisation | Interactive media | Infographics, interactive charts and graphics (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Data Sources | Open data platforms | https://data.gov/ (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Data Sources | Global datasets | https://data.worldbank.org/ (see https://infogram.com/blog/data-journalism-definition-examples/) |
| Data Sources | Public records | FOI (Freedom of Information) requests (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
| Newsroom Tools & Techniques | Automation | Bots for automated reporting (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
| Newsroom Tools & Techniques | Data extraction | Web scraping tools (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
| Newsroom Tools & Techniques | Participation | Crowdsourcing data collection (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism) |
- What issue are you investigating?
- What do you want to understand or explain?
Example:
- How has air pollution changed over time in my city?
Good data should be:
- Relevant to social or economic topics
- From reliable sources
- Reusable (open licences)
Sources include:
- Open data portals
- Government datasets
- Research repositories
Checklist:
- Are dates consistent?
- Are there missing values?
- Are columns structured correctly?
| Issue | Action |
|---|---|
| Missing values | Fill, remove, or explain |
| Inconsistent formats | Standardise |
| Duplicate entries | Remove |
Common techniques:
- Sorting (rankings)
- Filtering (focus on subsets)
- Aggregation (pivot tables)
Enhancement strategies:
- Add geographic coordinates
- Combine datasets
- Create indicators (rates, averages)
| Goal | Recommended Visual |
|---|---|
| Compare values | Bar chart |
| Show proportions | Pie chart |
| Show trends | Line graph |
| Show spatial data | Map |
A good data story should:
- Be accurate and transparent
- Cite sources and methods
- Be tailored to the audience
- Use engaging narratives
flowchart LR
A["Define Question<br/>What do you want to investigate?"] --> B["Collect Data<br/>Find relevant, reliable, reusable data"]
B --> C["Clean & Prepare Data<br/>Fix formats, missing values, structure"]
C --> D["Analyse Data<br/>Sort, filter, aggregate"]
D --> E["Enrich Data<br/>Combine datasets, add context"]
E --> F["Visualise Data<br/>Charts, maps, graphs"]
F --> G["Tell the Story<br/>Communicate insights clearly"]
%% Soft pastel progression (left to right gradient feel)
classDef step1 fill:#F3E5F5,stroke:#CE93D8,color:#000;
classDef step2 fill:#E1F5FE,stroke:#81D4FA,color:#000;
classDef step3 fill:#E8F5E9,stroke:#A5D6A7,color:#000;
classDef step4 fill:#FFF3E0,stroke:#FFCC80,color:#000;
classDef step5 fill:#FCE4EC,stroke:#F48FB1,color:#000;
class A step1;
class B step2;
class C step2;
class D step3;
class E step3;
class F step4;
class G step5;
Telling the story is the culminating stage of the data journalism workflow. At this point, the goal is not simply to present results, but to translate data-driven insights into meaningful narratives that audiences can understand, question, and act upon.
A strong data story connects evidence with context. It explains not only what the data shows, but also why it matters and who it affects. In educational settings, this stage is particularly powerful, as it supports learners in moving from analysis to communication and reflection.
A good data story should:
- Be accurate and transparent
- Clearly cite sources and methods
- Be tailored to the intended audience
- Use engaging and accessible narratives
Beyond communication, storytelling opens up opportunities for applied learning, where learners actively use data to explore real-world issues.
These activities help bridge data analysis and real-world application, reinforcing both technical and critical data literacy skills.
Activity 1: Comparing Regions
Learners investigate how a particular issue varies across different locations.
- Select a social issue (e.g., unemployment, pollution, healthcare access)
- Compare across geographical regions
- Standardise the data (e.g., per capita or percentage measures)
- Interpret differences and consider underlying causes
Activity 2: Tracking Change Over Time
Learners explore how a phenomenon evolves, developing temporal understanding of data.
- Analyse trends over time (e.g., vaccination rates, employment levels)
- Identify patterns, spikes, or declines
- Detect anomalies or unexpected changes
- Reflect on possible explanations (policy, events, external factors)
Activity 3: Mapping Local Issues
Learners connect data to place-based contexts, making learning more relevant and participatory.
- Map environmental, social, or infrastructure challenges
- Use geographic data to visualise distribution
- Engage with community knowledge or citizen-generated data
- Reflect on local impact and potential interventions
flowchart LR
Q([Ask Question]) --> D[/Data Collection/]
D --> P[Preparation]
P --> A["(Analysis)"]
A --> C{Critical Interpretation}
C --> V["[Visualisation]"]
V --> S[/Storytelling/]
S --> R([Reflection & Literacy Development])
%% Pastel styling with black text
classDef q fill:#F3E5F5,stroke:#CE93D8,color:#000;
classDef d fill:#E1F5FE,stroke:#81D4FA,color:#000;
classDef p fill:#E8F5E9,stroke:#A5D6A7,color:#000;
classDef a fill:#FFF3E0,stroke:#FFCC80,color:#000;
classDef c fill:#FCE4EC,stroke:#F48FB1,color:#000;
class Q,R q;
class D,S d;
class P p;
class A a;
class C,V c;
-
Blood doping in athletics http://features.thesundaytimes.co.uk/web/public/2015/the-doping-scandal/index.html#/ (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism)
-
UK foreign aid spending investigation https://medium.com/digital-times/foreignaid-investigation-behind-the-numbers-68ddfb01d542 (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism)
-
Missing asylum-seeking children investigation https://www.buzzfeed.com/maevemcclenaghan/hundreds-of-children-seeking-uk-asylum-are-disappearing-from (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism)
-
Fishing rights monopolisation http://energydesk.greenpeace.org/2016/05/15/investigation-big-fish-quota-barons-squeeze-out-small-scale-fishermen/ (see https://www.thebureauinvestigates.com/explainers/what-is-data-journalism)
-
Panama Papers (ICIJ) https://www.icij.org/investigations/panama-papers/ (see https://infogram.com/blog/data-journalism-definition-examples/)
-
The Guardian – “The Counted” (police killings database) https://www.theguardian.com/us-news/series/counted-us-police-killings (see https://infogram.com/blog/data-journalism-definition-examples/)
-
New York Times – The Upshot https://www.nytimes.com/international/section/upshot (see https://infogram.com/blog/data-journalism-definition-examples/)
-
USA Today presidential longevity visualization https://eu.usatoday.com/in-depth/graphics/2023/02/18/carters-long-life-how-longevity-compares-presidents/8126321002/ (see https://infogram.com/blog/data-journalism-definition-examples/)
-
FOX timeline infographic (DC sniper case) https://www.fox5dc.com/news/dc-sniper-shootings-20-years-later-an-interactive-timeline-dc-snipers-beltway-podcast-three-weeks-of-hell (see https://infogram.com/blog/data-journalism-definition-examples/)
-
USA Today Oscars box office data visualisation https://eu.usatoday.com/in-depth/graphics/2023/03/08/oscars-best-picture-winners-box-office/11386099002/ (see https://infogram.com/blog/data-journalism-definition-examples/)
-
News18 US midterm elections dashboard https://www.news18.com/topics/us-midterm-elections/ (see https://infogram.com/blog/data-journalism-definition-examples/)