Skip to content

jsoma/2026-databases

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data and Databases Curriculum Revamp

Second-semester core course in the Columbia Journalism School Data Journalism MS program. This course teaches students to work with datasets that require more than pandas and a laptop - government databases with millions of records, document dumps from investigations, and long-term data projects that multiple journalists need to access.

Under active development and so, so, so much of the writing is lazily done via Claude Code. Have I verified that any of these investigations it's talking about exist? Absolutely not!

Course Materials

Core Materials

  • 📚 Curriculum: Week-by-week topics, concepts, and skills
  • 📝 Assignments: Progressive exercises with Foundation/Extension/Innovation tiers
  • 🔧 Tech Stack: All tools and technologies with documentation links
  • 📖 Readings: Investigations and methodologies for each week
  • 🏃 Speed Run: Self-study guide for learning the tech stack

Quick Navigation

By Week

Weeks 1-2: Large, Large Databases

Weeks 3-4: Cloud Infrastructure

  • "You just shared a giant database with zero configuration"
  • Datasette Cloud, Backblaze B2, DigitalOcean
  • AssignmentReadingsTech

Weeks 5-6: Long-term data projects

  • "Your scraper ran automatically while you slept"
  • GitHub Actions scrapers, versioning, documenting
  • AssignmentReadingsTech

Weeks 7-8: Collaborative Investigation

  • "Cross-newroom, cross-border, cross-language, cross-everything investigations"
  • OpenAleph, DocumentCloud, Datasette
  • AssignmentReadingsTech

Week 9: Graph Databases

Weeks 10-11: Public-Facing Tools

  • "Your investigation tool is live on the internet"
  • Flask, Jinja2, render.com
  • AssignmentReadingsTech

Weeks 12-13: AI Document Processing

  • "AI just read 100 documents in 30 seconds"
  • LLMs via API, NotebookLM, LM Studio
  • AssignmentReadingsTech

Week 14: Sustainability & Handoffs

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published