A collection of data pipelines for exploring public datasets.
- berlin-weather — Historical weather data for Berlin
- stackoverflow-trends — Stack Overflow activity trends (2008–present)
- google-trends — Google search trends analysis
- nyc-taxi — NYC taxi trip data
- flightradar24 — Flight tracking data
- chess-dot-com — Chess.com game analytics
- Bruin — Pipeline orchestration, data quality, and materialization
- BigQuery — Data warehouse (+ public datasets)
- Python / Pandas — Raw data ingestion from APIs
- SQL — Staging transformations and aggregations
- Streamlit / Altair — Interactive dashboards
- BigQuery public datasets (Stack Overflow, Google Trends)
- Stack Exchange API
- Open-Meteo API
- NYC TLC trip record data
- Chess.com API
- Flightradar24