Skip to content

stopsack/ser2025_datascience

Repository files navigation

Epidemiology is a data science – if only epidemiologists knew: Success stories of modernizing data science practice and education in epidemiology

Symposium description

Most applied research and education in epidemiology does not yet benefit from modern data science. Fledgling epidemiologists may receive cutting-edge education on the theory of epidemiologic methods, but remain largely untrained in how to collect data effectively, how to apply modern analytical methods to real data sets, how to reproducibly document code and results, and how to effectively work in teams in a digital workplace. Despite their own nagging concerns, they may rely on Dr. Google as their training on algorithms, document study procedures in e-mail chains, store data in spreadsheets, copy-paste analytical code, hard-code observations per person into separate variables, and manually type out estimates into results tables – only to discover that they are requested to do it all over when three study participants turn out to be ineligible for an analysis.

This symposium will illustrate success stories on how to efficiently practice data science in epidemiology and how to teach it along the way. There will be no exhortations how Excel is bad and that good people practice code sharing. Instead, the symposium will discuss cutting-edge approaches and real-life use cases of how modern data science has made research and teaching more efficient. The goal is for attendees to bring home a sparkling, vetted toolkit of new ideas and tools for research and teaching.

Allocation of time

Total 90 minutes

Overview and Introduction

Use Case 1: No more hand-typed numbers – Epidemiologic results tables in 2025

Slides

Konrad Stopsack (20 min)

Emily watches Whova Q&A

Use Case 2: No more moving around arrows on slides – DAGs, causal workflows, and how to build them with large-language models in 2025

Travis Gerke (20 min)

Slides

Malcolm watches Whova Q&A

Use Case 3: No more digging through e-mails to understand analyses – Project organization and metadata in 2025

Emily Riederer (20 min)

Slides - Live Slides - PDF

Travis watches Whova Q&A

Wrap up and Use Case 4: No more just hoping that code does what it should – Making code review efficient and educational in 2025 + GitHub including Harvard

Slides

Malcolm Barrett (20 min)

Konrad watches Whova Q&A

Q&A (10 min)

Konrad watches Whova Q&A

About

SER Symposium: Epidemiology is a data science

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages