This repository is a synthetic World Bank-style coding-agent training project. It teaches how reusable agent skills improve output on realistic development research and operations tasks.
Participants who cannot use git can download
lumora-agent-skills-training-demo.zip from the repository root and unzip it
locally.
The mock study is the Lumora Family Support Grant Evaluation, a fictional cash transfer program with baseline, follow-up, payment, field visit, survey log, and back-check data.
- Synthetic raw CSV data in
data/raw/ - Participant-facing Stata pipeline in
code/stata/ - Pre-generated Stata tables and figures for the manager deck exercise in
outputs/ - A baseline deck for participants to update:
decks/baseline_round1_deck.pptx
- A source library of agent skills in
skills/ - Exercise-specific skill setup instructions in
.agents/README.md - Role-specific exercises in
exercises/
Open Stata from the project root and run:
do "code/stata/MasterDoFile.do"The Stata pipeline imports raw CSVs, writes intermediate .dta files, creates
the processed household panel, exports tables, figures, and QA notes.
Expected outputs include:
data/intermediate/*.dtadata/processed/analysis_household_panel.csvoutputs/tables/*.csvoutputs/figures/targeting_funnel.pngoutputs/figures/payment_provider_delay.pngoutputs/figures/welfare_change.pngoutputs/figures/enumerator_quality_flags.pngoutputs/qa_reports/stata_pipeline.log
Each exercise uses the same before/after workflow:
- clear
.agents/skills/and run the task with a general coding agent; - evaluate the output against the exercise checklist;
- copy in only the role-specific skills;
- run the same substantive task again;
- compare what changed and what still needs human judgment.
See exercises/comparison_prompts.md for the shared scorecard.
The source skills live in skills/. For the workshop, copy only the skills
needed for the current exercise into .agents/skills/.
See .agents/README.md for role-specific copy commands.
You can run a baseline prompt first, then activate the relevant skill set and compare the difference.
Managers:
- update
decks/baseline_round1_deck.pptxwith follow-up outputs; - inspect the Stata do-files first;
- produce an update log mapping slide claims to generated outputs.
Field coordinators:
- run high-frequency data quality checks;
- produce an actionable follow-up list before the next payment cycle.
Research assistants:
- clean and construct the household panel;
- validate IDs and merges;
- generate tables and figures reproducibly.
All data are synthetic. Welfare changes are descriptive monitoring evidence, not causal estimates of program impact.