Data extraction and analysis project for stenographic protocols of sessions in the Austrian parlament. See woswormeileistung.marioslab.io for more information.
You can run the data extraction pipeline locally to generate the persons.json
and sessions.json
files yourself.
- Install NodeJS +19
- Run
npm run build
- Run
node build/process-data.js
You can find these files in the data/
directory after successful completion:
persons.json
: all persons who spoke in parlament, with or without mandate, since December 2002sessions.json
: all sessions since December 2002plaque.json
: list of persons who put plaques on the speakers podium during their speach, including the number of plaques and their content if availablemissing.json
: list of sessions, each with a list of persons who were reported abscent during that session, as well as the session transcript source text from which abscences were extracted.screamers.json
: list of call outs by persons during speeches of other persons.
- Install & run Docker
- Install NodeJS +19
npm run dev
In VS Code run the `dev`` launch configurations.