What does it do? 🖨

This code is the result of a Civic Tech hackathon in Göteborg in September 2018 🤓. It was written by Emil Hemdal, Albin and Pierre Mesure.

What does it do? 🖨

This is a script that scraps data from regeringen.se. For now, it only looks for remisser, lists them and builds a dataset containing information about these remissprocesses.

You can read more on how to run the script and which data it saves.

The result is saved in an SQL database.

Why? 📕📗📘

Regeringskansliet, the Swedish government's chancellery, has very little open data. Theoretically, all of their documents are available publicly on regeringen.se, but a weak search engine (which doesn't search for keywords through the content of the page or documents) and the lack of structured data (most of the documents are available as PDF with various formattings) make it virtually impossible for journalists, researchers and activists from civil society to delve in this information and scrutinize the government's actions.

Apart from this valuable work by counterpowers, this lack of open structured data also prevents civil servants from working efficiently and other public agencies to use the information released.

The Swedish parliament, Riksdagen, includes some of the government's documents on their website (bills, SOUs) but they get this as PDF from Regeringskansliet and have to parse the content to display it, making it harder for them to inform citizens.

The end result is impossible to use in a good way. Formatting is lost, page numbers and tables get inserted in the text and no structure is preserved. See example here.

Goal 🎯

Our goal with this small script is to show how ridiculously hard it is to find simple information such as:

which organisations does the government ask?
which organisations answer?
how many times have they asked each organisation and how many times do they answer?

Statistical information would already help counterpowers to say who the government is picking to help them to shape a bill. Big interest groups? NGOs? Local associations? These are important democratic issues.

And that would only require a structured list of remisser and remisssvar.

If Regeringskansliet was also publishing remissvar's content in a structured format, techniques such as semantic analysis could be applied to analyze the proximity of a remissvar and a bill to determine which organisations influenced most. This is already done to study the impact of lobbying in the EU and the US parliaments.

First results 📊

** EDIT: The script has now been improved and you can find better updated results in this spreadsheet.** The hackathon was just one day and we realised while doing it that the quality of the "data" was poorer than expected:

regeringen.se only shows the last 894 remisser, the oldest one being from the 02/04/2015. We harvested the name and link of 23353 attached files.
we used the file names as the name of the organisation (for lack of something better) and although that worked most of the time, a lot of names include a typo (Stockholm universitet instead of Stockholms universitet), a number (35. Stockholms universitet) or something else which requires manual cleaning of the data.
we had no choice but to group local branches of national organisations (LO Dalarna becoming LO) and suborganisations (Sverigesingenjörer, Sveriges Läkarförbund, etc. becoming SACO, which explains the high number for them as they usually send their answers separately for each remiss)

However, we managed to get a shortlist of organisations sorted by which answers the most to remisser, and we filtered it manually to show only interest groups:

🏆

Rank	Organisation	#
🥇	Sveriges advokatsamfund	124
🥈	Svenskt näringsliv	118
🥉	SACO	115
4	LO	86
5	Företagarna	80
6	Lantbrukarnas Riksförbund	73
7	Fastighetsägarna	57
7	Skogsindustrierna	54
9	Näringslivets Regelnämnd	52
10	Avfall Sverige	48
10	Svenska Bankföreningen	48
12	Sveriges Byggindustrier	47
13	Svenska kyrkan	43
14	Naturskyddsföreningen	40
15	Svensk Handel	39
16	Energigas Sverige	33
17	Energiföretagen Sverige	30
18	Svensk Försäkring	30
19	BIL Sweden	28
20	Villaägarnas Riksörbund	28

You can run the script yourself to get the raw data or find it in this Google spreadsheet. Don't hesitate to use it to make your own rankings and visualisations.

What does that show us? 🧐

There's really no conclusion we can jump to by seeing that Svenskt näringsliv answers to more remisser than LO. But as basic as this information can be, the fact that you have to know how to code and spend a Sunday in front of a screen to access it shows us a lack of transparency.

In just one day, we didn't manage to extract, clean and restructure enough data to make conclusions on which organisations the government listens to most. But we hope it will motivate others to dive into more of this data and create more robust models to answer that question and more!

How can I contribute? 🙌

Did we manage to get your attention with this small demo? Do you want to see more insights on all the information that Regeringskansliet has but doesn't publish in a usable format?

There are 3 ways you can help:

if you can script, fork this project and try to fetch and analyse more data from regeringen.se! Write some code to clean the data or to visualise it! 👩🏽‍💻
if you can't but have a profession/occupation which would benefit from having this data available (journalist, politician, researcher), contact the government to ask them to release it! Explain to them what you could do with it. 👨🏻‍⚕️
ask the government to reform its offentlighetsprincip to require that any piece of public data be available online in a structured format. Canada, the UK, Germany or France are doing it, it's time for Sweden to catch up! 🙋🏻‍♀️

And don't hesitate to contact us, we love hearing from opengov enthusiasts! ❤️

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
api		api
database		database
docs		docs
service		service
test/service		test/service
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
build_remissinstans_list.py		build_remissinstans_list.py
clean_data.py		clean_data.py
download_data.py		download_data.py
illustration.svg		illustration.svg
requirements.txt		requirements.txt
setup.py		setup.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What does it do? 🖨

Why? 📕📗📘

Goal 🎯

First results 📊

🏆

What does that show us? 🧐

How can I contribute? 🙌

About

Releases

Packages

Contributors 2

Languages

License

DinRiksdag/OpenRemiss

Folders and files

Latest commit

History

Repository files navigation

What does it do? 🖨

Why? 📕📗📘

Goal 🎯

First results 📊

🏆

What does that show us? 🧐

How can I contribute? 🙌

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages