Skip to content

j-jayes/mantalsregister-1909

Repository files navigation

Readme

Mantalsregister 1909

Purpose

Scraping data from Stockholms city archive for Anton.

Link

Source

It is digitized and in a nice table on the website of Stockholm city archives.

This is what it looks like:

Data

If you want the data, you can download it as:

1855 data including birth year

The data from 1855 also has birth year included. I have scraped this in a separeate file and link it for download here in excel format.

Scraping

The scraper is in in the code folder.

Summary statistics

My scraped data shows that I have are:

Number of records per year
Scraped vs web portal
Year On web portal Scraped Difference
1800 45 838 45 837 1
1810 40 003 39 982 21
1820 43 416 43 396 20
1830 38 953 38 948 5
1840 27 312 27 310 2
1850 33 802 33 798 4
1860 41 264 41 254 10
1870 73 899 73 878 21
1880 111 130 111 130 0

Where are missing records??

Where there are repeated records on the same page, these are removed when I scrape them.

For example, Sven Fagerlund is repeated in 1870 on page 384.

For this reason, the number of records differs a small amount from the data portal and the scraped data.

Most common titles in each year are:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published