Scraping data from Stockholms city archive for Anton.
It is digitized and in a nice table on the website of Stockholm city archives.
This is what it looks like:
If you want the data, you can download it as:
Download data | |
Stata format | |
CSV | |
RDS |
The data from 1855 also has birth year included. I have scraped this in a separeate file and link it for download here in excel format.
The scraper is in in the code folder.
My scraped data shows that I have are:
Number of records per year | |||
Scraped vs web portal | |||
Year | On web portal | Scraped | Difference |
---|---|---|---|
1800 | 45 838 | 45 837 | 1 |
1810 | 40 003 | 39 982 | 21 |
1820 | 43 416 | 43 396 | 20 |
1830 | 38 953 | 38 948 | 5 |
1840 | 27 312 | 27 310 | 2 |
1850 | 33 802 | 33 798 | 4 |
1860 | 41 264 | 41 254 | 10 |
1870 | 73 899 | 73 878 | 21 |
1880 | 111 130 | 111 130 | 0 |
Where there are repeated records on the same page, these are removed when I scrape them.
For example, Sven Fagerlund is repeated in 1870 on page 384.
For this reason, the number of records differs a small amount from the data portal and the scraped data.
Most common titles in each year are: