This repository contains data showing the "most popular" datasets of 2018 from the Berlin Open Data portal at https://daten.berlin.de.
"Popularity" here only means the number of page views of the dataset details page for a particular dataset. This number does not necessarily give any indication of how often the data itself was accessed or used.
The repository accompanies a slightly more comprehensive article looking back at 2018 at https://daten.berlin.de/interaktion/artikel/jahresrueckblick-2018.
The following data files are contained:
- daten_berlin_de.page_stats.datensaetze.summed.csv - A cleaned and annotated version of https://daten.berlin.de/sites/default/files/data/berlin_dataportal_usage/daten_berlin_de.page_stats.datensaetze.csv as of 2019-01-01.
- daten_berlin_de.page_stats.cleaned.2018.xlsx - An Excel file based on the CSV file, containing only the data for 2018, and with additional formulas for the year total and rank.
- January: Liste der häufigen Vornamen 2016
- February: Liste der häufigen Vornamen 2017
- March: Liste der häufigen Vornamen 2017
- April: Liste der häufigen Vornamen 2017
- May: Liste der häufigen Vornamen 2017
- June: Liste der häufigen Vornamen 2017
- July: ALKIS Berlin (Amtliches Liegenschaftskatasterinformationssystem)
- August: ALKIS Berlin (Amtliches Liegenschaftskatasterinformationssystem)
- September: ALKIS Berlin (Amtliches Liegenschaftskatasterinformationssystem)
- October: ALKIS Berlin (Amtliches Liegenschaftskatasterinformationssystem)
- November: ALKIS Berlin (Amtliches Liegenschaftskatasterinformationssystem)
- December: VBB-Fahrplandaten via GTFS
The CSV and Excel file contained here were generated based on the raw data from the Usage Statistics dataset. However, the raw data was cleaned by summing certain entries that (should) belong the same actual dataset. This was necessary because a dataset's URL in the portal (for which we have page view stats) can change when the internal id of the dataset changes. E.g., the id of the dataset ALKIS Berlin (Amtliches Liegenschaftskatasterinformationssystem) changed from alkis-berlin
to alkis-berlin-0
and finally to alkis-berlin-amtliches-liegenschaftskatasterinformationssystem
over time.
The Ruby script sum_rows.rb implements this cleaning process. The input to this script is https://daten.berlin.de/sites/default/files/data/berlin_dataportal_usage/daten_berlin_de.page_stats.datensaetze.csv.
All software in this repository is published under the MIT License. All data in this repository (in particular the .csv
and .xlsx
files) is published under CC BY 3.0 DE.
Dataset URL: https://daten.berlin.de/datensaetze/datenportal-jahresruckblick-2018
(based on Zugriffsstatistik daten.berlin.de, published under CC BY 3.0 DE)
2018, Knud Möller, BerlinOnline Stadtportal GmbH & Co. KG