Citybike usage statistics #1
The HSL Developer Community page links to Citybike usage statistics (warning: page over 8 MB) and mentions they are "experimental". I downloaded them and thought I'd share a few things I noticed.
The data is available as small JSON files with timestamps included in the filenames. Some of them are packaged into zip files (at the moment for May, June, and July) and the rest are listed directly on that page. The first available file is
The timestamp in the file name represents 2016-05-10 at 09:56:01, and if I recall correctly, the system opened on 2 May, so some data is missing from the very beginning. As shown above, all files include three statistics:
Some apparent issues:
Leaving out the month of May, the data looks like this:
The data is clearly not increasing monotonically as one would expect for cumulative totals.
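A minimal sketch of how such decreases could be located, assuming the samples have already been loaded as `(timestamp, total)` pairs sorted by time. The function name `find_decreases` and the sample values are made up for illustration and are not taken from the real files.

```python
from datetime import datetime, timezone

def find_decreases(samples):
    """Return (timestamp, previous, current) for every sample whose
    value is lower than the value of the sample just before it."""
    drops = []
    for (_, prev), (ts, cur) in zip(samples, samples[1:]):
        if cur < prev:
            drops.append((ts, prev, cur))
    return drops

# Made-up values for illustration only, NOT real Citybike data.
samples = [
    (datetime(2016, 5, 10, 20, 0, tzinfo=timezone.utc), 100),
    (datetime(2016, 5, 10, 20, 30, tzinfo=timezone.utc), 120),
    (datetime(2016, 5, 10, 21, 0, tzinfo=timezone.utc), 105),
    (datetime(2016, 5, 10, 21, 30, tzinfo=timezone.utc), 110),
]

drops = find_decreases(samples)
for ts, prev, cur in drops:
    print(f"{ts.isoformat()}: {prev} -> {cur}")
```

A truly cumulative total would produce an empty list here, so any entry points at a reset or correction in the source data.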
To see in more detail how those decreasing values happen, here is a look at the few very first days, from 10 May until 14 May:
The transition between 11 and 12 May looks as one would expect: most rentals happen in the daytime, with a slowdown during the night. All the other transitions, however, look odd: the numbers drop and pretty much erase the day's increase in the total. These drops seem to happen at exactly 21:00:00 UTC, which corresponds to midnight in Finland during daylight saving time (UTC+3).
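The time zone arithmetic can be checked with a short sketch. It assumes the file timestamps are in UTC and uses a fixed UTC+3 offset for Finnish summer time (EEST). The date is just one of the days mentioned above, and the variable names are illustrative.

```python
from datetime import datetime, timedelta, timezone

# Finnish summer time (EEST) is UTC+3. A fixed offset is enough here because
# the downloaded range (10 May to 18 October 2016) lies within summer time.
EEST = timezone(timedelta(hours=3), name="EEST")

# Assumed: one of the drop timestamps, interpreted as UTC.
utc_drop = datetime(2016, 5, 11, 21, 0, 0, tzinfo=timezone.utc)
local_drop = utc_drop.astimezone(EEST)

print(utc_drop.isoformat())
print(local_drop.isoformat())
```

For data that also covers the winter months, a real time zone database entry (for example `zoneinfo.ZoneInfo("Europe/Helsinki")` in Python 3.9+) would be a better fit than a fixed offset, since local midnight then falls at 22:00 UTC.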
I'm not sure what to take from this, but just thought I'd share it. The kind folks at HSLdevcom created this repository so that I could post the issue here.
If anyone is interested, the data I downloaded (from 2016-05-10 09:56:01 until 2016-10-18 17:38:01) is available here: citybike-stats.csv.gz