I have just downloaded data for February and found problems in the following files:
There is at least one case in each file where 2 records are not separated by new line.
BTW I can send you full report with probematic records if you need
I hope it helps :-)
Looking at the data I found that repository URI is:
Is there any difference between them? It looks that It should be safe to merge events from both URIs. But maybe I'm missing something?
It looks like almost all repositories have both types. But I'm not sure about it since I just manually checked few most popular
Sigh, thought I caught that one.. Is that in 2012 archives?
The problem with Uri is in february and march 2012
I find the same problem for the files in June and July 2012
@chengjun see #53 (comment)