This code is Java code based on sample code to connect to the Wikipedia IRC stream of recent changes and dump the output to text files. It was used to monitor the recent changes to Wikipedia in multiple language editions and the results published in a recent article.
The code creates one file per day with a filename in the format of irc-logger--yyyy-mm-dd--mm--ss.txt. Filenames are rotated around midnight localtime, but may be slightly later if the IRC channel is unusually quiet. The datetime each message is sent is recorded from the system clock and prepended to the line followed by a tab.
Adjust which channels (projects) you want to record in SimpleLogger.java. Compile and run WikipediaLogger.java to start capturing.
RecentChange.java provides example code to parse lines from the files.
If you use this code in support of an academic publication, please cite:
Hale, S. A. (2014). Multilinguals and Wikipedia Editing. http://arxiv.org/abs/1312.0976