Skip to content
This repository

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP

Analyzing contributors to Wikipedia's user-generated content

branch: master

no longer necessary to separate parsing pages-meta-history XML and co…

…mputing diffs; P-M-H is sharded to 170 files
latest commit 275f31167a
lsb authored September 27, 2012
Octocat-spinner-32 README looking production-ready May 27, 2011
Octocat-spinner-32 diffs.sh looking production-ready May 27, 2011
Octocat-spinner-32 mh-diffs.rb process all pages September 27, 2012
Octocat-spinner-32 parse-stubs.rb process all pages September 27, 2012
README
Howdy.

This accompanies "Who Writes Wikipedia", at
http://slightlynew.blogspot.com/2011/05/who-writes-wikipedia-information.html
if you'd like to reproduce the results at home.
Get your 7zip data from http://dumps.wikimedia.org/enwiki/20110317/
and your Ruby Enterprise Edition from http://rubyenterpriseedition.com/download.html
and SQLite from http://www.sqlite.org/download.html
and you're good to go.

Cheers,
Lee
Something went wrong with that request. Please try again.