An algorithm to compute token-level provenance and changes for revisioned wiki content. Tested at over 95% accuracy on the English Wikipedia.



An algorithm that identifies, for every token of a revisioned wiki document, the revision in which it originated and all changes ever applied to it. This enables, for example, detecting authorship and interactions between editors.
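To make the idea of a token's "revision origin" concrete, here is a deliberately simplified sketch. It is not the WikiWho algorithm: it swaps in Python's standard `difflib.SequenceMatcher` to diff consecutive revisions and only tracks the origin of tokens present in the latest revision (it does not record deletions or reintroductions). The function name `attribute_tokens` and the whitespace tokenization are assumptions made for illustration.

```python
from difflib import SequenceMatcher


def attribute_tokens(revisions):
    """Toy token provenance, NOT the WikiWho algorithm.

    revisions: list of (rev_id, text) tuples in chronological order.
    Returns a list of (token, origin_rev_id) for the last revision.
    """
    prev_tokens = []
    prev_origins = []
    for rev_id, text in revisions:
        # Assumption: lowercase, whitespace-separated tokens.
        tokens = text.lower().split()
        # By default every token is attributed to the current revision.
        origins = [rev_id] * len(tokens)
        matcher = SequenceMatcher(a=prev_tokens, b=tokens, autojunk=False)
        # Tokens inside a matching block were carried over unchanged,
        # so they inherit the origin they had in the previous revision.
        for block in matcher.get_matching_blocks():
            for k in range(block.size):
                origins[block.b + k] = prev_origins[block.a + k]
        prev_tokens, prev_origins = tokens, origins
    return list(zip(prev_tokens, prev_origins))


if __name__ == "__main__":
    history = [
        (1, "the cat sat"),
        (2, "the black cat sat down"),
    ]
    for token, origin in attribute_tokens(history):
        print(token, origin)
```

In this toy history, "black" and "down" are attributed to revision 2, while the remaining tokens keep revision 1 as their origin.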

Also check out the WikiWho API for current data from Wikipedia.

Requirements and Installation

WikiWho runs on both Python 2 and Python 3.

The requests package is required to get revision metadata and text from the Wikipedia API.
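As a rough illustration of what fetching revision metadata and text with requests can look like, the sketch below queries the public MediaWiki Action API directly. It does not show how WikiWho itself calls the API; the helper names (`build_params`, `parse_revisions`, `fetch_revisions`) and the User-Agent string are hypothetical, and the query parameters are one reasonable choice among several.

```python
API_URL = "https://en.wikipedia.org/w/api.php"


def build_params(title, limit=5):
    """Query parameters for the oldest `limit` revisions of a page."""
    return {
        "action": "query",
        "format": "json",
        "formatversion": "2",
        "prop": "revisions",
        "titles": title,
        "rvprop": "ids|timestamp|user|content",
        "rvslots": "main",
        "rvlimit": limit,
        "rvdir": "newer",  # oldest revision first
    }


def parse_revisions(payload):
    """Flatten an API response into a list of revision dicts."""
    revisions = []
    for page in payload.get("query", {}).get("pages", []):
        for rev in page.get("revisions", []):
            main_slot = rev.get("slots", {}).get("main", {})
            revisions.append({
                "revid": rev["revid"],
                "timestamp": rev["timestamp"],
                "user": rev.get("user"),
                "text": main_slot.get("content", ""),
            })
    return revisions


def fetch_revisions(title, limit=5):
    """Fetch and parse revisions over the network."""
    # Imported here so the pure helpers above work without requests installed.
    import requests

    # Hypothetical User-Agent; replace it with one identifying your tool.
    headers = {"User-Agent": "wikiwho-readme-example/0.1"}
    response = requests.get(
        API_URL, params=build_params(title, limit), headers=headers, timeout=30
    )
    response.raise_for_status()
    return parse_revisions(response.json())
```

Calling `fetch_revisions("Some page title")` would return a list of dicts with `revid`, `timestamp`, `user` and `text` keys, oldest revision first.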

The mwxml package is required to get revision metadata and text from XML dumps. This package runs only on Python 3.

Install the WikiWho package using pip:

pip install git+git://

Running WikiWho

See the example scripts under WikiWho/WikiWho/examples for how to run WikiWho.


  • Fabian Floeck: fabian.floeck[.]
  • Maribel Acosta: maribel.acosta[.]
  • Kenan Erdogan: kenan.erdogan[.]


This work is licensed under the MIT License.

Developed at Karlsruhe Institute of Technology and GESIS - Leibniz Institute for the Social Sciences.