Skip to content

Commit

Permalink
Add python mapper/reducer files to create diffs
Browse files Browse the repository at this point in the history
  • Loading branch information
Diederik van Liere committed Jan 23, 2013
1 parent 3e50088 commit ca7d2fe
Show file tree
Hide file tree
Showing 6 changed files with 2,822 additions and 0 deletions.
17 changes: 17 additions & 0 deletions python/README.md
@@ -0,0 +1,17 @@
Revision Differ

This script was written to be a streaming mapper for wikihadoop
(see https://github.com/whym/wikihadoop). By default, this script runs under
pypy (much faster), but it can also be run under CPython 2.7+.

Required to run this script:
- revision_differ.py (provided)
- diff_match_patch.py (provided)
- xml_simulator.py (provided)
- wikimedia-utilities (https://bitbucket.org/halfak/wikimedia-utilities)

Author: Aaron Halfaker (aaron.halfaker@gmail.com)

This software licensed as GPLv2(http://www.gnu.org/licenses/gpl-2.0.html) and
is provided WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
implied.

0 comments on commit ca7d2fe

Please sign in to comment.