Simple scripts that generate a reverse (inverted) index from a collection of text files, using tf-idf weights. The scripts also use a shingling technique to calculate text containment between the files in the collection. The TreeTagger English parameter file is available [here](http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html).
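
As a rough illustration of the two ideas described above, here is a minimal Python sketch. It is not the code from this repository. The function names, the whitespace tokenizer, the weighting (tf as count divided by document length, idf as `log(N / df)`), and the word-level shingle size `k=3` are all assumptions made for the example, and TreeTagger is not used.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_inverted_index(docs):
    """Map each term to {doc_id: tf-idf weight}.

    Assumed weighting: tf = count / document length, idf = log(N / df).
    """
    term_counts = {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}

    # Document frequency: number of documents containing each term.
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())

    n_docs = len(docs)
    index = defaultdict(dict)
    for doc_id, counts in term_counts.items():
        total = sum(counts.values())
        for term, count in counts.items():
            tf = count / total
            idf = math.log(n_docs / doc_freq[term])
            index[term][doc_id] = tf * idf
    return dict(index)


def shingles(text, k=3):
    """Return the set of k-word shingles (as tuples) in the text."""
    tokens = tokenize(text)
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def containment(a, b, k=3):
    """Fraction of a's k-shingles that also occur in b (asymmetric)."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa:
        return 0.0
    return len(sa & sb) / len(sa)


# Hypothetical two-file collection, used only for illustration.
docs = {
    "a.txt": "the quick brown fox jumps",
    "b.txt": "the quick brown fox jumps over the lazy dog",
}
index = build_inverted_index(docs)
```

Containment is asymmetric: in this toy collection every shingle of `a.txt` also appears in `b.txt`, but not the other way around, so `containment(docs["a.txt"], docs["b.txt"])` and `containment(docs["b.txt"], docs["a.txt"])` differ. Terms that occur in every file get an idf of zero under this weighting, so they contribute nothing to the index weights.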