Compare: Tutorial

Showing with 2 additions and 0 deletions.
  1. +2 −0  Tutorial.textile
2  Tutorial.textile
@@ -31,6 +31,8 @@ The first step is to convert a set of documents to a Behemoth corpus :
bc. ./bin/hadoop jar ./behemoth-core-*-job.jar com.digitalpebble.behemoth.util.CorpusGenerator
-i "path to corpus" -o "path for output file"
+N.B. Directory paths should be qualified, e.g. file:/path/to/directory
+
Use the --recurse option if you want CorpusGenerator to process the input path recursively e.g.
bc. ./bin/hadoop jar ./behemoth-core-*-job.jar com.digitalpebble.behemoth.util.CorpusGenerator
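As an illustration of the N.B. added in this change, the sketch below shows one way to qualify a local directory path with the file: scheme before passing it to CorpusGenerator. The helper name `qualify_path` and the two example directories are hypothetical and not part of Behemoth; only the `-i` and `-o` options and the class name come from the tutorial text. The script prints the command rather than running it.

```shell
#!/bin/sh
# Hypothetical helper (not part of Behemoth): qualify a path with the file: scheme.
qualify_path() {
  case "$1" in
    /*)   printf 'file:%s\n' "$1" ;;              # absolute local path
    *:/*) printf '%s\n' "$1" ;;                   # already has a scheme (file:/, hdfs://), keep as-is
    *)    printf 'file:%s/%s\n' "$(pwd)" "$1" ;;  # relative path, resolved against the current directory
  esac
}

# Placeholder directories, chosen only for this example.
INPUT=$(qualify_path "/data/docs")
OUTPUT=$(qualify_path "/data/behemoth-corpus")

# Print the resulting command line instead of executing it.
printf '%s\n' "./bin/hadoop jar ./behemoth-core-*-job.jar com.digitalpebble.behemoth.util.CorpusGenerator -i \"$INPUT\" -o \"$OUTPUT\""
```

Paths that already carry a scheme are passed through unchanged, so the same helper can be applied to values that may or may not be qualified.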