Skip to content


Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
branch: master
Failed to load latest commit information.
sample Smaller sample size first commit
README reduced sample size due to folks reporting long run times first commit first commit
convertsearch.jar first commit


To run the sample, take the following steps:
1. Put sample emails from data folder into HDFS
2. Run hadoop job: 
   hadoop jar convertsearch.jar ConvertEmailsToSequence <sample email dir> <output dir>
   hadoop jar convertsearch.jar SearchEmail <sequence file dir> 
3. The sample data contains small set of .msg files (all copies) and the results in your /tmp dir should be identical to this
Something went wrong with that request. Please try again.