sample
ConvertEmailsToSequence.java
README
SearchEmail.java
WholeFileInputFormat.java
WholeFileInputReader.java
convertsearch.jar

README

To run the sample, take the following steps:
1. Put the sample emails from the data folder into HDFS.
2. Run the Hadoop jobs:
   hadoop jar convertsearch.jar ConvertEmailsToSequence <sample email dir> <output dir>
   hadoop jar convertsearch.jar SearchEmail <sequence file dir>
3. The sample data contains a small set of .msg files (all copies), and the results in your /tmp dir should be identical to this
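
As a rough sketch only, steps 1 and 2 above could be scripted as follows. This shows the commands, not the expected results. The HDFS directory names (emails-in, emails-seq) are hypothetical placeholders. The sketch also assumes that the <output dir> of ConvertEmailsToSequence is what SearchEmail expects as its <sequence file dir>, which the README does not state explicitly. By default the script only prints the commands; set RUN=1 to execute them.

```shell
#!/bin/sh
# Sketch of README steps 1 and 2. All directory names are assumptions.
LOCAL_EMAILS="${LOCAL_EMAILS:-data}"      # local folder holding the sample emails
HDFS_INPUT="${HDFS_INPUT:-emails-in}"     # hypothetical HDFS input dir
HDFS_SEQ="${HDFS_SEQ:-emails-seq}"        # hypothetical HDFS dir for the sequence files

# Print each command; execute it only when RUN=1.
run() {
    echo "+ $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    fi
}

run_sample() {
    # Step 1: copy the local sample emails into HDFS.
    run hadoop fs -put "$LOCAL_EMAILS" "$HDFS_INPUT"
    # Step 2: convert the emails to sequence files, then search them.
    run hadoop jar convertsearch.jar ConvertEmailsToSequence "$HDFS_INPUT" "$HDFS_SEQ"
    run hadoop jar convertsearch.jar SearchEmail "$HDFS_SEQ"
}

run_sample
```

Saved under a name of your choosing, for example run_sample.sh, it can be previewed with `sh run_sample.sh` and executed with `RUN=1 sh run_sample.sh`.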