This project uses Enron's Text Corpus data for building machine learning model, that processes emails (sent emails) by respective authors and then from the given email text, tries to predict who the author is.
Project includes multiple solutions, each solution in a separate folder. The
docs folder contains few reference docs.
The instructions about how-to generate features and run scripts (atleast for soln1) are present here