Predict author based on given text sample - use Enron email text corpus
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
docs
soln1
soln2
.gitignore
README.md
requirements.txt

README.md

Predict Author

This project uses Enron's Text Corpus data for building machine learning model, that processes emails (sent emails) by respective authors and then from the given email text, tries to predict who the author is.

Project includes multiple solutions, each solution in a separate folder. The docs folder contains few reference docs. The instructions about how-to generate features and run scripts (atleast for soln1) are present here