Unsupervised-Decomposition-of-a-Multi-Author-Document using NLP and ML techniques

Unsupervised-Decomposition-of-a-Multi-Author-Document

This project decomposes a multi-author document into its individual authorial components, sentence by sentence, using NLP and ML techniques.
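To make the idea of sentence-wise decomposition concrete, here is a minimal sketch that groups sentences into two clusters with a simple k-means-style loop over bag-of-words counts and cosine similarity. It is not the method implemented in this repository; the function names (`featurize`, `cosine`, `decompose`), the two-author assumption, and the sample sentences are all hypothetical and chosen only for illustration. It uses the standard library only.

```python
import math
import re
from collections import Counter


def featurize(sentence):
    """Bag-of-words counts for one sentence (lowercased word tokens)."""
    return Counter(re.findall(r"[a-z']+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def decompose(sentences, iterations=10):
    """Assign each sentence to one of two clusters (assumed: two authors).

    Hypothetical illustration only: seeds are the first sentence and the
    sentence least similar to it, so the result is deterministic.
    """
    vectors = [featurize(s) for s in sentences]
    farthest = min(range(len(vectors)), key=lambda i: cosine(vectors[0], vectors[i]))
    centroids = [Counter(vectors[0]), Counter(vectors[farthest])]
    labels = None
    for _ in range(iterations):
        # Assignment step: pick the more similar centroid (ties go to cluster 0).
        new_labels = [
            max((0, 1), key=lambda k: cosine(v, centroids[k])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: a centroid is the summed counts of its members
        # (cosine similarity ignores scale, so no averaging is needed).
        for k in (0, 1):
            members = [v for v, lab in zip(vectors, labels) if lab == k]
            if members:
                total = Counter()
                for m in members:
                    total.update(m)
                centroids[k] = total
    return labels


if __name__ == "__main__":
    sample = [
        "thou art wise and thou art kind",
        "the server logs the request and the server retries",
        "thou shalt speak and thou shalt listen",
        "the server caches the response and the client retries",
    ]
    for label, sentence in zip(decompose(sample), sample):
        print(label, sentence)
```

Plain word counts are a deliberately crude stylistic signal; the scripts under Usage presumably rely on richer word-based and parser-based features.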

Requirements

  • python (>2.7.11)
  • scikit-learn
  • nltk
  • numpy
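Before running the scripts, it can help to confirm that the listed packages are importable. The snippet below is a small sketch that assumes a Python 3 interpreter (`importlib.util.find_spec` is not available on Python 2.7); the helper name `missing_modules` is hypothetical. Note that scikit-learn is imported under the module name `sklearn`.

```python
import importlib.util


def missing_modules(module_names):
    """Return the names from module_names that cannot be found for import."""
    return [
        name for name in module_names
        if importlib.util.find_spec(name) is None
    ]


if __name__ == "__main__":
    # Import names for the packages listed under Requirements.
    missing = missing_modules(["sklearn", "nltk", "numpy"])
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```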

Usage

Baseline

python Src\ Code/baseline.py

Improvements

python Src\ Code/Our\ Methods/words_method.py
python Src\ Code/Our\ Methods/parser_method.py
python Src\ Code/Our\ Methods/hybrid_words_parser_method.py

Contributing

  1. Fork it!
  2. Create your feature branch: git checkout -b my-new-feature
  3. Commit your changes: git commit -am 'Add some feature'
  4. Push to the branch: git push origin my-new-feature
  5. Submit a pull request :)