Skip to content

siddsax/DocClassify

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Basic Document Representation and Classification

Dependencies

Numpy
argparse
Sklearn

DataSet

Download the large movie dataset from here and place in the same folder named aclImdb.

Contains

Representations

  • Binary Bag of Words
  • Normalized TF-IDF
  • TF-IDF

Classifiers

  • Logistic Regression
  • Naive Bayes
  • SVM
  • FeedForward Net

Running them

python model.py [--n=#Number of training points to be used] [--C=#Classifier to be used.]

About

Very Basic Document Classification

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages