Skip to content

supervised classification (implement feature generation: bag of words, TF-IDF)

Notifications You must be signed in to change notification settings

yh1008/textMining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Sentiment prediction using supervised learning

supervised classification using Multinomial Naive Bayes & Randon Forest

classification_v1.py implemented Bag of words, TF-IDF(with normalization)weighting, removed stop-words for feature engineering. classification_v2.py uses sklearn to process text.

used cross-validation's split_train_test() to generate hold out test set

print out performance metrics includes: accuracy, confusion matrix

plot ROC curve with Matplotlib

How to run?

$ python classfition_v1.py

About

supervised classification (implement feature generation: bag of words, TF-IDF)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages