Skip to content

kambasht/Spam-Email-Classification

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Spam-Email-Classification

Classifying Spam emails with the help of countvectorizer, tf-idf and linear svc

Making use of

  1. tokenization through WordNetLemmatizer

  2. tf-idf transformer[https://kavita-ganesan.com/tfidftransformer-tfidfvectorizer-usage-differences/#.Xoesx4hKiUk]

  3. Linear SVC for classification [https://www.youtube.com/watch?v=efR1C6CvhmE]

  4. Gridsearch Cross validation to find optimal parameters for classifier model[https://www.youtube.com/watch?v=fSytzGwwBVw]

  5. Evaluate the model using plot_precision_recall_curve to ensure we are covering our bases in case of an imbalanced datasets [https://machinelearningmastery.com/roc-curves-and-precision-recall-curves-for-imbalanced-classification/]

About

Classifying Spam emails with the help of countvectorizer, tf-idf and linear svc

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors