reedbertolotti/SVM_Spam_Classifier

SVM Spam Classifier

Classifies emails stored as text files as spam or not spam using a support vector machine (SVM).

Main script to run: svmSpam_main.m

Components:

  • split the email dataset into training and test sets
  • create a vocabulary list from the training set emails
    • list of the most frequently occurring words in the training emails
    • each feature of an example is 1 if the email contains the corresponding vocabulary word and 0 otherwise
  • create design matrix and labels for the training and test sets
  • determine the best SVM hyperparameters C and gamma using repeated k-fold cross-validation
  • train an SVM on the training set using the LIBSVM library
  • evaluate SVM performance on the test set
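
The vocabulary and feature steps above can be sketched roughly as follows. This is an illustrative Python sketch, not the repository's MATLAB code: the function names `tokenize`, `build_vocabulary`, and `to_features`, the tokenization rule, and the sample emails are all assumptions made for the example. It also assumes "most frequent" means total word occurrences across the training emails, which the README does not specify.

```python
from collections import Counter
import re


def tokenize(text):
    """Lowercase an email body and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(emails, size):
    """Return the `size` most frequent words across the training emails."""
    counts = Counter()
    for email in emails:
        counts.update(tokenize(email))
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:size]]


def to_features(email, vocabulary):
    """Binary feature vector: 1 if the vocabulary word occurs in the email, else 0."""
    present = set(tokenize(email))
    return [1 if word in present else 0 for word in vocabulary]


# Hypothetical training emails, used only to exercise the functions.
train_emails = [
    "Win money now, win big",
    "Meeting notes attached",
    "Money back guarantee, win today",
]

# The vocabulary is built from the training set only, then reused for the test set.
vocabulary = build_vocabulary(train_emails, size=3)
design_matrix = [to_features(email, vocabulary) for email in train_emails]
```

Building the vocabulary from the training emails alone keeps information from the test set out of the features, which matches the order of the steps listed above.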

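The hyperparameter search can be outlined in the same hedged way. The sketch below shows only the repeated k-fold splitting and the grid over C and gamma; `repeated_kfold`, `grid_search`, and `toy_score` are hypothetical names, and `toy_score` is a stand-in for training and validating an SVM on each split, which the repository does with LIBSVM.

```python
import random
from itertools import product


def repeated_kfold(n_examples, k, repeats, seed=0):
    """Yield (train_idx, val_idx) pairs for `repeats` independent shuffles into k folds."""
    rng = random.Random(seed)
    for _ in range(repeats):
        indices = list(range(n_examples))
        rng.shuffle(indices)
        # Deal the shuffled indices into k folds of near-equal size.
        folds = [indices[i::k] for i in range(k)]
        for i in range(k):
            val_idx = folds[i]
            train_idx = [j for fold in folds[:i] + folds[i + 1:] for j in fold]
            yield train_idx, val_idx


def grid_search(n_examples, c_values, gamma_values, score_fn, k=5, repeats=3):
    """Return the (C, gamma) pair with the highest mean validation score."""
    # Reuse the same splits for every pair so the comparison is like-for-like.
    splits = list(repeated_kfold(n_examples, k, repeats))
    best_params, best_score = None, float("-inf")
    for c, gamma in product(c_values, gamma_values):
        scores = [score_fn(c, gamma, train, val) for train, val in splits]
        mean_score = sum(scores) / len(scores)
        if mean_score > best_score:
            best_params, best_score = (c, gamma), mean_score
    return best_params, best_score


def toy_score(c, gamma, train_idx, val_idx):
    """Stand-in scorer (assumption): a real one would train an SVM on train_idx
    and return its accuracy on val_idx."""
    return -abs(c - 1.0) - abs(gamma - 0.1)


best_params, best_score = grid_search(
    n_examples=10,
    c_values=[0.1, 1.0, 10.0],
    gamma_values=[0.01, 0.1, 1.0],
    score_fn=toy_score,
)
```

Averaging over several reshuffled rounds of k folds reduces the dependence of the chosen C and gamma on any single random partition of the training set.
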