Skip to content

a34729t/coding-dojo-ml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spam Classification Problem for LA Coding Dojo

Tasks:

  1. Process data into Weka ARFF format
  2. Train a classifier using Weka
  3. Test your classifier against test data

DATA PROCESSING

TRAINING

TESTING

NOTES

  • I would recommend using Naive Bayes and the Decision Tree first as they give you good feedback about which features are useful
  • Machine Learning is about quality data and features—what model you use is much less important

WEKA CLASSIFIERS (From simplest to more complicated)

  • weka.classifiers.bayes.NaiveBayes
  • weka.classifiers.trees.J48
  • weka.classifiers.functions.Logistic
  • weka.classifiers.lazy.kstar
  • weka.classifiers.rules.JRip
  • weka.classifiers.functions.SMO

About

spam filtering problem

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages