An implementation of the Naive Bayes machine learning algorithm, applied to spam filtering
Python
Data
.gitignore
BayesClassifier.py
Corpus.py
Message.py
README.mdown
Stemmer.py
StemmerExample.py
stemmedvoc.txt
stop_words.txt
utils.py
voc.txt

Bayesian Spam Filter

This project implements a Naive Bayes spam filter in Python.

We first implemented the vanilla Naive Bayes filter, trained on the data provided, and then extended it as listed below.
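
For orientation, a vanilla filter of this kind can be sketched as below. This is a minimal illustration and not the code in `BayesClassifier.py`: the function names `train` and `classify`, the `"spam"`/`"ham"` labels, the multinomial word-count model, and add-one (Laplace) smoothing are all assumptions made for the example.

```python
import math
from collections import Counter

LABELS = ("spam", "ham")  # hypothetical label names, chosen for this sketch


def train(messages):
    """Build a model from an iterable of (tokens, label) pairs.

    Assumes every label in LABELS occurs at least once in the training data.
    """
    doc_counts = Counter()                       # number of messages per label
    word_counts = {label: Counter() for label in LABELS}
    for tokens, label in messages:
        doc_counts[label] += 1
        word_counts[label].update(tokens)
    vocab = set()
    for label in LABELS:
        vocab.update(word_counts[label])
    return doc_counts, word_counts, vocab


def classify(tokens, model):
    """Return the label with the highest log-posterior score."""
    doc_counts, word_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in LABELS:
        # Log prior: fraction of training messages carrying this label.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothing avoids log(0) for words unseen in this class.
            likelihood = (word_counts[label][tok] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        scores[label] = score
    return max(scores, key=scores.get)
```

Working in log space keeps the product of many small probabilities from underflowing. A call might look like `classify(["cheap", "pills"], train(training_pairs))`, where `training_pairs` is a hypothetical list of tokenised, labelled messages.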

Extensions

  • Stemming using Porter's algorithm
  • Cosine normalisation of feature weights
  • Feature selection using WEKA
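
As an illustration of the cosine normalisation extension, the sketch below scales a message's feature weights so that the weight vector has unit Euclidean length, which stops long messages from dominating purely because they contain more words. How the repository applies this is not shown here; the function name `cosine_normalise` and the use of raw term counts as the starting weights are assumptions.

```python
import math
from collections import Counter


def cosine_normalise(tokens):
    """Map each distinct token to its count divided by the vector's L2 norm."""
    counts = Counter(tokens)
    norm = math.sqrt(sum(c * c for c in counts.values()))
    if norm == 0:
        return {}  # empty message: nothing to normalise
    return {tok: c / norm for tok, c in counts.items()}
```

After normalisation the squared weights of a message sum to 1, so two messages with the same word proportions but different lengths receive the same feature vector.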