
The easiest way to classify SMS data (Spam/Ham) using Python. 📟


vaibhavmagon/Python-Spam-Ham-SMS


Python Spam Ham SMS Bifurcation Script 📟

The code walks step by step through classifying a set of over 5,000 SMS messages as spam or ham in Python, using the Naive Bayes classification algorithm.
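As a rough illustration of the technique (not the repository's actual code), here is a minimal multinomial Naive Bayes sketch with add-one smoothing, written from scratch with only the standard library. The function names, the tokenizer, and the toy messages are assumptions made for this example, and it is written in Python 3 syntax, so small adjustments may be needed on Python 2.7.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    # Hypothetical tokenizer: lowercase words, digits and apostrophes.
    return re.findall(r"[a-z0-9']+", text.lower())


def train(messages):
    """Count documents per label and words per label from (label, text) pairs."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for label, text in messages:
        class_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return {"class_counts": class_counts, "word_counts": word_counts, "vocab": vocab}


def log_scores(model, text):
    """Return log P(label) + sum of log P(word | label) for each label."""
    total_docs = sum(model["class_counts"].values())
    vocab_size = len(model["vocab"])
    tokens = [t for t in tokenize(text) if t in model["vocab"]]  # skip unseen words
    scores = {}
    for label, n_docs in model["class_counts"].items():
        counts = model["word_counts"][label]
        total_words = sum(counts.values())
        score = math.log(n_docs / float(total_docs))
        for tok in tokens:
            # Add-one (Laplace) smoothing avoids log(0) for words unseen in a class.
            score += math.log((counts[tok] + 1.0) / (total_words + vocab_size))
        scores[label] = score
    return scores


def predict(model, text):
    scores = log_scores(model, text)
    return max(scores, key=scores.get)


# Toy data, invented for illustration only.
toy = [
    ("spam", "Win a free prize now"),
    ("spam", "Free entry claim your prize"),
    ("ham", "Are we still meeting for lunch"),
    ("ham", "Call me when you get home"),
]
model = train(toy)
print(predict(model, "claim your free prize"))
print(predict(model, "lunch at home"))
```

Working in log space keeps the product of many small probabilities from underflowing, which matters once messages contain more than a handful of words.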

More on Naive Bayes: https://machinelearningmastery.com/naive-bayes-classifier-scratch-python/

To Run

python spam.py

Data

The Data folder contains the spam-ham.csv file with the labelled SMS data.
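A sketch of how such a CSV file might be loaded and split, assuming (this is not confirmed by the repository) that the label sits in the first column, the message text in the second, and that the file has a header row. The names `load_messages` and `split` are hypothetical, and the in-memory sample stands in for the real file. It is written in Python 3 syntax.

```python
import csv
import io
import random


def load_messages(fileobj, label_col=0, text_col=1, has_header=True):
    """Read (label, text) pairs from a CSV file object.

    The column positions are assumptions; adjust them to the real file.
    """
    reader = csv.reader(fileobj)
    if has_header:
        next(reader, None)  # skip the header row
    rows = []
    for row in reader:
        if len(row) <= max(label_col, text_col):
            continue  # skip blank or malformed rows
        rows.append((row[label_col].strip().lower(), row[text_col].strip()))
    return rows


def split(rows, test_fraction=0.2, seed=0):
    """Shuffle reproducibly, then hold out a fraction for testing."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    cut = len(rows) - int(len(rows) * test_fraction)
    return rows[:cut], rows[cut:]


# In-memory stand-in for the CSV file; the contents are invented.
sample = io.StringIO(
    "label,text\n"
    "ham,See you at 5\n"
    'spam,"WINNER! Claim your prize, now"\n'
    "ham,ok\n"
    "spam,Free entry\n"
    "ham,On my way\n"
)
rows = load_messages(sample)
train_rows, test_rows = split(rows)
print(len(train_rows), len(test_rows))
```

With a real file, the `io.StringIO` object would be replaced by something like `open(path, newline="")`, where `path` points at the CSV in the Data folder.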

Apart from this, the workflow is split into two separate scripts, one to train and test on the labelled data and one to run on real data (production):
1. spam-sms.py (to train)
2. spam-real.py (to run in production)
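One common way to split work between a training script and a production script is to have the first save a fitted model to disk and the second load it. The sketch below illustrates that pattern with `pickle`; whether the repository's scripts work this way is an assumption, and `train_step`, `production_step`, and the word-count "model" are hypothetical stand-ins rather than the actual classifier.

```python
import os
import pickle
import tempfile
from collections import Counter


def train_step(messages, model_path):
    """Fit a toy per-label word-count model and save it to disk."""
    counts = {}
    for label, text in messages:
        counts.setdefault(label, Counter()).update(text.lower().split())
    with open(model_path, "wb") as fh:
        pickle.dump(counts, fh)


def production_step(model_path, text):
    """Load the saved model and pick the label whose words overlap most."""
    with open(model_path, "rb") as fh:
        counts = pickle.load(fh)  # only unpickle files you created yourself
    words = text.lower().split()
    return max(counts, key=lambda label: sum(counts[label][w] for w in words))


# Toy run in a temporary directory; the messages are invented.
workdir = tempfile.mkdtemp()
model_file = os.path.join(workdir, "model.pkl")
train_step([("spam", "free prize now"), ("ham", "see you at lunch")], model_file)
first = production_step(model_file, "free prize")
second = production_step(model_file, "see you")
print(first, second)
os.remove(model_file)
os.rmdir(workdir)
```

Keeping training and prediction in separate steps means the production side never needs the labelled data set, only the saved model file.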

ROC Curve for current Data

[Image: screenshot of the ROC curve, captured 2018-09-08]
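For readers unfamiliar with how a ROC curve is built, the following standard-library sketch computes the curve's points and the area under it from classifier scores. The labels and scores are invented for illustration and are not the repository's results; `roc_points` and `auc` are hypothetical helper names.

```python
def roc_points(labels, scores, positive="spam"):
    """Return (false positive rate, true positive rate) points.

    The threshold is swept from the highest score downward; messages with
    equal scores are consumed together so ties produce a single point.
    """
    pos = sum(1 for label in labels if label == positive)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative example")
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    tp = fp = 0
    points = [(0.0, 0.0)]
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1] == positive:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / float(neg), tp / float(pos)))
    return points


def auc(points):
    """Area under the curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Invented example: a higher score means "more likely spam".
labels = ["spam", "spam", "ham", "spam", "ham", "ham"]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
points = roc_points(labels, scores)
print(points)
print(auc(points))
```

An area close to 1.0 means spam messages are almost always scored above ham messages, while 0.5 corresponds to random guessing.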

Prerequisites

  • Python 2.7
  • Machine learning libraries
  • Anaconda/Spyder

Contributors

  • Vaibhav Magon