Spam Classifier with Naive Bayes

About The Project

In this project we are going to:

Create a Spam Classifier with Naive Bayes Classifier
Create a simple Flask application
Create a Docker container to run our classifier

Built With

Naive Bayes Classifier

Naive Bayes is a classification technique that is based on Bayes’ Theorem with an assumption that all the features that predicts the target value are independent of each other. It calculates the probability of each class and then pick the one with the highest probability. It has been successfully used for many purposes, but it works particularly well with natural language processing (NLP) problems.

In order to train our model we used the "spam_data.csv" file, containing both ham and spam email content. In the "train.py" you will find the complete code used to train and save the trained model to a pickle file. Sklearn has a very good module for using Naive Bayes methods, you can check its documentation by clicking here.

Flask

Flask is a web application framework written in Python. It has multiple modules that make it easier for a web developer to write applications without having to worry about the details like protocol management, thread management, etc.

Because of its simplicity, its a very known tool when creating APIs and deploy ML models, as we are gonna see.

Docker

Docker is an open source containerization platform. It enables developers to package applications into containers-standardized executable components combining application source code with the operating system (OS) libraries and dependencies required to run that code in any environment.

Getting Started

1. Running a Container

To run it with Docker just build the Dockerfile as follows:

  docker build -t "name of your image" .

And then, run:

  docker run -i -d -p 5000:5000 "name of your image"

2. Running Locally

If you want to run it locally instead, do the following:

  git clone https://github.com/nicholascomuni/Spam-Classifier
  cd Spam-Classifier
  pip install -r requirements.txt

Run the flask application:

flask run --host 0.0.0.0

Usage

Once the flask app is deployed and running localy or on a server, you can request a spam/ham classification by sending to the "/predict" endpoint a POST request, it has to contain the body of the email in JSON syntax as follows: {'content':"email body"}. As a response you will get the predicted class!

Sending the POST request:

curl --location --request POST 'localhost:5000/predict' \
--header 'Content-Type: application/json' \
--data-raw '{"content":"Dear friend! How you doing? Its been a long time since we dont meet!"}'

Server Response

{
    "prediction": "ham"
}

References

https://medium.com/analytics-vidhya/na%C3%AFve-bayes-algorithm-5bf31e9032a2

https://scikit-learn.org/stable/modules/naive_bayes.html

https://www.ibm.com/cloud/learn/docker

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
images		images
.gitinore		.gitinore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
predictor.pkl		predictor.pkl
requirements.txt		requirements.txt
spam_data.csv		spam_data.csv
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spam Classifier with Naive Bayes

About The Project

Built With

Naive Bayes Classifier

Flask

Docker

Getting Started

1. Running a Container

2. Running Locally

Usage

References

About

Releases

Packages

Languages

nicholascomuni/Spam-Classifier

Folders and files

Latest commit

History

Repository files navigation

Spam Classifier with Naive Bayes

About The Project

Built With

Naive Bayes Classifier

Flask

Docker

Getting Started

1. Running a Container

2. Running Locally

Usage

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages