Spam-Classifier

In this repository i will show you how i did Spam Classifier with naive Bayes classifier

Github link to repository: https://github.com/MadoDoctor/Spam-Classifier

In this Machine Learning we worked with packages:

NumPy;
Pandas;
re.

Introduction to naive Bayes classification

In statistics, naive Bayes classifier are a family of simple "probabilistic classifiers" based on applying Bayes' theorem with strong (naïve) independence assumptions between the features. Bayes' theorem is stated mathematically as the following equation:

$\mathbf{P(A|B)=\frac{P(B|A)P(A)}{P(B)}}$

$\mathbf{P(A|B)}$ - is a conditional probability: the probability of event A occurring given that B is true. It is also called the posterior probability of A given B.

$\mathbf{P(B|A)}$ - is also a conditional probability: the probability of event B occurring given that A is true. It can also be interpreted as the likelihood of A given a fixed B because $\mathbf{P(B|A)=L(A|B)}$ .

$\mathbf{P(A)}$ and $\mathbf{P(B)}$ are the probabilities of observing A and B respectively without any given conditions; they are known as the marginal probability or prior probability.

A and B must be different events.

To learn more about Bayes' theorem follow the link: https://en.wikipedia.org/wiki/Bayes%27_theorem

$\mathbf{P(w_i{|spam})=\frac{N_{spam}(w_1)+\alpha}{N_{spam}+\alpha\times N_{Voc}}$

$\mathbf{N_{spam}(w_i)}$ - The number of $\mathbf{w_i}$ words repeated in spam messages.

$\mathbf{N_{spam}}$ - Total number of words in spam messages.

$\mathbf{N_{Voc}}$ - The total number of unique words in all messages.

$\mathbf{\alpha}$ - Anti-aliasing parameter.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
README.md		README.md
dataset.py		dataset.py
main.ipynb		main.ipynb
model.py		model.py
spam.csv		spam.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spam-Classifier

Introduction to naive Bayes classification

About

Releases

Packages

Languages

MadoDoctor/Spam-Classifier

Folders and files

Latest commit

History

Repository files navigation

Spam-Classifier

Introduction to naive Bayes classification

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages