Skip to content
master
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 

ML-troll-filter

A ML project for filtering spam and troll comments/emails.

DESCRIPTION

Are you tired from people flooding your website/platform with spam and deceitful comments? Well, you are not the only one.

If you were wondering what big platforms like Youtube, Tripavisor, and others have done to solve (or at list limit)
this issue, Well, two words: Machine Learning. This project will help you train your machine to recognize potential spam messages by feeding it 16541 snippets of spam/ham messages.

Scikit-learn is required (check dependencies below) to train our Naive Bayes classifier.

Dataset: We are using the first 3 parts of the Enron spam dataset (minimize processing time).

If you would like to add even more accuracy to the model, you can add more parts to the data dir.

DEPENDENCIES

  1. Python 3.7.+
  2. numpy
  3. $ pip install numpy
    
  4. scikit-learn
  5. $ pip install scikit-learn
    

INSTALLATION

  1. Clone this project
  2. $ git clone https://github.com/tahaHichri/ML-troll-filter.git
    
  3. Launch message keyword dictionary/features generations
  4. $ python classif.py 
    
  5. Check whether a message is a spam by passing it as an argument to check.py. The output should be "SPAM" or "NOT SPAM"
  6. $ python check.py "Hey, this is an example comment"
    

About

A ML project for filtering spam and troll comment/emails.

Topics

Resources

License

Releases

No releases published

Packages

No packages published