This project classifies an email as spam or ham (not spam) based on the text it contains. We use CountVectorizer to convert each email into word counts, and we train a support vector machine (SVM) on those counts to make the classification.
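To make the word-count step concrete, here is a minimal standard-library sketch of the bag-of-words idea behind CountVectorizer. It is an illustration, not scikit-learn's implementation: the helper names (`tokenize`, `fit_vocabulary`, `transform`) and the two sample emails are made up for this example. The tokenizer assumes lowercasing and words of two or more characters, which mirrors CountVectorizer's default behaviour.

```python
import re
from collections import Counter


def tokenize(text):
    # Lowercase the text and keep runs of two or more word characters
    return re.findall(r"\b\w\w+\b", text.lower())


def fit_vocabulary(docs):
    # Map every distinct word to a column index, in alphabetical order
    words = sorted({tok for doc in docs for tok in tokenize(doc)})
    return {word: idx for idx, word in enumerate(words)}


def transform(docs, vocab):
    # Build one row of word counts per document
    rows = []
    for doc in docs:
        counts = Counter(tokenize(doc))
        row = [0] * len(vocab)
        for word, n in counts.items():
            if word in vocab:  # words unseen during fitting are ignored
                row[vocab[word]] = n
        rows.append(row)
    return rows


# Hypothetical sample emails, for illustration only
emails = ["Win a free prize now", "Meeting at noon, free lunch"]
vocab = fit_vocabulary(emails)
matrix = transform(emails, vocab)
```

Each row of `matrix` is the feature vector for one email, and this is the kind of numeric input the SVM is trained on.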
Here's the dataset that I have used.
Key components used in this project:
- Python for machine learning
- SVM for training the model
- CountVectorizer for extracting word counts from the email text
- GridSearchCV for tuning hyperparameters to improve accuracy
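
The components listed above could be wired together roughly as follows. This is a sketch under assumptions, not the project's actual code: the tiny in-line corpus, the parameter grid, and the step names `vectorizer` and `svm` are all made up for illustration, and the real project trains on its own dataset.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC

# Made-up toy corpus, for illustration only (not the project's dataset)
texts = [
    "win a free prize now",
    "claim your free cash reward",
    "urgent offer win money today",
    "free entry to win a prize",
    "meeting moved to noon tomorrow",
    "please review the attached report",
    "lunch at the office on friday",
    "notes from the project meeting",
]
labels = ["spam", "spam", "spam", "spam", "ham", "ham", "ham", "ham"]

# Chain word-count extraction and the SVM so both are fitted together
pipeline = Pipeline([
    ("vectorizer", CountVectorizer()),
    ("svm", SVC()),
])

# Assumed search grid; the values are illustrative, not tuned
param_grid = {
    "svm__C": [0.1, 1, 10],
    "svm__kernel": ["linear", "rbf"],
}

# cv=2 only because the toy corpus is tiny; use more folds on real data
search = GridSearchCV(pipeline, param_grid, cv=2, scoring="accuracy")
search.fit(texts, labels)

print(search.best_params_)
predictions = search.predict([
    "free prize waiting for you",
    "see you at the meeting",
])
print(predictions)
```

Putting CountVectorizer inside the pipeline means the vocabulary is rebuilt on each training fold during the grid search, so the cross-validated accuracy is not inflated by words seen only in the held-out fold.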