Training Naive Bayes classifier to classify spam and ham emails.
1. pre-processing dataset containing raw emails.(Tokenizing, Stemming, HTML removing, Extracting non-stopwords
Using NLTK library).
2. Visualizing dataset and Creating WordClouds
3. Using Naive bayes algorithm to find probability of spam emails and word frequencies.(Training)
4. Testing and Evaluating model.
-Model has accuracy of 97.7%