Spam_filter

Naive Bayes Spam Filter used in the lab works for "Software testing and validation" course in Polytechnique Montreal, where I was a TA. More information about the system can be found in the pdf file, which lab1 description. Here's a brief overview: Each module of the system is implemented in the separate python file. The main system modules are:

crud.py: module to achieve the functionality CRUD (Create Read Update Delete) for users and groups. User is represented by his e-mail address and the information he receives associated. Group is a collection of users.
text_cleaner.py: module for cleaning and tokenization of the text.
Vocabulary creation: module to create vocabulary with frequency words found in spam and ham messages.
renege.py: module for processing new emails. Executed the logic of adding / updating information about users and groups.
email_analyzer.py: module to calculate the probability that e-mail is spam or ham.
main.py: main module to manage the system. Understand a spam detection accuracy evaluation function.

To run the system execute: python3 main.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
LOG3430_TP1.pdf		LOG3430_TP1.pdf
README.md		README.md
crud.py		crud.py
email_analyzer.py		email_analyzer.py
groups.json		groups.json
mails.json		mails.json
main.py		main.py
renege.py		renege.py
test300.json		test300.json
text_cleaner.py		text_cleaner.py
train.json		train.json
train700.json		train700.json
users.json		users.json
vocabulary.json		vocabulary.json
vocabulary_creator2.py		vocabulary_creator2.py

dgumenyuk/Spam_filter

Folders and files

Latest commit

History

Repository files navigation

Spam_filter

About

Resources

Stars

Watchers

Forks

Languages