I want to detec the spam emails by the collection of the sample spam messages and improve the data base of the spam mesages by the detecting the attribute of the each spam emails, also if the amount and size of the emails are very long we can use the inverted index algorithm in order to quickly execute the program.
in order to use my cod you can run the PegahTorkamandi.py , also the DOCUMENT.txt is collection of the spam messages' attributes.