Latest commit 7a7d2b0 Jan 19, 2017 @fedelemantuano fedelemantuano Donate link
Failed to load latest commit information.
mailparser Minor change Jan 13, 2017
mailparser_version Changed version Jan 13, 2017
tests Minor change Jan 13, 2017
.gitignore Bugfix Nov 11, 2016
LICENSE.txt Initial commit Sep 8, 2016 Version 0.5.0 Nov 24, 2016
NOTICE.txt Initial commit Sep 8, 2016
README New RC version Jan 11, 2017 Donate link Jan 19, 2017
requirements.txt Porting to Python 3 Jan 11, 2017
setup.cfg Include Sep 9, 2016 Porting to Python 3 Jan 11, 2017



mail-parser is a wrapper for email Python Standard Library. It's the key module of SpamScope.

From version 1.0.0rc1 mail-parser supports Python 3.


mail-parser takes as input a raw mail and generates a parsed object. This object is a tokenized mail with the all parts of mail and some indicator:

  • body
  • headers
  • subject
  • from
  • to
  • attachments
  • message id
  • date
  • charset mail
  • sender IP address

We have also two indicator:

  • anomalies: mail without message id or date
  • defects: mail with some not compliance RFC part


These defects can be used to evade the antispam filter. An example are the mails with a malformed boundary that can hide a not legitimate epilogue (often malware). This library can take these epilogues.

Apache 2 Open Source License

mail-parser can be downloaded, used, and modified free of charge. It is available under the Apache 2 license. Donate


Main Author

Fedele Mantuano (Twitter: @fedelemantuano)


Clone repository

git clone

and install mail-parser with

cd mail-parser

python install

or use pip:

pip install mail-parser

Usage in a project

Import MailParser class:

from mailparser import MailParser

parser = MailParser()

Then you can get all parts

parser.text_plain_list: only text plain mail parts in a list
parser.attachments_list: list of all attachments
parser.parsed_mail_obj: tokenized mail in a object
parser.parsed_mail_json: tokenized mail in a JSON
parser.defects: defect RFC non compliance
parser.defects_category: only defects categories

Usage from command-line

If you installed mailparser with pip or you can use it with command-line.

These are all swithes:

usage: mailparser [-h] (-f FILE_ | -s STRING_) [-j] [-b] [-a] [-r] [-t] [-m]
                  [-u] [-d] [-n]

Wrapper for email Python Standard Library

optional arguments:
  -h, --help            show this help message and exit
  -f FILE_, --file FILE_
                        Raw email file (default: None)
  -s STRING_, --string STRING_
                        Raw email string (default: None)
  -j, --json            Show the JSON of parsed mail (default: False)
  -b, --body            Print the body of mail (default: False)
  -a, --attachments     Print the attachments of mail (default: False)
  -r, --headers         Print the headers of mail (default: False)
  -t, --to              Print the to of mail (default: False)
  -m, --from            Print the from of mail (default: False)
  -u, --subject         Print the subject of mail (default: False)
  -d, --defects         Print the defects of mail (default: False)
  -n, --anomalies       Print the anomalies of mail (default: False)
  -i Trust mail server string, --senderip Trust mail server string
                        Extract a reliable sender IP address heuristically
                        (default: None)
  -v, --version         show program's version number and exit

It takes as input a raw mail and generates a parsed object.


$ mailparser -f example_mail -j

This example will show you the tokenized mail in a JSON pretty format.