-
-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parsing mbox files? #11
Comments
Hi, thanks! I wasn't planning on supporting mbox but it could be a nice addition. I'll keep the issue open and as soon as I have some spare time I'll add the functionality. It shouldn't take too much time to implement it. |
Thanks for the update, that'd be fantastic. If you start on it, I can maybe help with some PRs as well. Let me know! |
Hi, I just pushed to master the MBox parser as well as an example under the |
@mdecimus - fantastic, thank you! I've tried it out on more than 20k mbox files, totaling over 4M emails. Seems to work fine, at least for my case. There are some weird&infrequent corner cases where people copy paste emails (including all the metadata) and paste them in the body of the email; however, these are really hard to detect so this is good enough for me. I hope mail-parser becomes more popular and the standard way of parsing emails in Rust, it works really nice so far! |
That is probably a bug in the process that generated the mbox file. When writing a message to an mbox file, any lines beginning with
Thanks, I hope that too! |
The library is great, thanks for putting it together! I was wondering if there are plans to have a parser for reading&parsing files in mbox format?
The text was updated successfully, but these errors were encountered: