New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to teach Bayes ham and spam #1438
Comments
Hello, If you move a mail from the Junk folder back to any folder (except the Trash folder), Mailu will learn them as ham. Not straightforward, but maybe that would work ? A more straightforward way would be to modify the existing behaviour as defined in dovecot |
Sounds scary, but would it work? Will it be "unlearned" from spam after moving it back? |
I did it and got a strange message (this is neural, so not directly relevant) ; lua; neural.lua:487: cannot learn ANN tRFANN9D954503A5235C34260: too many ham samples: 162 |
This value was unchanged by the move in/out |
I did look into the code, and we are maybe missing a "fuzzy_del" action when we are moving a from ham to spam and vice versa. |
Thank for looking into it. Please post your PR # so I can follow up |
Yes I will do.
Any new message that will be put in this folder will be learnt as ham (for both bayes and fuzzy)
This should learn every file located in the Ham_Learn mailbox from user@example.com |
Thanks. I know about the rspamc method, but I tried to run it from the rspamd container. Maybe put this info somewhere in the docs? |
1440: Relearn messages for fuzzy storage r=mergify[bot] a=ofthesun9 ## What type of PR? enhancement, bugfix ## What does this PR do? This PR add a rspamc fuzzy_del to ham & spam scripts, in order to cover [relearning](https://rspamd.com/doc/faq.html#can-i-relearn-messages-for-fuzzy-storage-or-for-statistics) from Junk list to Ham list and vice versa ### Related issue(s) #1438 ## Prerequistes Before we can consider review and merge, please make sure the following list is done and checked. If an entry in not applicable, you can check it or remove it from the list. - [x] Added 1438.bugfix Co-authored-by: ofthesun9 <olivier@ofthesun.net>
In the meantime you can also learn spam/ham in the webinterface of rspamd under yourmaildomain.tld/admin/antispam/#scan (which only works if you are logged in as an administrator) |
As I wrote in the original posting: "The spam/learn tab in rspamd is ridiculous - should I manually paste in hundreds of ham messages?" |
1470: Adding faq entry: How to teach Bayes ham and spam #1438 r=mergify[bot] a=ofthesun9 Fix #1438 ## What type of PR? documentation (faq) ## What does this PR do? This PR add an faq entry to cover #1438 ### Related issue(s) closes #1438 ## Prerequistes - [x] In case of feature or enhancement: documentation updated accordingly Co-authored-by: ofthesun9 <olivier@ofthesun.net>
1470: Adding faq entry: How to teach Bayes ham and spam #1438 r=muhlemmer a=ofthesun9 Fix #1438 ## What type of PR? documentation (faq) ## What does this PR do? This PR add an faq entry to cover #1438 ### Related issue(s) closes #1438 ## Prerequistes - [x] In case of feature or enhancement: documentation updated accordingly Co-authored-by: ofthesun9 <olivier@ofthesun.net>
The requirement for 200 ham for Bayes classification is not easy to meet with low-volume servers. rspamd works reasonably for the usual viagnigeria spam, but it fails completely for the dozens of requests per day to submit my publication to some fake-open-source journal
I am used to use sa-learn with Spamassassin or rspamc with rspamd to use existing mail for ham education, but ther is no rspamc. The spam/learn tab in rspamd is ridiculous - should I manually paste in hundreds of ham messages?
What is the best method to instruct a mailu installation to use an existing clean mailbox as ham-fodder?
The text was updated successfully, but these errors were encountered: