Spam Classification #2991

ewdurbin · 2018-02-19T23:14:26Z

Pursuant to #2982

ewdurbin · 2018-02-19T23:14:44Z

@di @dstufft thoughts on the modeling thus far?

di · 2018-02-20T03:58:18Z

warehouse/spam/models.py

+    release_version = Column(Text, nullable=False)
+    source = Column(report_source, nullable=False)
+    reporter_user_id = Column(UUID, nullable=True)
+    result = Column(Boolean, nullable=False)


Is this supposed to be some metric for "spaminess"?

Negative. In initial research of akismet and smythe, the response to a classification request is a boolean: True/False, or verdict:BLOCK/verdict:ALLOW (which can be trivially modeled as a Boolean).
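A rough sketch of what "trivially modeled as a Boolean" could look like, for illustration only. The function name `verdict_to_bool` is hypothetical and not part of this PR, and the choice that True means "flagged as spam" (BLOCK) is an assumption, since the thread does not state the polarity of `result`.

```python
def verdict_to_bool(response):
    """Normalize a classifier response to a bool.

    Assumption (not from the PR): True means "spam / BLOCK",
    False means "not spam / ALLOW".
    """
    # Some services may already hand back a real boolean.
    if isinstance(response, bool):
        return response

    normalized = str(response).strip().lower()
    if normalized in {"true", "verdict:block"}:
        return True
    if normalized in {"false", "verdict:allow"}:
        return False

    # Refuse to guess on anything else rather than store a wrong result.
    raise ValueError(f"unrecognized classifier response: {response!r}")
```

Raising on unknown responses keeps a non-nullable `result` column from silently recording a guess.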

di · 2018-02-20T15:46:49Z

Seems to make sense. This would capture results from both automated services and user-generated reports, and we could aggregate them for every project. I like that the analyze_upload task is asynchronous as well.
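For illustration, a minimal in-memory sketch of the per-project aggregation mentioned above. It is not the PR's implementation: the `SpamReport` dataclass only mirrors the four columns visible in the diff, the `project` field and the source values `"automation"` and `"user"` are assumptions, and `aggregate_by_project` is a hypothetical helper.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class SpamReport:
    project: str                      # assumption: not shown in the diff excerpt
    release_version: str
    source: str                       # assumption: e.g. "automation" or "user"
    reporter_user_id: Optional[str]   # None for automated reports
    result: bool                      # assumption: True means flagged as spam


def aggregate_by_project(reports):
    """Count flagged and total reports for each project."""
    totals = defaultdict(lambda: {"spam": 0, "total": 0})
    for report in reports:
        totals[report.project]["total"] += 1
        if report.result:
            totals[report.project]["spam"] += 1
    return dict(totals)


reports = [
    SpamReport("example-pkg", "1.0", "automation", None, True),
    SpamReport("example-pkg", "1.0", "user", "hypothetical-user-id", True),
    SpamReport("other-pkg", "0.1", "automation", None, False),
]
summary = aggregate_by_project(reports)
```

In the real schema this would more likely be a grouped query against the spam_report table; the loop above only shows the shape of the result.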

ewdurbin · 2018-02-20T15:47:55Z

dope, I'll continue fleshing this out tonight.

initial pass at modelling a spam_report table

9809881

ewdurbin force-pushed the spam_classification branch from 3a7f701 to 9809881 Compare February 20, 2018 00:04

di reviewed Feb 20, 2018

brainwane mentioned this pull request Mar 6, 2018

Ongoing strategies for spam #2982

Open

di assigned ewdurbin Apr 2, 2018

ewdurbin closed this Apr 24, 2018

dstufft deleted the spam_classification branch November 28, 2018 18:55
