Re-generate topics and re-train fraud detection #1

Philipp-Sc · 2023-04-13T15:13:07Z

Re-generate topics and re-train fraud detection with bigger dataset of governance proposals.

governance_proposal_spam_ham.csv 
---------------
count spam: 172
count ham: 2551

Note: This will be great to reduce false positives, since the model has not yet seen many ham (and spam) data for governance proposals.

Note: consider reducing the ham dataset by filtering some of the rejected proposals with high votes against. To make sure not to train likely spam as ham.

The text was updated successfully, but these errors were encountered:

Philipp-Sc · 2023-04-13T15:23:40Z

add DAO governance proposals ~~first~~

Philipp-Sc · 2023-06-07T09:33:28Z

refactor dataset loading: instead of loading a boolean load the label as f64. That way the float label from governance_proposal_spam_ham.csv can be used.

Philipp-Sc · 2023-07-02T08:47:26Z

Instead of predicting all topics at once (the sum of the predictions equal to 1) predict (binary) topic pairs e.g ["hot","cold"]

evaluate performance vs previous technique.

New technique performs better. A potential drawback is that a higher number of topics might increase the inference time and makes it take to long on CPU only systems.

Philipp-Sc · 2023-07-02T09:11:06Z

consider feature selection, to improve inference time. (relevant for CPU only systems)

Philipp-Sc mentioned this issue Apr 13, 2023

Milestones: Cosmos Governance Briefings Philipp-Sc/cosmos-rust-bot#4

Open

7 tasks

Philipp-Sc closed this as completed Jul 2, 2023

Philipp-Sc reopened this Jul 2, 2023

Philipp-Sc self-assigned this Jul 2, 2023

Philipp-Sc mentioned this issue Jul 3, 2023

add DAO governance proposals dataset #5

Open

Philipp-Sc closed this as completed Jul 3, 2023

Philipp-Sc mentioned this issue Jul 3, 2023

feature selection #6

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Re-generate topics and re-train fraud detection #1

Re-generate topics and re-train fraud detection #1

Philipp-Sc commented Apr 13, 2023 •

edited

Loading

Philipp-Sc commented Apr 13, 2023 •

edited

Loading

Philipp-Sc commented Jun 7, 2023 •

edited

Loading

Philipp-Sc commented Jul 2, 2023 •

edited

Loading

Philipp-Sc commented Jul 2, 2023 •

edited

Loading

Re-generate topics and re-train fraud detection #1

Re-generate topics and re-train fraud detection #1

Comments

Philipp-Sc commented Apr 13, 2023 • edited Loading

Philipp-Sc commented Apr 13, 2023 • edited Loading

Philipp-Sc commented Jun 7, 2023 • edited Loading

Philipp-Sc commented Jul 2, 2023 • edited Loading

Philipp-Sc commented Jul 2, 2023 • edited Loading

Philipp-Sc commented Apr 13, 2023 •

edited

Loading

Philipp-Sc commented Apr 13, 2023 •

edited

Loading

Philipp-Sc commented Jun 7, 2023 •

edited

Loading

Philipp-Sc commented Jul 2, 2023 •

edited

Loading

Philipp-Sc commented Jul 2, 2023 •

edited

Loading