Detector submission update#131
Conversation
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): ODean v17z+soulRelease date: 2026-05-04 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 89.97 and a TPR of 57.91% at FPR=5% and 39.89% at FPR=1%. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
|
@kprofundis thanks for the submission and the kind message! Let me know if you'd like this to be merged. |
|
hey Liam! thank you for testing. could you merge my last results, that was aur 93? the V14 model? i‘m going to be fixing the bug in this recent submission but would love to see my previous score on the leadership board! |
|
Sure thing @kprofundis , I reopened your previous PR #124 and kicked off eval. Once that's done I'll ask for a final approval from you on that thread. |
|
Thank you! I’ll keep an eye out for that message |
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): ODean v17z+soul × v14 routed-merge v2Release date: 2026-05-04 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.30 and a TPR of 82.76% at FPR=5% and 69.20% at FPR=1%. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
af89d34 to
c777994
Compare
c777994 to
b965649
Compare
|
Eval run succeeded! Link to run: link Here are the results of the submission(s): Kareem Elsamadicy (Independent Researcher)Release date: 2026-05-04 I've committed detailed results of this detector's performance on the test set to this PR. On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 92.93 and a TPR of 82.42% at FPR=5% and 71.89% at FPR=1%. If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID! |
Independent research detector. Display name set via metadata.json. GitHub: https://github.com/KProfundis