CNI-v4 submission #149
It looks like this eval run failed. Please check the workflow logs to see what went wrong, then push a new commit to your PR to rerun the eval.
Eval run succeeded! Link to run: link

Here are the results of the submission(s):

CNI v4 — Cognitive Negligence Index
Release date: 2025-05-21

I've committed detailed results of this detector's performance on the test set to this PR.

Warning: No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated.

Warning: No aggregate score across all non-adversarial settings is reported here as some domains/generator models/decoding strategies/repetition penalties were not included in the submission.

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR.

Thanks for submitting to RAID!
No description provided.