added a submission named gambit-mage
Eval run succeeded! Link to run: link
Here are the results of the submission(s):
Gambit-mage
Release date: 2025-01-19
I've committed detailed results of this detector's performance on the test set to this PR.
On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 86.72 and a TPR of 61.85% at FPR=5% and 47.86% at FPR=1%. Without adversarial attacks, it achieved AUROC of 91.21 and a TPR of 75.43% at FPR=5% and 62.24% at FPR=1%.
If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!
@yashasviraii thanks for the submission! Let me know at any point if you'd like to merge!
@liamdugan Thanks for evaluating the first one! Could you evaluate this one too so I can compare results? Also, will these results be added to the leaderboard?
Eval run succeeded! Link to run: link
Here are the results of the submission(s):
Mage-baseline
Release date: 2024-05-21
I've committed detailed results of this detector's performance on the test set to this PR.
[!WARNING] Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved.
If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!
@yashasviraii they will be added to the leaderboard if we merge the pull request. If you want only one of them to be added at a time then you can make a commit removing the other submission from this PR and add it as a separate PR.
Let me know what you'd like to do