raid icon indicating copy to clipboard operation
raid copied to clipboard

Trying luminar_classifier_RAID_none_PrismAI

Open TheItCrOw opened this issue 6 months ago • 1 comments

After the evaluation changes, testing new luminar model instances. I'll probably add more.

TheItCrOw avatar Jul 28 '25 09:07 TheItCrOw

Eval run succeeded! Link to run: link

Here are the results of the submission(s):

e5-small-lora

Release date: 2024-11-07

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 96.85 and a TPR of 85.69% at FPR=5% and 73.08% at FPR=1%. Without adversarial attacks, it achieved AUROC of 98.61 and a TPR of 93.87% at FPR=5% and 82.81% at FPR=1%.

luminar_classifier_RAID_none_PrismAI

Release date: 2025-05-17

I've committed detailed results of this detector's performance on the test set to this PR.

[!WARNING] Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved.

LLMDet

Release date: 2023-05-24

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 62.90 and a TPR of 26.70% at FPR=5% and 14.91% at FPR=1%. Without adversarial attacks, it achieved AUROC of 65.92 and a TPR of 33.40% at FPR=5% and 20.16% at FPR=1%.

Luminar

Release date: 2025-05-17

I've committed detailed results of this detector's performance on the test set to this PR.

[!WARNING] Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved.

SpeedAI

Release date: 2025-05-08

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 99.85 and a TPR of 99.62% at FPR=5% and 98.55% at FPR=1%. Without adversarial attacks, it achieved AUROC of 99.91 and a TPR of 99.86% at FPR=5% and 99.55% at FPR=1%.

It's AI

Release date: 2025-04-01

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.27 and a TPR of 94.15% at FPR=5% and 89.36% at FPR=1%. Without adversarial attacks, it achieved AUROC of 95.21 and a TPR of 95.75% at FPR=5% and 92.22% at FPR=1%.

RoBERTa-base (GPT2)

Release date: 2019-08-24

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 72.29 and a TPR of 51.77% at FPR=5% and 34.57% at FPR=1%. Without adversarial attacks, it achieved AUROC of 74.95 and a TPR of 59.19% at FPR=5% and 39.83% at FPR=1%.

RoBERTa (ChatGPT)

Release date: 2023-01-18

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 60.26 and a TPR of 26.64% at FPR=5% and 19.63% at FPR=1%. Without adversarial attacks, it achieved AUROC of 77.29 and a TPR of 42.52% at FPR=5% and 32.84% at FPR=1%.

Desklib

Release date: 2024-10-03

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.91 and a TPR of 83.76% at FPR=5% and 68.22% at FPR=1%. Without adversarial attacks, it achieved AUROC of 97.39 and a TPR of 92.38% at FPR=5% and 86.27% at FPR=1%.

RoBERTa-large (GPT2)

Release date: 2019-08-24

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 69.28 and a TPR of 50.70% at FPR=5% and 34.67% at FPR=1%. Without adversarial attacks, it achieved AUROC of 71.60 and a TPR of 55.73% at FPR=5% and 38.83% at FPR=1%.

SuperAnnotate AI Detector

Release date: 2024-10-27

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 88.87 and a TPR of 64.87% at FPR=5% and 38.87% at FPR=1%. Without adversarial attacks, it achieved AUROC of 90.96 and a TPR of 70.34% at FPR=5% and 44.53% at FPR=1%.

GLTR

Release date: 2019-06-10

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 70.90 and a TPR of 51.48% at FPR=5% and 36.48% at FPR=1%. Without adversarial attacks, it achieved AUROC of 72.68 and a TPR of 59.69% at FPR=5% and 45.57% at FPR=1%.

Desklib AI Text Detector v1.01

Release date: 2025-02-16

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 94.83 and a TPR of 91.17% at FPR=5% and 76.47% at FPR=1%. Without adversarial attacks, it achieved AUROC of 97.34 and a TPR of 94.87% at FPR=5% and 89.11% at FPR=1%.

Binoculars

Release date: 2024-01-22

I've committed detailed results of this detector's performance on the test set to this PR.

[!WARNING] No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated. Without adversarial attacks, it achieved AUROC of 84.40 and a TPR of 78.98% at FPR=5% and 69.54% at FPR=1%.

RADAR

Release date: 2023-07-07

I've committed detailed results of this detector's performance on the test set to this PR.

On the RAID dataset as a whole (aggregated across all generation models, domains, decoding strategies, repetition penalties, and adversarial attacks), it achieved an AUROC of 81.92 and a TPR of 63.91% at FPR=5% and 43.12% at FPR=1%. Without adversarial attacks, it achieved AUROC of 81.89 and a TPR of 65.61% at FPR=5% and 48.09% at FPR=1%.

Gaussian Extreme

Release date: 2025-05-17

I've committed detailed results of this detector's performance on the test set to this PR.

[!WARNING] Failed to find threshold values that achieve False Positive Rate(s): (['5%', '1%']) on all domains. This submission will not appear in the main leaderboard for those FPR values; it will only be visible within the splits in which the target FPR was achieved.

FastDetectGPT

Release date: 2023-10-08

I've committed detailed results of this detector's performance on the test set to this PR.

[!WARNING] No aggregate score across all settings is reported here as some domains/generator models/decoding strategies/repetition penalties/adversarial attacks were not included in the submission. This submission will not appear in the main leaderboard; it will only be visible within the splits in which all samples were evaluated.

[!WARNING] No aggregate score across all non-adversarial settings is reported here as some domains/generator models/decoding strategies/repetition penalties were not included in the submission.

If all looks well, a maintainer will come by soon to merge this PR and your entry/entries will appear on the leaderboard. If you need to make any changes, feel free to push new commits to this PR. Thanks for submitting to RAID!

github-actions[bot] avatar Jul 28 '25 22:07 github-actions[bot]