safety-evaluation topic

List safety-evaluation repositories

Benchmark evaluation code for "SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal" (ICLR 2025)

Stars: 108

Learn how to observe, manage, and scale agentic AI apps using Azure AI Foundry with this hands-on workshop