PyRIT
FEAT: Add nesting and prepend/append jailbreaks from papers
Description
Add jailbreak prompt templates from the following papers:
- A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily
  - Code nesting
  - Table nesting
  - Text continuation nesting
- Jailbroken: How Does LLM Safety Training Fail?
  - Prefix injection
  - Refusal suppression
  - Style injection
  - Wikipedia with title
Tests and Documentation
No tests were added for the templates.
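If template tests are wanted later, a minimal sketch of a sanity check might look like the following. It assumes each template embeds the user prompt through a single `{{ prompt }}` placeholder; that marker, the `missing_placeholder` helper, and the stand-in template bodies are hypothetical and not taken from this PR.

```python
import re

# Assumed placeholder syntax; the templates in this PR may use a different marker.
PLACEHOLDER = re.compile(r"\{\{\s*prompt\s*\}\}")


def missing_placeholder(templates: dict[str, str]) -> list[str]:
    """Return the names of templates that do not contain exactly one placeholder."""
    return sorted(
        name
        for name, text in templates.items()
        if len(PLACEHOLDER.findall(text)) != 1
    )


# Stand-in bodies for illustration only; these are not the real template texts.
EXAMPLE_TEMPLATES = {
    "code_nesting": "<template text> {{ prompt }} <template text>",
    "table_nesting": "<template text> {{prompt}}",
    "broken_example": "<template text with no placeholder>",
}

print(missing_placeholder(EXAMPLE_TEMPLATES))  # prints ['broken_example']
```

A check like this only verifies that the placeholder is present; it says nothing about whether a template is effective.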
@microsoft-github-policy-service agree company="Centific"
Can you make the description and title more specific (e.g. which jailbreaks/papers)?