Zum Inhalt springen

#bypass

1 approved public terms with this tag.

Jailbreak

/ˈdʒeɪlbreɪk/noun/verb
AI & Technology

Automatischer Uebersetzungsentwurf (German) for "Jailbreak": A technique used to bypass the safety filters and content policies of an AI model, typically by framing harmful requests in ways the model's defenses don't recognize. Jailbreaks often use role-play scenarios, hypothetical framings, or encoded instructions to make the model comply with prohibited requests.

Beispielentwurf: The "DAN" jailbreak asked the model to pretend it was an AI with no restrictions.