Saltar al contenido

#bypass

1 approved public terms with this tag.

Jailbreak

/ˈdʒeɪlbreɪk/noun/verb
AI & Technology

Borrador de traduccion automatica (Spanish) for "Jailbreak": A technique used to bypass the safety filters and content policies of an AI model, typically by framing harmful requests in ways the model's defenses don't recognize. Jailbreaks often use role-play scenarios, hypothetical framings, or encoded instructions to make the model comply with prohibited requests.

Ejemplo en borrador: The "DAN" jailbreak asked the model to pretend it was an AI with no restrictions.