#moderation
1 approved public terms with this tag.
Guardrails
/ˈɡɑːrdreɪlz/noun
Borrador de traduccion automatica (Spanish) for "Guardrails": Safety constraints and filters applied to AI systems to prevent harmful, offensive, or out-of-scope outputs. Guardrails can be implemented at the model level (via training), prompt level (system instructions), or application level (output classifiers) to keep AI behavior within acceptable boundaries.
“Ejemplo en borrador: The guardrails blocked the model from providing detailed instructions on dangerous activities.”