Skip to content

Constitutional AI

[/ˌkɒnstɪˈtjuːʃənəl eɪ aɪ/]

nounAI & Technology#ai#safety#alignment#anthropic
0 views1 definitions

Definitions

1
+1111

A training methodology developed by Anthropic where a set of guiding principles (a "constitution") is used to self-supervise and refine AI outputs. The model critiques and rewrites its own responses according to the constitution, reducing the need for human labelers for harmful content.

Constitutional AI lets the model identify and self-correct its own harmful outputs using defined principles.
by @aisafety1/1/1970

Related Terms

Related terms are generated only from public tags, classes, translations, and explicit relationships. No unavailable semantic relationships are fabricated.