Constitutional AI
[/ˌkɒnstɪˈtjuːʃənəl eɪ aɪ/]
nounAI & Technology#ai#safety#alignment#anthropic0 views1 definitions
Definitions
1
+1111
A training methodology developed by Anthropic where a set of guiding principles (a "constitution") is used to self-supervise and refine AI outputs. The model critiques and rewrites its own responses according to the constitution, reducing the need for human labelers for harmful content.
“Constitutional AI lets the model identify and self-correct its own harmful outputs using defined principles.”
by @aisafety1/1/1970