AI Alignment
[/eɪ aɪ əˈlaɪnmənt/]
nounAI & Technology#ai#safety#ethics#goals0 views1 definitions
Definitions
1
+1647
The research field focused on ensuring that AI systems pursue goals that match human values and intentions. A misaligned AI might optimize for a metric that appears correct but produces harmful or unintended outcomes at scale.
“AI alignment researchers worry that optimizing for user engagement could misalign with genuine user wellbeing.”
by @aisafety1/1/1970