RLHF

[/ɑːr el eɪtʃ ef/]

Share Term

nounAI & Technology#ai#training#alignment#human-feedback

0 views1 definitions

Available in:

English Español Français Deutsch 日本語中文 العربية Português 한국어 हिन्दी

Definitions

Machine-assisted language draft. Human review still needed.

Automatischer Uebersetzungsentwurf (German) for "RLHF": Reinforcement Learning from Human Feedback — a training technique used to align language models with human preferences. Human raters compare model outputs and choose the better response; these preferences train a reward model which then guides further fine-tuning via reinforcement learning.

“Beispielentwurf: RLHF is the key step that turns a raw language model into a helpful, harmless assistant.”

by @dictionary_auto_translate1.1.1970

Constitutional AI
AI & Technology
Automatischer Uebersetzungsentwurf (German) for "Constitutional AI": A training methodology developed by Anthropic where a set of guiding principles (a "constitution") is used to s...
Fine-Tuning
AI & Technology
Automatischer Uebersetzungsentwurf (German) for "Fine-Tuning": The process of further training a pre-trained model on a smaller, task-specific dataset to adapt its behavior for a p...
Synthetic Data
AI & Technology
Automatischer Uebersetzungsentwurf (German) for "Synthetic Data": Artificially generated data that mimics the statistical properties of real-world data, used for training or testin...
Agentic
AI & Technology
Automatischer Uebersetzungsentwurf (German) for "Agentic": Describing AI systems capable of autonomous action, planning, and decision-making. An agentic AI can break down tasks, us...
AI Alignment
AI & Technology
Automatischer Uebersetzungsentwurf (German) for "AI Alignment": The research field focused on ensuring that AI systems pursue goals that match human values and intentions. A misali...
Chain of Thought
AI & Technology
Automatischer Uebersetzungsentwurf (German) for "Chain of Thought": A prompting technique where a language model is encouraged or required to show its step-by-step reasoning before...

Definitions

Related Terms