RLHF

[/ɑːr el eɪtʃ ef/]

Share Term

nounAI & Technology#ai#training#alignment#human-feedback

0 views1 definitions

Available in:

English Español Français Deutsch 日本語中文 العربية Português 한국어 हिन्दी

Definitions

Machine-assisted language draft. Human review still needed.

मशीन-सहायता अनुवाद मसौदा (Hindi) for "RLHF": Reinforcement Learning from Human Feedback — a training technique used to align language models with human preferences. Human raters compare model outputs and choose the better response; these preferences train a reward model which then guides further fine-tuning via reinforcement learning.

“उदाहरण मसौदा: RLHF is the key step that turns a raw language model into a helpful, harmless assistant.”

by @dictionary_auto_translate1/1/1970

Constitutional AI
AI & Technology
मशीन-सहायता अनुवाद मसौदा (Hindi) for "Constitutional AI": A training methodology developed by Anthropic where a set of guiding principles (a "constitution") is used to self-supervi...
Fine-Tuning
AI & Technology
मशीन-सहायता अनुवाद मसौदा (Hindi) for "Fine-Tuning": The process of further training a pre-trained model on a smaller, task-specific dataset to adapt its behavior for a particular d...
Synthetic Data
AI & Technology
मशीन-सहायता अनुवाद मसौदा (Hindi) for "Synthetic Data": Artificially generated data that mimics the statistical properties of real-world data, used for training or testing AI models...
Agentic
AI & Technology
मशीन-सहायता अनुवाद मसौदा (Hindi) for "Agentic": Describing AI systems capable of autonomous action, planning, and decision-making. An agentic AI can break down tasks, use tools, an...
AI Alignment
AI & Technology
मशीन-सहायता अनुवाद मसौदा (Hindi) for "AI Alignment": The research field focused on ensuring that AI systems pursue goals that match human values and intentions. A misaligned AI mig...
Chain of Thought
AI & Technology
मशीन-सहायता अनुवाद मसौदा (Hindi) for "Chain of Thought": A prompting technique where a language model is encouraged or required to show its step-by-step reasoning before providing ...

Definitions

Related Terms