Reinforcement learning from human feedback

Reinforcement learning from human feedback

video
advanced
Technical Deep Dive
60 min

About This Resource

Deep dive on RLHF state, progress, and limitations from one of the technique's pioneers.

Author:John Schulman
Source:YouTube

Learn with AI

Get personalized help understanding this resource from leading AI assistants

Explain It Simply
Tutor Mode
Test My Understanding

Click any AI assistant to open it with a pre-filled learning prompt. You can edit before sending.

Sign in to track your progress and mark this resource as completed.