Company
Date Published
Author
Jason Lopatecki
Word count
204
Language
English
Hacker News points
None

Summary

InstructGPT was one of the first major applications of reinforcement learning from human feedback (RLHF) to training large language models, and it is the precursor to ChatGPT. This podcast episode features Long Ouyang and Ryan Lowe, scientists at OpenAI who developed InstructGPT. It explores the major ideas behind this breakthrough and the future of aligning language models to human intention.