ChatGPT and InstructGPT: Aligning Language Models to Human Intention

Company

Arize

Date Published

Jan. 19, 2023

Author

Jason Lopatecki

Word count

204

Language

English

Hacker News points

None

URL

arize.com/blog/podcast-openai-chatgpt

Summary

InstructGPT was one of the first major applications of reinforcement learning with human feedback to train large language models, it is the precursor to ChatGPT, and its creators are now discussing the future of aligning language models to human intention. The podcast episode features Long Ouyang and Ryan Lowe, scientists at OpenAI who developed InstructGPT, and explores the major ideas behind this breakthrough technology.