How ChatGPT actually works
ChatGPT is based on the Reinforcement Learning with Human Feedback (RLHF) methodology, which consists of three main steps: supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), and evaluating the resulting model. In the SFT step, a pre-trained language model is fine-tuned on high-quality instruction data. In the RLHF step, the model is trained with an additional reward model based on human feedback to optimize its output for human preferences. Finally, the performance of the resulting model is evaluated by human labelers on several criteria including helpfulness, truthfulness, and harmlessness.
Company
AssemblyAI
Date published
Dec. 23, 2022
Author(s)
Marco Ramponi
Word count
3262
Hacker News points
4
Language
English