Training 175B Parameter Language Models at 1000 GPU scale with Alpa and Ray
Company
Anyscale
Date published
March 22, 2023
Author(s)
Jiao Dong, Hao Zhang, Lianmin Zheng, Jun Gong, Jules S. Damji, Phi Nguyen
Word count
2713
Language
English
Hacker News points
None found.