/plushcap/analysis/anyscale/anyscale-training-175b-parameter-language-models-at-1000-gpu-scale-with-alpa-and-ray

Training 175B Parameter Language Models at 1000 GPU scale with Alpa and Ray

What's this blog post about?

Company
Anyscale

Date published
March 22, 2023

Author(s)
Jiao Dong, Hao Zhang, Lianmin Zheng, Jun Gong, Jules S. Damji, Phi Nguyen

Word count
2713

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.