/plushcap/analysis/baseten/baseten-using-fractional-h100-gpus-for-efficient-model-serving

Using fractional H100 GPUs for efficient model serving

What's this blog post about?

Company
Baseten

Date published
March 28, 2024

Author(s)
Matt Howard, Vlad Shulman, Pankaj Gupta, Philip Kiely

Word count
1086

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.