Content Deep Dive
Using fractional H100 GPUs for efficient model serving
Company
Baseten
Date Published
March 28, 2024
Authors
Matt Howard, Vlad Shulman, Pankaj Gupta, Philip Kiely
Word count
1086
Language
English
Hacker News points
None
URL
www.baseten.co/blog/using-fractional-h100-gpus-for-efficient-model-serving