Using fractional H100 GPUs for efficient model serving
What's this blog post about?
Company
Baseten
Date published
March 28, 2024
Author(s)
Matt Howard, Vlad Shulman, Pankaj Gupta, Philip Kiely
Word count
1086
Language
English
Hacker News points
None found.