How to serve 10,000 fine-tuned LLMs from a single GPU
What's this blog post about?
Company
Baseten
Date published
July 23, 2024
Author(s)
Pankaj Gupta, Philip Kiely
Word count
1895
Language
English
Hacker News points
None found.