How to serve 10,000 fine-tuned LLMs from a single GPU

What's this blog post about?

Company
Baseten

Date published
July 23, 2024

Author(s)
Pankaj Gupta, Philip Kiely

Word count
1895

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.