Running Llama 3 8B with TensorRT-LLM on Serverless GPUs
What's this blog post about?
Company
Cerebrium
Date published
May 16, 2024
Author(s)
Michael Louis
Word count
1410
Language
English
Hacker News points
None found.