/plushcap/analysis/together-ai/together-ai-the-mamba-in-the-llama-distilling-and-accelerating-hybrid-models

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

What's this blog post about?

Company
Together AI

Date published
Sept. 9, 2024

Author(s)
Junxiong Wang, Daniele Paliotta, Avner May, Alexander M. Rush, Tri Dao

Word count
2582

Language
English

Hacker News points
4


By Matt Makai. 2021-2024.