The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Company: Together AI
Date published: Sept. 9, 2024
Author(s): Junxiong Wang, Daniele Paliotta, Avner May, Alexander M. Rush, Tri Dao
Word count: 2582
Language: English
Hacker News points: 4