Content Deep Dive
Mamba-3B-SlimPJ: State-space models rivaling the best Transformer architecture
Company
Together AI
Date Published
Dec. 12, 2023
Author
Tri Dao, Albert Gu
Word count
550
Language
English
Hacker News points
None
URL
www.together.ai/blog/mamba-3b-slimpj
Summary
No summary generated yet.