/plushcap/analysis/together-ai/together-ai-mamba-3b-slimpj

Mamba-3B-SlimPJ: State-space models rivaling the best Transformer architecture

What's this blog post about?

Company
Together AI

Date published
Dec. 12, 2023

Author(s)
Tri Dao, Albert Gu

Word count
550

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.