BASED: Simple linear attention language models balance the recall-throughput tradeoff
What's this blog post about?
Company: Together AI
Date published: March 4, 2024
Author(s): Simran, Sabri, Michael, Aman, Silas, Dylan, James, Atri, Chris
Word count: 2303
Language: English
Hacker News points: 165