FlashAttention: Fast and memory-efficient exact attention with IO-Awareness
What's this blog post about?
Company
Together AI
Date published
May 17, 2023
Author(s)
Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher RĂ©
Word count
347
Language
English
Hacker News points
None found.