Content Deep Dive
FlashAttention: Fast and memory-efficient exact attention with IO-Awareness
Company
Together AI
Date Published
May 17, 2023
Author
Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher RĂ©
Word count
347
Language
English
Hacker News points
None
URL
www.together.ai/blog/flashattentionfandm
Summary
No summary generated yet.