FlashAttention: Fast and memory-efficient exact attention with IO-Awareness

Company
Together AI

Date published
May 17, 2023

Author(s)
Tri Dao, Daniel Y. Fu, Stefano Ermon, Atri Rudra, Christopher Ré

Word count
347

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.