HN Points | HN Title (Links to original post) | Submitted Date |
---|---|---|
287 | FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-Precision | 2024-07-11 |
221 | Paving the way to efficient architectures: StripedHyena-7B | 2023-12-08 |
165 | Based: Simple linear attention language models | 2024-03-05 |
143 | Dragonfly: A large vision-language model with multi-resolution zoom | 2024-06-06 |
236 | RedPajama v2 Open Dataset with 30T Tokens for Training LLMs | 2023-10-30 |