/plushcap/analysis/together-ai/together-ai-redpajama-data-v2

RedPajama-Data-v2: An open dataset with 30 trillion tokens for training large language models

What's this blog post about?

Company
Together AI

Date published
Oct. 30, 2023

Author(s)
Together

Word count
2223

Hacker News points
1

Language
English


By Matt Makai. 2021-2024.