RedPajama-Data-v2: An open dataset with 30 trillion tokens for training large language models
Company
Together AI
Date published
Oct. 30, 2023
Author(s)
Together
Word count
2223
Hacker News points
1
Language
English