Content Deep Dive
RedPajama-Data-v2: An open dataset with 30 trillion tokens for training large language models
Company
Together AI
Date Published
Oct. 30, 2023
Author
Together
Word count
2223
Language
English
Hacker News points
1
URL
www.together.ai/blog/redpajama-data-v2
Summary
No summary generated yet.