togethercomputer/RedPajama-Data-1T
Text Generation, EN
togethercomputer/RedPajama-Data-1T is an English-language (EN) text generation dataset from togethercomputer, distributed in Parquet format.
About togethercomputer/RedPajama-Data-1T
RedPajama is a clean-room, fully open-source reproduction of the LLaMA training dataset.
Details
- Task: Text Generation
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: togethercomputer
- Year: 2023