togethercomputer/RedPajama-Data-1T

Text Generation, EN

togethercomputer/RedPajama-Data-1T is an English-language text generation dataset from togethercomputer, distributed in Parquet format.

About togethercomputer/RedPajama-Data-1T

RedPajama is a clean-room, fully open-source reproduction of the LLaMA training dataset.

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: togethercomputer
Year: 2023