monology/pile-uncopyrighted
General NLPEnglish
Created by monology at 2023, the monology/pile-uncopyrighted is a General NLP dataset in English in Parquet format.
About monology/pile-uncopyrighted
Pile Uncopyrighted
In response to authors demanding that LLMs stop using their works, here's a copy of The Pile with all copyrighted content removed.Please consider using this dataset to train your future LLMs, to respect authors and abide by c...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- monology
- Year
- 2023