Skip to content

monology/pile-uncopyrighted

General NLPEnglish

Created by monology at 2023, the monology/pile-uncopyrighted is a General NLP dataset in English in Parquet format.

About monology/pile-uncopyrighted

Pile Uncopyrighted In response to authors demanding that LLMs stop using their works, here's a copy of The Pile with all copyrighted content removed.Please consider using this dataset to train your future LLMs, to respect authors and abide by c...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Creator
monology
Year
2023
Download

Related General NLP datasets

FAQ