oscar-corpus/OSCAR-2301
Fill Mask, Text Generation, English
Created by oscar-corpus in 2023, oscar-corpus/OSCAR-2301 is an English fill-mask and text-generation dataset in Parquet format.
About oscar-corpus/OSCAR-2301
The Open Super-large Crawled Aggregated coRpus (OSCAR) is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the Ungoliant architecture.
Details
- Task: Fill Mask, Text Generation
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: oscar-corpus
- Year: 2023