Skip to content

oscar-corpus/OSCAR-2301

Fill MaskText GenerationEnglish

Created by oscar-corpus at 2023, the oscar-corpus/OSCAR-2301 is a fill mask dataset in English in Parquet format.

About oscar-corpus/OSCAR-2301

The Open Super-large Crawled Aggregated coRpus is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the Ungoliant architecture.\

Details

Task
Fill Mask, Text Generation
Language
English
Format
Parquet
Rows / instances
N/A
Creator
oscar-corpus
Year
2023
Download

Related Fill Mask, Text Generation datasets

FAQ