Skip to content

oscar-corpus/oscar

Text GenerationFill MaskAF, ALS, AM

Oscar-corpus/oscar is a text generation dataset in AF, ALS, AM from oscar-corpus in Parquet format.

About oscar-corpus/oscar

The Open Super-large Crawled ALMAnaCH coRpus is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.\

Details

Task
Text Generation, Fill Mask
Language
AF, ALS, AM
Format
Parquet
Rows / instances
N/A
Creator
oscar-corpus
Year
2022
Download

Related Text Generation, Fill Mask datasets

FAQ