oscar-corpus/oscar
Text Generation, Fill Mask | AF, ALS, AM
oscar-corpus/oscar is a text generation and fill-mask dataset in AF, ALS, and AM, published by oscar-corpus in Parquet format.
About oscar-corpus/oscar
The Open Super-large Crawled ALMAnaCH coRpus (OSCAR) is a huge multilingual corpus obtained by language classification and filtering of the Common Crawl corpus using the goclassy architecture.
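To make "language classification and filtering" concrete, here is a toy sketch of the general idea: label each line with a language, drop lines that are too short or classified with low confidence, and group the rest by language. It is not the goclassy implementation. The marker-word classifier, the function names (`toy_classifier`, `classify_and_filter`), and the thresholds are all hypothetical and chosen only for illustration; a real pipeline would rely on a trained language-identification model.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical classifier signature: text -> (language code, confidence in [0, 1]).
Classifier = Callable[[str], Tuple[str, float]]

# Toy marker words, purely illustrative; not taken from any real model.
TOY_MARKERS = {
    "af": {"die", "en", "nie", "het"},
    "en": {"the", "and", "not", "is"},
}


def toy_classifier(line: str) -> Tuple[str, float]:
    """Score each language by the share of tokens matching its marker words."""
    tokens = line.lower().split()
    if not tokens:
        return ("und", 0.0)
    best_lang, best_score = "und", 0.0  # "und" = undetermined
    for lang, markers in TOY_MARKERS.items():
        score = sum(token in markers for token in tokens) / len(tokens)
        if score > best_score:
            best_lang, best_score = lang, score
    return (best_lang, best_score)


def classify_and_filter(
    lines: Iterable[str],
    classify: Classifier = toy_classifier,
    min_confidence: float = 0.3,  # assumed threshold, for illustration only
    min_chars: int = 10,          # assumed minimum line length
) -> Dict[str, List[str]]:
    """Group lines by predicted language, dropping short or uncertain lines."""
    buckets: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        text = line.strip()
        if len(text) < min_chars:
            continue  # too short to classify reliably
        lang, confidence = classify(text)
        if confidence >= min_confidence:
            buckets[lang].append(text)
    return dict(buckets)


if __name__ == "__main__":
    sample = [
        "Die kat is nie hier nie en die hond het geslaap",
        "The dog is not here and the cat is asleep",
        "ok",
        "12345 67890 !!!! ????",
    ]
    for lang, kept in classify_and_filter(sample).items():
        print(lang, kept)
```

The classifier is passed in as a parameter so that the filtering logic stays independent of whichever language-identification model is actually used.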
Details
- Task: Text Generation, Fill Mask
- Language: AF, ALS, AM
- Format: Parquet
- Rows / instances: N/A
- Creator: oscar-corpus
- Year: 2022