fancyzhx/dbpedia_14
Text ClassificationENcc-by-sa-3.0
Created by fancyzhx at 2022, the fancyzhx/dbpedia_14 is a text classification dataset in EN containing 630,000 records in Parquet format. With 21.1K downloads and 35 likes, it is actively used by the community. It is released under the cc-by-sa-3.0 license and is a 100K<n<1M-scale dataset.
About fancyzhx/dbpedia_14
Dataset Card for DBpedia14
Dataset Summary
The DBpedia ontology classification dataset is constructed by picking 14 non-overlapping classes
from DBpedia 2014. They are listed in classes.txt. From each of thse 14 ontology classes, we
...
Details
- Task
- Text Classification
- Language
- EN
- Format
- Parquet
- Rows / instances
- 630000
- Size
- 100K<n<1M
- Creator
- fancyzhx
- Year
- 2022
- License
- cc-by-sa-3.0
- Downloads
- 21074
- Likes
- 35