Skip to content

fancyzhx/dbpedia_14

Text ClassificationENcc-by-sa-3.0

Created by fancyzhx at 2022, the fancyzhx/dbpedia_14 is a text classification dataset in EN containing 630,000 records in Parquet format. With 21.1K downloads and 35 likes, it is actively used by the community. It is released under the cc-by-sa-3.0 license and is a 100K<n<1M-scale dataset.

About fancyzhx/dbpedia_14

Dataset Card for DBpedia14 Dataset Summary The DBpedia ontology classification dataset is constructed by picking 14 non-overlapping classes from DBpedia 2014. They are listed in classes.txt. From each of thse 14 ontology classes, we ...

Details

Task
Text Classification
Language
EN
Format
Parquet
Rows / instances
630000
Size
100K<n<1M
Creator
fancyzhx
Year
2022
License
cc-by-sa-3.0
Downloads
21074
Likes
35
Download Homepage

Related Text Classification datasets

FAQ