Skip to content

eriktks/conll2003

Token ClassificationEN

The eriktks/conll2003 dataset is a EN token classification resource from eriktks at 2022 comprising 20,744 examples. With 30K downloads and 170 likes, it is actively used by the community. It is released under the other license and is a 10K<n<100K-scale dataset.

About eriktks/conll2003

The shared task of CoNLL-2003 concerns language-independent named entity recognition. We will concentrate on four types of named entities: persons, locations, organizations and names of miscellaneous entities that do not belong to the previous thr...

Details

Task
Token Classification
Language
EN
Format
Parquet
Rows / instances
20744
Size
10K<n<100K
Creator
eriktks
Year
2022
License
other
Downloads
30015
Likes
170
Download Homepage

Related Token Classification datasets

FAQ