Skip to content

CoNLL 2003 ++

Named Entity Recognition (NER)English

Created by Wang et al. at 2020, the CoNLL 2003 ++ is a Named Entity Recognition (NER) dataset in English containing 20,744 records in Text format.

About CoNLL 2003 ++

Similar to the original CoNLL except test set has been corrected for label mistakes. The dataset is split into training, development, and test sets, with 14,041, 3,250, and 3,453 instances respectively.

Details

Task
Named Entity Recognition (NER)
Language
English
Format
Text
Rows / instances
20,744
Creator
Wang et al.
Year
2020
Download Paper

Related Named Entity Recognition (NER) datasets

FAQ