eriktks/conll2003
Token Classification · EN
The eriktks/conll2003 dataset is an English (EN) token classification resource published by eriktks in 2022, comprising 20,744 examples. It has about 30K downloads and 170 likes. Its license is listed as "other", and it falls in the 10K<n<100K size category.
About eriktks/conll2003
The shared task of CoNLL-2003 concerns language-independent named entity recognition. We will concentrate on
four types of named entities: persons, locations, organizations and names of miscellaneous entities that do
not belong to the previous thr...
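
To make the four entity types concrete, here is a minimal sketch of how token-level labels can be grouped into entity spans. It assumes BIO-style tags such as B-PER, I-PER, B-LOC, B-ORG and B-MISC, which the description above does not spell out. The helper name `extract_entities` and the sample sentence are hypothetical and are not taken from the dataset.

```python
from typing import List, Optional, Tuple


def extract_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (entity_text, entity_type) pairs.

    Assumes tags look like "B-PER", "I-PER" or "O" (an assumption about
    the label scheme, not something stated on this page).
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != current_type
        )
        if starts_new:
            # Close any open entity, then start a new one.
            if current_tokens:
                entities.append((" ".join(current_tokens), current_type))
            current_tokens = [token]
            current_type = tag[2:]
        elif tag.startswith("I-"):
            # Continuation of the entity that is currently open.
            current_tokens.append(token)
        else:
            # An "O" tag closes any open entity.
            if current_tokens:
                entities.append((" ".join(current_tokens), current_type))
            current_tokens = []
            current_type = None

    # Flush an entity that runs to the end of the sentence.
    if current_tokens:
        entities.append((" ".join(current_tokens), current_type))
    return entities


# Illustrative sentence (made up for this example).
tokens = ["Alice", "Smith", "visited", "Paris", "for", "Acme", "Corp", "."]
tags = ["B-PER", "I-PER", "O", "B-LOC", "O", "B-ORG", "I-ORG", "O"]
print(extract_entities(tokens, tags))
```

In this sketch, an I- tag whose type differs from the open entity is treated as the start of a new entity, which is one common way to tolerate inconsistent tagging.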
Details
- Task: Token Classification
- Language: EN
- Format: Parquet
- Rows / instances: 20,744
- Size: 10K<n<100K
- Creator: eriktks
- Year: 2022
- License: other
- Downloads: 30,015
- Likes: 170