Skip to content

coastalcph/multi_eurlex

Text ClassificationBG, CS, DAcc-by-sa-4.0

Coastalcph/multi_eurlex is a text classification dataset in BG, CS, DA from coastalcph with 1,109,739 records in Parquet format. It is distributed under the cc-by-sa-4.0 license and falls in the 10K<n<100K size category, and has been downloaded 1.7K times.

About coastalcph/multi_eurlex

MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource). Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU. As with the English EURLEX, the goal is to predict the relevant E...

Details

Task
Text Classification
Language
BG, CS, DA
Format
Parquet
Rows / instances
1109739
Size
10K<n<100K
Creator
coastalcph
Year
2022
License
cc-by-sa-4.0
Downloads
1731
Likes
46
Download Homepage

Related Text Classification datasets

FAQ