Helsinki-NLP/europarl
TranslationBG, CS, DA
The Helsinki-NLP/europarl dataset is a BG, CS, DA translation resource from Helsinki-NLP at 2022 comprising 185,506,545 examples. With 10.9K downloads and 39 likes, it is actively used by the community. It is released under the unknown license and is a 100M<n<1B-scale dataset.
About Helsinki-NLP/europarl
Dataset Card for OPUS Europarl (European Parliament Proceedings Parallel Corpus)
Dataset Summary
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh).
The main intended use is t...
Details
- Task
- Translation
- Language
- BG, CS, DA
- Format
- Parquet
- Rows / instances
- 185506545
- Size
- 100M<n<1B
- Creator
- Helsinki-NLP
- Year
- 2022
- License
- unknown
- Downloads
- 10855
- Likes
- 39