utter-project/EuroWeb-2512
General NLPEnglish
The utter-project/EuroWeb-2512 dataset is a English General NLP resource from utter-project at 2026.
About utter-project/EuroWeb-2512
EuroWeb-2512
EuroWeb is a dataset of collecting multilingual web data from various sources. It was processed with standard practices and then classified with utter-project/EuroFilter-v1.
For more information read the EuroLLM-22B: Techni...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- utter-project
- Year
- 2026