openbmb/Ultra-FineWeb
Text GenerationEN, ZH
The openbmb/Ultra-FineWeb dataset is a EN, ZH text generation resource from openbmb at 2025.
About openbmb/Ultra-FineWeb
Ultra-FineWeb
π Technical Report |
π¦ UltraData Collection |
π UltraData |
π€ MiniCPM4 Series |
π€ MiniCPM5 Series
English |
δΈζ
π Introduction
Ultra-FineWeb is a large-scale, high-quality, and efficiently-filtered datas...
Details
- Task
- Text Generation
- Language
- EN, ZH
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- openbmb
- Year
- 2025