openbmb/UltraData-Math

Text Generation | EN, ZH

The openbmb/UltraData-Math dataset is an English and Chinese (EN, ZH) text generation resource published by openbmb in 2026.

About openbmb/UltraData-Math

UltraData-Math is a large-scale, high-quality mathematical pre-training dataset totaling 290B+ tokens across three progressive tiers: L1 (170.5B tokens web corpus), L2 (33.7B token...

Details

Task: Text Generation
Language: EN, ZH
Format: Parquet
Rows / instances: N/A
Creator: openbmb
Year: 2026