allenai/dolma3_dolmino_pool

Text Generation | EN | odc-by

Created by allenai in 2025, allenai/dolma3_dolmino_pool is an English-language (EN) text generation dataset distributed in Parquet format. It has 17.6K downloads and 8 likes, and it is released under the odc-by license.

About allenai/dolma3_dolmino_pool

⚠️ IMPORTANT NOTICE ⚠️ This is the Dolma 3 Dolmino pool; it hasn't been mixed. If you are interested in the data used to train:
Olmo 3 7B: allenai/dolma3_dolmino_mix-100B-1025
Olmo 3 32B: allenai/dolma3_dolmino_mix-100B-1125
Dolma...

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: N/A
Creator: allenai
Year: 2025
License: odc-by
Downloads: 17,649
Likes: 8
