Question 1

What is the OctoThinker/MegaMath-Web-Pro-Max dataset?

Accepted Answer

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

The Curation of MegaMath-Web-Pro-Max

Step 1: Uniformly and randomly sample millions of documents from the MegaMath-Web corpus, stratified by publication year;
Ste...

Question 2

Is OctoThinker/MegaMath-Web-Pro-Max a benchmark?

Accepted Answer

OctoThinker/MegaMath-Web-Pro-Max is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download OctoThinker/MegaMath-Web-Pro-Max?

Accepted Answer

OctoThinker/MegaMath-Web-Pro-Max is available at its source: https://huggingface.co/datasets/OctoThinker/MegaMath-Web-Pro-Max.

OctoThinker/MegaMath-Web-Pro-Max

About OctoThinker/MegaMath-Web-Pro-Max