nvidia/OpenScience

General NLP, English

nvidia/OpenScience is a general NLP dataset in English, created by nvidia and distributed in Parquet format.

About nvidia/OpenScience

Dataset Description: OpenScience is a multi-domain synthetic dataset designed to improve general-purpose reasoning in large language models (LLMs). The dataset contains multiple-choice question-answer pairs with detailed reasoning traces and sp...

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Creator: nvidia
Year: 2025