Skip to content

nvidia/embed-nemotron-dataset-v1

Text RetrievalText RankingSentence SimilarityText ClassificationMULTILINGUAL

Created by nvidia at 2025, the nvidia/embed-nemotron-dataset-v1 is a text retrieval dataset in MULTILINGUAL in Parquet format.

About nvidia/embed-nemotron-dataset-v1

Embed Nemotron Dataset V1 Versions Date Commit Changes 2026-01-05 8808454 Initial Release Dataset Description This dataset is a compilation of high quality fine-tuning datasets that support NVIDIA's release...

Details

Task
Text Retrieval, Text Ranking, Sentence Similarity, Text Classification
Language
MULTILINGUAL
Format
Parquet
Rows / instances
N/A
Creator
nvidia
Year
2025
Download

FAQ