nvidia/embed-nemotron-dataset-v1
Text RetrievalText RankingSentence SimilarityText ClassificationMULTILINGUAL
Created by nvidia at 2025, the nvidia/embed-nemotron-dataset-v1 is a text retrieval dataset in MULTILINGUAL in Parquet format.
About nvidia/embed-nemotron-dataset-v1
Embed Nemotron Dataset V1
Versions
Date
Commit
Changes
2026-01-05
8808454
Initial Release
Dataset Description
This dataset is a compilation of high quality fine-tuning datasets that support NVIDIA's release...
Details
- Task
- Text Retrieval, Text Ranking, Sentence Similarity, Text Classification
- Language
- MULTILINGUAL
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- nvidia
- Year
- 2025