GAIR/daVinci-Dev

General NLP, EN

GAIR/daVinci-Dev is an English-language General NLP dataset from GAIR, published in Parquet format. Its license is listed as "other", it falls in the 1M<n<10M size category, and it has been downloaded about 2.9K times.

About GAIR/daVinci-Dev

daVinci-Dev Dataset: Agent-native Mid-training for Software Engineering

This dataset release contains agent-native trajectories used in daVinci-Dev: Agent-native Mid-training for Software Engineering.

Dataset at a glance

It i...

Details

Task: General NLP
Language: EN
Format: Parquet
Rows / instances: N/A
Size: 1M<n<10M
Creator: GAIR
Year: 2026
License: other
Downloads: 2891
Likes: 51