microsoft/NextCoderDataset
Text GenerationEN
The microsoft/NextCoderDataset dataset is a EN text generation resource from microsoft at 2025.
About microsoft/NextCoderDataset
NextCoderDataset
GitHub | Paper
NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'2025)
Data Overview
NextCoderdataset is the instruction-variant of synthetic dataset, used for training models on...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- microsoft
- Year
- 2025