ronantakizawa/github-top-code
Text GenerationCODE
Ronantakizawa/github-top-code is a text generation dataset in CODE from ronantakizawa in Parquet format.
About ronantakizawa/github-top-code
GitHub Top Developer Source Code
A curated dataset of 1.3M+ source code files from GitHub's top ranked developers (2015-2025).
This dataset is based on the top ranked developers from this dataset: https://huggingface.co/datasets/ronantakizawa/g...
Details
- Task
- Text Generation
- Language
- CODE
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- ronantakizawa
- Year
- 2026