Skip to content

ronantakizawa/github-top-code

Text GenerationCODE

Ronantakizawa/github-top-code is a text generation dataset in CODE from ronantakizawa in Parquet format.

About ronantakizawa/github-top-code

GitHub Top Developer Source Code A curated dataset of 1.3M+ source code files from GitHub's top ranked developers (2015-2025). This dataset is based on the top ranked developers from this dataset: https://huggingface.co/datasets/ronantakizawa/g...

Details

Task
Text Generation
Language
CODE
Format
Parquet
Rows / instances
N/A
Creator
ronantakizawa
Year
2026
Download

Related Text Generation datasets

FAQ