Skip to content

jtatman/python-code-dataset-500k

Text GenerationEnglishmit

Jtatman/python-code-dataset-500k is a text generation dataset in English from jtatman with 559,515 records in Parquet format. It is distributed under the mit license and falls in the 100K<n<1M size category, and has been downloaded 922 times.

About jtatman/python-code-dataset-500k

Attention: This dataset is a summary and reformat pulled from github code. You should make your own assumptions based on this. In fact, there is another dataset I formed through parsing that addresses several points: out of 500k python related...

Details

Task
Text Generation
Language
English
Format
Parquet
Rows / instances
559515
Size
100K<n<1M
Creator
jtatman
Year
2024
License
mit
Downloads
922
Likes
80
Download Homepage

Related Text Generation datasets

FAQ