jtatman/python-code-dataset-500k
Text GenerationEnglishmit
Jtatman/python-code-dataset-500k is a text generation dataset in English from jtatman with 559,515 records in Parquet format. It is distributed under the mit license and falls in the 100K<n<1M size category, and has been downloaded 922 times.
About jtatman/python-code-dataset-500k
Attention: This dataset is a summary and reformat pulled from github code.
You should make your own assumptions based on this.
In fact, there is another dataset I formed through parsing that addresses several points:
out of 500k python related...
Details
- Task
- Text Generation
- Language
- English
- Format
- Parquet
- Rows / instances
- 559515
- Size
- 100K<n<1M
- Creator
- jtatman
- Year
- 2024
- License
- mit
- Downloads
- 922
- Likes
- 80