jinaai/code_exercises
Text GenerationENcc-by-nc-sa-4.0
Jinaai/code_exercises is a text generation-focused dataset in EN that provides 1,468,146 labeled examples distributed in Parquet format. It is distributed under the cc-by-nc-sa-4.0 license and falls in the 1M<n<10M size category, and has been downloaded 1.8K times.
About jinaai/code_exercises
Dataset Card for "code_exercises"
Code exercise
This dataset is composed of a diverse set of ~120k Python code exercises (~120m total tokens) generated by ChatGPT 3.5. It is designed to distill ChatGPT 3.5 knowledge about Python codi...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- 1468146
- Size
- 1M<n<10M
- Creator
- jinaai
- Year
- 2023
- License
- cc-by-nc-sa-4.0
- Downloads
- 1769
- Likes
- 40