Skip to content

jinaai/code_exercises

Text GenerationENcc-by-nc-sa-4.0

Jinaai/code_exercises is a text generation-focused dataset in EN that provides 1,468,146 labeled examples distributed in Parquet format. It is distributed under the cc-by-nc-sa-4.0 license and falls in the 1M<n<10M size category, and has been downloaded 1.8K times.

About jinaai/code_exercises

Dataset Card for "code_exercises" Code exercise This dataset is composed of a diverse set of ~120k Python code exercises (~120m total tokens) generated by ChatGPT 3.5. It is designed to distill ChatGPT 3.5 knowledge about Python codi...

Details

Task
Text Generation
Language
EN
Format
Parquet
Rows / instances
1468146
Size
1M<n<10M
Creator
jinaai
Year
2023
License
cc-by-nc-sa-4.0
Downloads
1769
Likes
40
Download Homepage

Related Text Generation datasets

FAQ