Skip to content

bigcode/bigcodebench

General NLPCODEapache-2.0

The bigcode/bigcodebench dataset is a CODE General NLP resource from bigcode at 2024 comprising 5,700 examples. With 40.4K downloads and 84 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 1K<n<10K-scale dataset.

About bigcode/bigcodebench

BigCodeBench The dataset has 2 variants: BigCodeBench-Complete: Code Completion based on the structured docstrings.  BigCodeBench-Instruct: Code Generation based on the NL-oriented instructions. The overall statistics of the dataset are ...

Details

Task
General NLP
Language
CODE
Format
Parquet
Rows / instances
5700
Size
1K<n<10K
Creator
bigcode
Year
2024
License
apache-2.0
Downloads
40405
Likes
84
Download Homepage

Related General NLP datasets

FAQ