bigcode/bigcodebench
General NLP, CODE, apache-2.0
The bigcode/bigcodebench dataset is a code dataset listed under the General NLP task, published by bigcode in 2024, with 5,700 examples. It has 40.4K downloads and 84 likes, is released under the apache-2.0 license, and falls in the 1K<n<10K size category.
About bigcode/bigcodebench
BigCodeBench
The dataset has 2 variants:
BigCodeBench-Complete: Code Completion based on the structured docstrings.
BigCodeBench-Instruct: Code Generation based on the NL-oriented instructions.
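The difference between the two variants can be sketched as choosing a different prompt for the same task. The example below is a minimal illustration, not the benchmark's own tooling: the record is hypothetical, and the field names `complete_prompt` and `instruct_prompt` are assumptions that should be checked against the actual dataset schema.

```python
from typing import Dict

# Hypothetical record shaped like one benchmark task.
# The field names below are assumptions, not confirmed by this page.
EXAMPLE_TASK: Dict[str, str] = {
    "task_id": "Example/0",
    # Complete variant: a function signature plus a structured docstring
    # that the model is expected to continue with a function body.
    "complete_prompt": (
        "def count_words(text):\n"
        '    """\n'
        "    Count whitespace-separated words in text.\n"
        "\n"
        "    Parameters:\n"
        "    - text (str): Input string.\n"
        "\n"
        "    Returns:\n"
        "    - int: Number of words.\n"
        '    """\n'
    ),
    # Instruct variant: the same task phrased as a natural-language request.
    "instruct_prompt": (
        "Write a function count_words(text) that returns the number of "
        "whitespace-separated words in text."
    ),
}

# Maps a variant name to the (assumed) field that holds its prompt.
PROMPT_FIELD = {
    "complete": "complete_prompt",
    "instruct": "instruct_prompt",
}


def select_prompt(task: Dict[str, str], variant: str) -> str:
    """Return the prompt text for the requested variant of a task."""
    if variant not in PROMPT_FIELD:
        raise ValueError(f"unknown variant: {variant!r}")
    return task[PROMPT_FIELD[variant]]


if __name__ == "__main__":
    print(select_prompt(EXAMPLE_TASK, "complete"))
    print(select_prompt(EXAMPLE_TASK, "instruct"))
```

With the Hugging Face `datasets` library, real records would typically come from `load_dataset("bigcode/bigcodebench")`; the available split names are not stated here, so inspect the returned object before indexing into it.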
The overall statistics of the dataset are ...
Details
- Task: General NLP
- Language: CODE
- Format: Parquet
- Rows / instances: 5700
- Size: 1K<n<10K
- Creator: bigcode
- Year: 2024
- License: apache-2.0
- Downloads: 40405
- Likes: 84
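As a small worked check on the figures above, the sketch below shows how a row count of 5700 maps to the 1K<n<10K size label and how 40405 downloads abbreviates to 40.4K. The helper names are hypothetical, the treatment of exact boundary values is an assumption, and everything at or above one million rows is collapsed into a single label for brevity.

```python
def size_category(num_rows: int) -> str:
    """Map a row count to a coarse size label (boundary handling is an assumption)."""
    buckets = [
        (1_000, "n<1K"),
        (10_000, "1K<n<10K"),
        (100_000, "10K<n<100K"),
        (1_000_000, "100K<n<1M"),
    ]
    for upper_bound, label in buckets:
        if num_rows < upper_bound:
            return label
    # Simplification: larger buckets are not distinguished in this sketch.
    return "n>=1M"


def abbreviate_count(count: int) -> str:
    """Format a count in thousands with one decimal place, e.g. for download totals."""
    if count < 1_000:
        return str(count)
    return f"{count / 1_000:.1f}K"


if __name__ == "__main__":
    print(size_category(5700))      # 1K<n<10K
    print(abbreviate_count(40405))  # 40.4K
```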