bigcode/commitpack
General NLPCODEmit
Bigcode/commitpack is a General NLP dataset in CODE from bigcode in Parquet format. It is distributed under the mit license, and has been downloaded 11.2K times.
About bigcode/commitpack
CommitPack is is a 4TB dataset of commits scraped from GitHub repositories that are permissively licensed.
Details
- Task
- General NLP
- Language
- CODE
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- bigcode
- Year
- 2023
- License
- mit
- Downloads
- 11185
- Likes
- 78