bigcode/commitpack

General NLP · CODE · mit

bigcode/commitpack is a General NLP dataset of code, published by bigcode in Parquet format. It is distributed under the MIT license and has been downloaded 11.2K times.

About bigcode/commitpack

CommitPack is a 4TB dataset of commits scraped from permissively licensed GitHub repositories.

Details

Task: General NLP
Language: CODE
Format: Parquet
Rows / instances: N/A
Creator: bigcode
Year: 2023
License: mit
Downloads: 11185
Likes: 78
