Skip to content

bigcode/commitpackft

General NLPCODEmit

Bigcode/commitpackft is a General NLP-focused dataset in CODE distributed in Parquet format. It is distributed under the mit license and falls in the 100K<n<1M size category, and has been downloaded 36.3K times.

About bigcode/commitpackft

CommitPackFT is is a 2GB filtered version of CommitPack to contain only high-quality commit messages that resemble natural language instructions.

Details

Task
General NLP
Language
CODE
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
bigcode
Year
2026
License
mit
Downloads
36325
Likes
110
Download Homepage

Related General NLP datasets

FAQ