codeparrot/self-instruct-starcoder

General NLP | EN | Benchmark | bigscience-openrail-m

Created by codeparrot in 2023, codeparrot/self-instruct-starcoder is a General NLP benchmark dataset in English containing 9,631 records in Parquet format. It has 313 downloads and 63 likes. It is released under the bigscience-openrail-m license and falls in the 1K<n<10K size category.

This dataset is used as an LLM benchmark.

About codeparrot/self-instruct-starcoder

Self-instruct-starcoder is a dataset generated by prompting StarCoder to produce new instructions from a set of human-written seed instructions. The underlying process is explained in the pap...

Details

Task: General NLP
Language: EN
Format: Parquet
Rows / instances: 9,631
Size: 1K<n<10K
Creator: codeparrot
Year: 2023
License: bigscience-openrail-m
Downloads: 313
Likes: 63