Question 1

What is the codeparrot/self-instruct-starcoder dataset?

Accepted Answer

Self-instruct-starcoder

Summary

Self-instruct-starcoder is a dataset generated by prompting StarCoder to write new instructions based on a set of human-written seed instructions.
The underlying process is explained in the pap...

Question 2

Is codeparrot/self-instruct-starcoder a benchmark?

Accepted Answer

Yes. codeparrot/self-instruct-starcoder is used as an LLM benchmark; see the model leaderboards in the Benchmarks section.

Question 3

Where can I download codeparrot/self-instruct-starcoder?

Accepted Answer

codeparrot/self-instruct-starcoder is available at its source: https://huggingface.co/datasets/codeparrot/self-instruct-starcoder.
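
For programmatic access, a minimal sketch along the following lines may help. It assumes the Hugging Face `datasets` library is installed (`pip install datasets`) and that the machine has network access. The names `REPO_ID`, `dataset_url`, and `load_self_instruct` are hypothetical helpers introduced for illustration only; they are not part of any library.

```python
# Repository id taken from the download URL given in the answer above.
REPO_ID = "codeparrot/self-instruct-starcoder"


def dataset_url(repo_id: str) -> str:
    """Build the Hugging Face Hub page URL for a dataset repository."""
    return f"https://huggingface.co/datasets/{repo_id}"


def load_self_instruct():
    """Download the dataset from the Hub and return it (network required).

    The import is done lazily so that this module can be imported even
    when the `datasets` package is not installed.
    """
    from datasets import load_dataset

    # With no `split` argument, load_dataset returns every available split.
    return load_dataset(REPO_ID)
```

Calling `load_self_instruct()` downloads the data on first use and reuses the local cache afterwards; printing the returned object lists the available splits and their columns.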

Question 4

What license is codeparrot/self-instruct-starcoder released under?

Accepted Answer

codeparrot/self-instruct-starcoder is distributed under the bigscience-openrail-m license.
