OpenCoder-LLM/opc-annealing-corpus
General NLP, English
The OpenCoder-LLM/opc-annealing-corpus dataset is an English General NLP resource published by OpenCoder-LLM in 2024.
About OpenCoder-LLM/opc-annealing-corpus
OpenCoder Dataset
The OpenCoder dataset is composed of the following datasets:
opc-sft-stage1: the SFT data used for OpenCoder sft-stage1
opc-sft-stage2: the SFT data used for OpenCoder sft-stage2
opc-annealing-corpus: the synthetic data & alg...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Creator: OpenCoder-LLM
- Year: 2024