yuyijiong/Long-Instruction-with-Paraphrasing
Text GenerationZH, EN
The yuyijiong/Long-Instruction-with-Paraphrasing dataset is a ZH, EN text generation resource from yuyijiong at 2023.
About yuyijiong/Long-Instruction-with-Paraphrasing
🔥 Updates
[2024.6.4] Add a slim version. The sample number is reduced from about 20k to 10k.
[2024.5.28]
The data format is converted from "chatml" to "messages", which is more convenient to use tokenizer.apply_chat_template. The old version...
Details
- Task
- Text Generation
- Language
- ZH, EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- yuyijiong
- Year
- 2023