Skip to content

yuyijiong/Long-Instruction-with-Paraphrasing

Text GenerationZH, EN

The yuyijiong/Long-Instruction-with-Paraphrasing dataset is a ZH, EN text generation resource from yuyijiong at 2023.

About yuyijiong/Long-Instruction-with-Paraphrasing

🔥 Updates [2024.6.4] Add a slim version. The sample number is reduced from about 20k to 10k. [2024.5.28] The data format is converted from "chatml" to "messages", which is more convenient to use tokenizer.apply_chat_template. The old version...

Details

Task
Text Generation
Language
ZH, EN
Format
Parquet
Rows / instances
N/A
Creator
yuyijiong
Year
2023
Download

Related Text Generation datasets

FAQ