zai-org/LongCite-45k

Text Generation, Question Answering | EN, ZH

zai-org/LongCite-45k is a dataset for text generation and question answering in English and Chinese (EN, ZH), distributed in Parquet format.

About zai-org/LongCite-45k

The LongCite-45k dataset contains 44,600 long-context QA instances paired with sentence-level citations (in both English and Chinese, up to 128,000 words). The data can su...
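To make "sentence-level citations" concrete, here is a minimal Python sketch of how an answer statement could point back at numbered sentences of a long context. The bracket markup (`[0-1]`, `[5]`), the naive sentence splitter, and the helper names `split_sentences`, `cited_indices`, and `resolve` are all assumptions made for illustration; they are not taken from the dataset's actual schema, which this page does not describe.

```python
import re

# Hypothetical citation markup, assumed for illustration only: a statement
# may carry markers such as [2-3] or [5] that refer to numbered context
# sentences (0-based). This is NOT the dataset's documented format.
CITATION = re.compile(r"\[(\d+)(?:-(\d+))?\]")


def split_sentences(context):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", context.strip())
    return [p for p in parts if p]


def cited_indices(statement):
    """Return the sorted, de-duplicated sentence indices a statement cites."""
    indices = set()
    for match in CITATION.finditer(statement):
        start = int(match.group(1))
        end = int(match.group(2)) if match.group(2) else start
        indices.update(range(start, end + 1))
    return sorted(indices)


def resolve(statement, sentences):
    """Look up the cited sentences, skipping indices outside the context."""
    return [sentences[i] for i in cited_indices(statement) if i < len(sentences)]


context = (
    "Paris is the capital of France. "
    "It lies on the Seine. "
    "Berlin is the capital of Germany."
)
sentences = split_sentences(context)
statement = "The French capital sits on the Seine [0-1]."
evidence = resolve(statement, sentences)
```

In this toy example `evidence` holds the two context sentences that support the statement; a real record would need to be inspected to learn the markup it actually uses.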

Details

Task
Text Generation, Question Answering
Language
EN, ZH
Format
Parquet
Rows / instances
44,600 (per the dataset description)
Creator
zai-org
Year
2024

FAQ