Skip to content

BAAI/CCI3-HQ

Text GenerationZH

BAAI/CCI3-HQ is a text generation dataset in ZH from BAAI in Parquet format. And falls in the 10M<n<100M size category, and has been downloaded 3.6K times.

About BAAI/CCI3-HQ

Data Description To address the scarcity of high-quality safety datasets in the Chinese, we open-sourced the CCI (Chinese Corpora Internet) dataset on November 29, 2023. Building on this foundation, we continue to expand the data source, adopt...

Details

Task
Text Generation
Language
ZH
Format
Parquet
Rows / instances
N/A
Size
10M<n<100M
Creator
BAAI
Year
2024
Downloads
3613
Likes
60
Download Homepage

Related Text Generation datasets

FAQ