BAAI/CCI3-HQ
Text GenerationZH
BAAI/CCI3-HQ is a text generation dataset in ZH from BAAI in Parquet format. And falls in the 10M<n<100M size category, and has been downloaded 3.6K times.
About BAAI/CCI3-HQ
Data Description
To address the scarcity of high-quality safety datasets in the Chinese, we open-sourced the CCI (Chinese Corpora Internet) dataset on November 29, 2023.
Building on this foundation, we continue to expand the data source, adopt...
Details
- Task
- Text Generation
- Language
- ZH
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10M<n<100M
- Creator
- BAAI
- Year
- 2024
- Downloads
- 3613
- Likes
- 60