BAAI/CCI4.0-M2-CoT-v1
General NLPEnglish
BAAI/CCI4.0-M2-CoT-v1 is a General NLP dataset in English from BAAI in Parquet format. And falls in the 100M<n<1B size category, and has been downloaded 1.9K times.
About BAAI/CCI4.0-M2-CoT-v1
CCI4.0-M2 v1 Dataset Documentation
Tech Report👁
Overview
CCI4.0-M2 v1 is a comprehensive dataset collection consisting of two specialized subsets designed for language model training.
CCI4.0-M2-Base v1
CCI4.0-M2-CoT v1
D...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100M<n<1B
- Creator
- BAAI
- Year
- 2025
- Downloads
- 1903
- Likes
- 57