BAAI/CCI4.0-M2-CoT-v1

General NLP, English

BAAI/CCI4.0-M2-CoT-v1 is an English General NLP dataset from BAAI, distributed in Parquet format. It falls in the 100M<n<1B size category and has been downloaded about 1.9K times.

About BAAI/CCI4.0-M2-CoT-v1

CCI4.0-M2 v1 Dataset Documentation (Tech Report). Overview: CCI4.0-M2 v1 is a comprehensive dataset collection consisting of two specialized subsets designed for language model training: CCI4.0-M2-Base v1 and CCI4.0-M2-CoT v1. D...

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Size: 100M<n<1B
Creator: BAAI
Year: 2025
Downloads: 1903
Likes: 57
