opendatalab/AICC
Text Generation | MULTILINGUAL
opendatalab/AICC is a multilingual text generation dataset distributed in Parquet format.
About opendatalab/AICC
🔧 Our new-generation HTML parser, MinerU-HTML, is now released!
AICC: AI-ready Common Crawl Dataset
Paper | Project page
News
[2025-12-24] 🔥 CC-MinerU-Code Updated! We have updated our specialized high-quality code dataset CC-Miner...
Details
- Task: Text Generation
- Language: MULTILINGUAL
- Format: Parquet
- Rows / instances: N/A
- Creator: opendatalab
- Year: 2025