bigscience-data/roots_zh-cn_wikipedia
General NLPZHcc-by-sa-3.0
Bigscience-data/roots_zh-cn_wikipedia is a General NLP-focused dataset in ZH distributed in Parquet format. It is distributed under the cc-by-sa-3.0 license and falls in the 100K<n<1M size category, and has been downloaded 26 times.
About bigscience-data/roots_zh-cn_wikipedia
ROOTS Subset: roots_zh-cn_wikipedia
wikipedia
Dataset uid: wikipedia
Description
Homepage
Licensing
Speaker Locations
Sizes
3.2299 % of total
4.2071 % of en
5.6773 % of ar
3.3416...
Details
- Task
- General NLP
- Language
- ZH
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100K<n<1M
- Creator
- bigscience-data
- Year
- 2022
- License
- cc-by-sa-3.0
- Downloads
- 26
- Likes
- 32