Skip to content

bigscience-data/roots_zh-cn_wikipedia

General NLPZHcc-by-sa-3.0

Bigscience-data/roots_zh-cn_wikipedia is a General NLP-focused dataset in ZH distributed in Parquet format. It is distributed under the cc-by-sa-3.0 license and falls in the 100K<n<1M size category, and has been downloaded 26 times.

About bigscience-data/roots_zh-cn_wikipedia

ROOTS Subset: roots_zh-cn_wikipedia wikipedia Dataset uid: wikipedia Description Homepage Licensing Speaker Locations Sizes 3.2299 % of total 4.2071 % of en 5.6773 % of ar 3.3416...

Details

Task
General NLP
Language
ZH
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
bigscience-data
Year
2022
License
cc-by-sa-3.0
Downloads
26
Likes
32
Download Homepage

Related General NLP datasets

FAQ