Duxiaoman-DI/FinCorpus

General NLP · ZH · apache-2.0

Duxiaoman-DI/FinCorpus is a Chinese-language (ZH) General NLP dataset published by Duxiaoman-DI in Parquet format. It is distributed under the apache-2.0 license, falls in the 100K<n<1M size category, and has been downloaded 244 times.

About Duxiaoman-DI/FinCorpus

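A Chinese financial news and information dataset, comprising the following files (sizes before compression):

Listed-company announcements: announcement_data.jsonl (20G)
Financial information / news: fin_news_data.jsonl (30G), fin_articles_data.jsonl (10G)
Financial exam questions: fin_exam.jsonl (370M)

Data format: { "text": <text content>, "meta": { "source": <data source> } }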
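The record format described above can be read one JSON object per line. The following is a minimal sketch, not the dataset authors' own loader: the helper name `iter_records` and the sample lines are hypothetical, and it assumes each line of a `.jsonl` file holds exactly one object with a string `text` field and a `meta.source` field.

```python
import io
import json
from typing import Iterator, TextIO


def iter_records(stream: TextIO) -> Iterator[dict]:
    """Yield a flattened {"text", "source"} dict per non-empty JSONL line.

    Hypothetical helper: it only checks the two fields named in the
    dataset description and ignores any other keys.
    """
    for line_no, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        if not isinstance(record, dict) or not isinstance(record.get("text"), str):
            raise ValueError(f"line {line_no}: missing or non-string 'text'")
        meta = record.get("meta")
        if not isinstance(meta, dict) or "source" not in meta:
            raise ValueError(f"line {line_no}: missing 'meta.source'")
        yield {"text": record["text"], "source": meta["source"]}


# Made-up sample lines standing in for a file such as fin_news_data.jsonl.
sample = io.StringIO(
    '{"text": "Example announcement body.", "meta": {"source": "example-source-a"}}\n'
    "\n"
    '{"text": "Example news body.", "meta": {"source": "example-source-b"}}\n'
)
records = list(iter_records(sample))
```

Reading line by line keeps memory use flat, which matters for files in the tens of gigabytes; for a real file, pass `open(path, encoding="utf-8")` in place of the in-memory sample.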

Details

Task: General NLP
Language: ZH
Format: Parquet
Rows / instances: N/A
Size: 100K<n<1M
Creator: Duxiaoman-DI
Year: 2023
License: apache-2.0
Downloads: 244
Likes: 80