Duxiaoman-DI/FinCorpus
General NLPZHapache-2.0
Duxiaoman-DI/FinCorpus is a General NLP dataset in ZH from Duxiaoman-DI in Parquet format. It is distributed under the apache-2.0 license and falls in the 100K<n<1M size category, and has been downloaded 244 times.
About Duxiaoman-DI/FinCorpus
中文金融资讯数据集,包括(压缩前):
上市公司公告 announcement_data.jsonl 20G
金融资讯/新闻
fin_news_data.jsonl 30G
fin_articles_data.jsonl 10G
金融试题 fin_exam.jsonl 370M
数据格式:
{
"text": <文本内容>,
"meta": {
"source": <数据来源>
}
}
Details
- Task
- General NLP
- Language
- ZH
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100K<n<1M
- Creator
- Duxiaoman-DI
- Year
- 2023
- License
- apache-2.0
- Downloads
- 244
- Likes
- 80