hatakeyama-llm-team/PMC

General NLP, English

Created by hatakeyama-llm-team in 2024, hatakeyama-llm-team/PMC is a General NLP dataset in English containing 819,253 records in Parquet format. It has 13,329 downloads and 2 likes, and falls in the 100K<n<1M size category.

About hatakeyama-llm-team/PMC

Data collected from PMC. Only records under CC-BY or CC-BY-SA licenses are included. For all records, check the jsonl files in the data folder.
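
Reading the jsonl files mentioned above could look roughly like the following. This is a minimal sketch using only the Python standard library. It assumes the data folder has been downloaded locally and that each line of every jsonl file holds one JSON object. The function names `iter_jsonl_records` and `count_records` and the `data` path are hypothetical, and the field names inside each record are not documented here, so the sketch does not rely on any.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_jsonl_records(data_dir: str) -> Iterator[dict]:
    """Yield one parsed object per non-empty line of every *.jsonl file in data_dir."""
    # Assumption: files sit directly in data_dir, one JSON object per line.
    for path in sorted(Path(data_dir).glob("*.jsonl")):
        with path.open(encoding="utf-8") as handle:
            for line in handle:
                line = line.strip()
                if line:  # skip blank lines
                    yield json.loads(line)


def count_records(data_dir: str) -> int:
    """Count records across all jsonl files without holding them in memory."""
    return sum(1 for _ in iter_jsonl_records(data_dir))


if __name__ == "__main__":
    # "data" is a hypothetical local copy of the dataset's data folder.
    if Path("data").is_dir():
        print(count_records("data"))
```

Streaming line by line keeps memory use flat, which matters for a collection of this size.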

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: 819,253
Size: 100K<n<1M
Creator: hatakeyama-llm-team
Year: 2024
Downloads: 13,329
Likes: 2