cfahlgren1/hub-stats

General NLP, English, apache-2.0

Created by cfahlgren1 in 2024, cfahlgren1/hub-stats is a General NLP dataset in English, distributed in Parquet format. It has 4,050 downloads and 70 likes. It is released under the apache-2.0 license and falls in the 1M<n<10M size category.

About cfahlgren1/hub-stats

Changelog

Changes, March 11th 2026
Added new split: arxiv_papers, sourced from the Hugging Face /api/papers endpoint.
papers continues to point to daily_papers.parquet, which is the Daily Papers feed.

Changes, July 25th
Added baseModels ...
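Since the changelog names a concrete file, daily_papers.parquet, a minimal sketch of how one might build a direct-download URL for it follows. It assumes the file sits at the root of the dataset repository on the main branch, which this page does not confirm, and it relies on the Hub's usual /datasets/<repo>/resolve/<revision>/<file> URL layout. The helper name parquet_url is hypothetical and introduced only for illustration.

```python
from urllib.parse import quote

HUB_BASE = "https://huggingface.co"


def parquet_url(repo_id: str, filename: str, revision: str = "main") -> str:
    """Build a direct-download URL for a file in a Hub dataset repo.

    Hypothetical helper: assumes `filename` is a path relative to the
    repository root and that `revision` is an existing branch or tag.
    """
    # Percent-encode the revision and file path; "/" in the file path is kept.
    safe_revision = quote(revision, safe="")
    safe_filename = quote(filename)
    return f"{HUB_BASE}/datasets/{repo_id}/resolve/{safe_revision}/{safe_filename}"


# File name taken from the changelog; its location in the repo is an assumption.
url = parquet_url("cfahlgren1/hub-stats", "daily_papers.parquet")
print(url)
```

If the assumption about the file location holds, the resulting URL could be handed to a Parquet reader such as pandas.read_parquet, provided pandas and a Parquet engine are installed.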

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Size: 1M<n<10M
Creator: cfahlgren1
Year: 2024
License: apache-2.0
Downloads: 4050
Likes: 70