cfahlgren1/hub-stats
General NLPEnglishapache-2.0
Created by cfahlgren1 at 2024, the cfahlgren1/hub-stats is a General NLP dataset in English in Parquet format. With 4K downloads and 70 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 1M<n<10M-scale dataset.
About cfahlgren1/hub-stats
Changelog
NEW Changes March 11th 2026
Added new split: arxiv_papers, sourced from the Hugging Face /api/papers endpoint
papers continues to point to daily_papers.parquet, which is the Daily Papers feed
NEW Changes July 25th
added baseModels ...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1M<n<10M
- Creator
- cfahlgren1
- Year
- 2024
- License
- apache-2.0
- Downloads
- 4050
- Likes
- 70