bluuebunny/arxiv_metadata_by_year
General NLP | EN | apache-2.0
bluuebunny/arxiv_metadata_by_year is an English-language (EN) dataset for general NLP tasks. It provides 208,492 labeled examples in Parquet format under the apache-2.0 license, is listed in the 1M<n<10M size category, and has been downloaded 94.7K times.
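Because the data is distributed as Parquet on the Hugging Face Hub, it can probably be read with the `datasets` library. The following is a minimal sketch, not a documented recipe: it assumes `datasets` is installed, that the repository loads with its default configuration, and that streaming works for it. The helper name `peek_rows` is hypothetical, and the split names are whatever the Hub reports, since this page does not list them.

```python
REPO_ID = "bluuebunny/arxiv_metadata_by_year"


def peek_rows(repo_id: str = REPO_ID, n: int = 3) -> dict:
    """Return up to `n` rows from every split the Hub exposes for `repo_id`.

    Hypothetical helper: assumes the default configuration loads as-is.
    """
    # Imported lazily so the module can be imported without `datasets` installed.
    from itertools import islice

    from datasets import load_dataset

    # streaming=True reads the Parquet files lazily instead of downloading them all.
    splits = load_dataset(repo_id, streaming=True)

    # Each split is iterable and yields one dict per row.
    return {name: list(islice(rows, n)) for name, rows in splits.items()}
```

Calling `peek_rows()` needs network access and returns a dictionary keyed by split name. If the repository defines several configurations, `load_dataset` will ask for one by name, which this page does not specify.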
About bluuebunny/arxiv_metadata_by_year
Dataset Card for Dataset Name
This dataset card aims to be a base template for new datasets. It has been generated using this raw template.
Dataset Details
Dataset Description
Curated by: [More Information N...
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: 208492
- Size: 1M<n<10M
- Creator: bluuebunny
- Year: 2026
- License: apache-2.0
- Downloads: 94680
- Likes: 9
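The row count and the size category listed above can be cross-checked with a little arithmetic. The sketch below maps a row count to the size-category labels used in Hugging Face dataset card metadata. The function name `size_category` is hypothetical, and treating each upper bound as exclusive is an assumption.

```python
# (exclusive upper bound, label) pairs in ascending order.
SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
    (100_000_000, "10M<n<100M"),
    (1_000_000_000, "100M<n<1B"),
    (10_000_000_000, "1B<n<10B"),
    (100_000_000_000, "10B<n<100B"),
    (1_000_000_000_000, "100B<n<1T"),
]


def size_category(n_rows: int) -> str:
    """Return the size-category label whose range contains `n_rows`."""
    if n_rows < 0:
        raise ValueError("row count must be non-negative")
    for upper, label in SIZE_BUCKETS:
        if n_rows < upper:
            return label
    return "n>1T"


print(size_category(208_492))  # row count listed on this page
```

Under these assumptions, 208,492 rows fall in the 100K<n<1M bucket rather than the listed 1M<n<10M. One possible explanation is that the row count covers only part of the dataset, but this page does not say.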