bluuebunny/arxiv_metadata_by_year

General NLP | EN | apache-2.0

bluuebunny/arxiv_metadata_by_year is an English-language (EN) General NLP dataset that provides 208,492 rows in Parquet format. It is distributed under the apache-2.0 license, is tagged with the 1M<n<10M size category, and has been downloaded about 94.7K times.

About bluuebunny/arxiv_metadata_by_year

Dataset Card for Dataset Name This dataset card aims to be a base template for new datasets. It has been generated using this raw template. Dataset Details Dataset Description Curated by: [More Information N...

Details

Task: General NLP
Language: EN
Format: Parquet
Rows / instances: 208492
Size: 1M<n<10M
Creator: bluuebunny
Year: 2026
License: apache-2.0
Downloads: 94680
Likes: 9
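
The row count and the size tag listed in these details can be cross-checked against each other. The following is a minimal sketch, assuming power-of-ten size buckets in the style of the `1M<n<10M` tag shown here; the `BUCKETS` table, the `size_category` function, and the catch-all `n>=1B` label are hypothetical names introduced only for illustration. The two figures may also describe different things (for example a single split versus the whole repository), so a mismatch is not necessarily an error.

```python
# Hypothetical bucket table: (exclusive upper bound, label).
# The labels follow the "1M<n<10M" style used on this page; they are an assumption.
BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
    (100_000_000, "10M<n<100M"),
    (1_000_000_000, "100M<n<1B"),
]


def size_category(rows: int) -> str:
    """Return the size bucket label that a row count falls into."""
    if rows < 0:
        raise ValueError("row count must be non-negative")
    for upper, label in BUCKETS:
        if rows < upper:
            return label
    # Illustrative catch-all for anything at or above the last bound.
    return "n>=1B"


if __name__ == "__main__":
    listed_rows = 208_492        # "Rows / instances" from the details
    listed_tag = "1M<n<10M"      # "Size" from the details
    computed = size_category(listed_rows)
    print(f"computed={computed} listed={listed_tag} match={computed == listed_tag}")
```

Under these assumptions, 208,492 rows falls in the `100K<n<1M` bucket rather than the listed `1M<n<10M`, so it is worth confirming the row count against the actual Parquet files before relying on either figure.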