librarian-bots/arxiv-metadata-snapshot
Text GenerationText ClassificationEN
The librarian-bots/arxiv-metadata-snapshot dataset is a EN text generation resource from librarian-bots at 2026 comprising 3,043,230 examples.
About librarian-bots/arxiv-metadata-snapshot
Dataset Card for "arxiv-metadata-oai-snapshot"
More Information needed
This is a mirror of the metadata portion of the arXiv dataset.
The sync will take place weekly so may fall behind the original datasets slightly if there are more r...
Details
- Task
- Text Generation, Text Classification
- Language
- EN
- Format
- Parquet
- Rows / instances
- 3,043,230
- Creator
- librarian-bots
- Year
- 2026