Skip to content

librarian-bots/arxiv-metadata-snapshot

Text GenerationText ClassificationEN

The librarian-bots/arxiv-metadata-snapshot dataset is a EN text generation resource from librarian-bots at 2026 comprising 3,043,230 examples.

About librarian-bots/arxiv-metadata-snapshot

Dataset Card for "arxiv-metadata-oai-snapshot" More Information needed This is a mirror of the metadata portion of the arXiv dataset. The sync will take place weekly so may fall behind the original datasets slightly if there are more r...

Details

Task
Text Generation, Text Classification
Language
EN
Format
Parquet
Rows / instances
3,043,230
Creator
librarian-bots
Year
2026
Download

Related Text Generation, Text Classification datasets

FAQ