obswork/arxiv-ai-ml-100k-papers
General NLPEnglish
Obswork/arxiv-ai-ml-100k-papers is a General NLP dataset in English from obswork in Parquet format.
About obswork/arxiv-ai-ml-100k-papers
license: other
tags:
- arxiv
- ocr
- machine-learning
---
# obswork/arxiv-ai-ml-100k
A 99,999-paper stratified subset of
[`Rendra8631/arxiv-papers`](https://huggingface.co/datasets/Rendra8631/arxiv-papers)
...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- obswork
- Year
- 2026