introvoyz041/uspto-mol
General NLPEnglish
Introvoyz041/uspto-mol is a General NLP-focused dataset in English distributed in Parquet format. It has been downloaded 72.7K times.
About introvoyz041/uspto-mol
An intermediate dataset for US molecular patent grants
Retrieves patent grant data from USPTO weekly releases bulkdata.uspto.gov/data/patent/grant/redbook/{year} and keeps only patents with .mol files for downstream data mining use cases.
Compa...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- introvoyz041
- Year
- 2026
- Downloads
- 72710
- Likes
- 0