xiegeo/uspto-mol

General NLP, English

xiegeo/uspto-mol is an English-language dataset, categorized under General NLP and distributed in Parquet format.

About xiegeo/uspto-mol

An intermediate dataset for US molecular patent grants. It retrieves patent grant data from the USPTO weekly releases at bulkdata.uspto.gov/data/patent/grant/redbook/{year} and keeps only patents with .mol files for downstream data mining use cases. Compa...
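
The filtering step described above (keep only patents that ship with .mol files) could look roughly like the sketch below. It is not the dataset author's code. It assumes, purely for illustration, that each patent occupies its own top-level folder inside a weekly archive; the function name `patents_with_mol` and the sample paths are hypothetical.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def patents_with_mol(paths):
    """Group archive member paths by top-level folder and keep only
    the folders that contain at least one .mol file.

    Assumption (not confirmed by the source): one folder per patent.
    """
    by_patent = defaultdict(list)
    for p in paths:
        parts = PurePosixPath(p).parts
        if len(parts) < 2:
            continue  # skip files that are not inside a per-patent folder
        by_patent[parts[0]].append(p)

    # Case-insensitive match on the .mol extension.
    return {
        patent: files
        for patent, files in by_patent.items()
        if any(f.lower().endswith(".mol") for f in files)
    }


# Hypothetical member paths, for illustration only.
sample = [
    "US0001/US0001.XML",
    "US0001/C00001.MOL",
    "US0002/US0002.XML",
    "readme.txt",
]
kept = patents_with_mol(sample)
```

In this sketch, `kept` holds only the folder that includes a .mol file; folders without one and loose top-level files are dropped.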

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Creator: xiegeo
Year: 2024