xiegeo/uspto-mol
General NLPEnglish
Xiegeo/uspto-mol is a General NLP-focused dataset in English distributed in Parquet format.
About xiegeo/uspto-mol
An intermediate dataset for US molecular patent grants
Retrieves patent grant data from USPTO weekly releases bulkdata.uspto.gov/data/patent/grant/redbook/{year} and keeps only patents with .mol files for downstream data mining use cases.
Compa...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- xiegeo
- Year
- 2024