Skip to content

ByteDance-Seed/THEMol

General NLPENcc-by-nc-4.0

The ByteDance-Seed/THEMol dataset is a EN General NLP resource from ByteDance-Seed at 2026. With 18.2K downloads and 5 likes, it is actively used by the community. It is released under the cc-by-nc-4.0 license and is a 10M<n<100M-scale dataset.

About ByteDance-Seed/THEMol

THEMol: Torsion, Hessian, Energy of Molecules Dataset Summary THEMol is an open-source collection of quantum mechanical properties tailored for organic molecules. It provides large-scale density functional theory (DFT) data for explo...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Size
10M<n<100M
Creator
ByteDance-Seed
Year
2026
License
cc-by-nc-4.0
Downloads
18192
Likes
5
Download Homepage

Related General NLP datasets

FAQ