ByteDance-Seed/THEMol
General NLPENcc-by-nc-4.0
The ByteDance-Seed/THEMol dataset is a EN General NLP resource from ByteDance-Seed at 2026. With 18.2K downloads and 5 likes, it is actively used by the community. It is released under the cc-by-nc-4.0 license and is a 10M<n<100M-scale dataset.
About ByteDance-Seed/THEMol
THEMol: Torsion, Hessian, Energy of Molecules
Dataset Summary
THEMol is an open-source collection of quantum mechanical properties tailored for organic molecules. It provides large-scale density functional theory (DFT) data for explo...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 10M<n<100M
- Creator
- ByteDance-Seed
- Year
- 2026
- License
- cc-by-nc-4.0
- Downloads
- 18192
- Likes
- 5