facebook/principia-collection
General NLP · EN · cc-by-nc-sa-4.0
Created by facebook in 2025, facebook/principia-collection is an English (EN) General NLP dataset containing 554,399 records in Parquet format. It has 340 downloads and 45 likes. It is released under the cc-by-nc-sa-4.0 license and falls in the 100K<n<1M size category.
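As a small illustration of how a record count maps to a size tag such as 100K<n<1M, here is a minimal sketch. The helper name `size_category`, the bucket list, and the treatment of exact boundary values are assumptions made for this example, not something this page specifies.

```python
# Hypothetical helper: maps a row count to a size tag such as "100K<n<1M".
# The bucket labels are modeled on the tag shown on this page; how exact
# boundary values (e.g. exactly 1,000,000) are bucketed is an assumption.
_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
    (100_000_000, "10M<n<100M"),
    (1_000_000_000, "100M<n<1B"),
]


def size_category(num_rows: int) -> str:
    """Return the size tag for a dataset with `num_rows` records."""
    if num_rows < 0:
        raise ValueError("num_rows must be non-negative")
    for upper_bound, label in _BUCKETS:
        # Each bucket is treated as upper-exclusive in this sketch.
        if num_rows < upper_bound:
            return label
    # Simplification: anything at or above one billion rows shares one label.
    return "n>1B"


if __name__ == "__main__":
    print(size_category(554_399))
```

With the 554,399 records reported here, the helper returns the same 100K<n<1M tag listed for this dataset.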
About facebook/principia-collection
Principia Collection
Principia Collection is a large-scale dataset designed to enhance language models’ ability to derive mathematical objects from STEM-related problem statements. Each instance contains a problem statement, a ground truth ans...
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: 554,399
- Size: 100K<n<1M
- Creator: facebook
- Year: 2025
- License: cc-by-nc-sa-4.0
- Downloads: 340
- Likes: 45
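Given the repository ID and Parquet format listed above, one way to inspect a few records might look like the following sketch. It assumes the Hugging Face `datasets` library is installed and that the repository is reachable under the ID shown. The split name `"train"`, the optional `config_name` argument, and the helper names `load_principia` and `preview` are assumptions for illustration. This page does not list the dataset's configurations or column names, so the sketch does not reference any.

```python
from itertools import islice

# Repository ID as listed on this page.
REPO_ID = "facebook/principia-collection"


def load_principia(config_name=None, split="train", streaming=True):
    """Open the dataset, streaming by default to avoid a full download.

    Assumptions: the `datasets` package is installed, a split named
    "train" exists, and `config_name` may need to be set if the
    repository defines several configurations.
    """
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name=config_name, split=split, streaming=streaming)


def preview(records, n=3):
    """Return the first `n` items of any iterable of records as a list."""
    return list(islice(records, n))


if __name__ == "__main__":
    # Requires network access; prints whatever fields each record has.
    for record in preview(load_principia(), n=3):
        print(record)
```

Streaming is used here so that a quick look at a few records does not require fetching all 554,399 rows. Note that the cc-by-nc-sa-4.0 license restricts use to non-commercial purposes.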