facebook/principia-collection

General NLP · EN · cc-by-nc-sa-4.0

Created by facebook in 2025, facebook/principia-collection is an English (EN) General NLP dataset containing 554,399 records in Parquet format. It has 340 downloads and 45 likes. It is released under the cc-by-nc-sa-4.0 license and falls in the 100K<n<1M size category.

About facebook/principia-collection

Principia Collection is a large-scale dataset designed to enhance language models’ ability to derive mathematical objects from STEM-related problem statements. Each instance contains a problem statement, a ground truth ans...

Details

Task: General NLP
Language: EN
Format: Parquet
Rows / instances: 554,399
Size: 100K<n<1M
Creator: facebook
Year: 2025
License: cc-by-nc-sa-4.0
Downloads: 340
Likes: 45
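
The details above can be turned into a short loading sketch. The Python below is an illustration under stated assumptions, not an official loader: it assumes the dataset is hosted on the Hugging Face Hub under the repository ID shown on this page, that the `datasets` library is installed, and that a split named `train` exists. The helper names `size_bucket` and `stream_first_records` are hypothetical, and the bucket edges are modeled on the "100K<n<1M" label rather than taken from any specification. If the dataset defines several configurations, `load_dataset` may also need a configuration name, which this page does not list.

```python
from itertools import islice

REPO_ID = "facebook/principia-collection"  # repository ID as listed on this page
LISTED_ROWS = 554_399                      # row count as listed on this page


def size_bucket(n_rows: int) -> str:
    """Map a row count to a size-category label such as '100K<n<1M'.

    The bucket edges are an assumption modeled on the label shown above.
    """
    edges = [
        (1_000, "n<1K"),
        (10_000, "1K<n<10K"),
        (100_000, "10K<n<100K"),
        (1_000_000, "100K<n<1M"),
    ]
    for upper, label in edges:
        if n_rows < upper:
            return label
    return "n>1M"


def stream_first_records(k: int = 3, split: str = "train"):
    """Return the first k records without downloading the whole dataset.

    Assumes the Hugging Face `datasets` library and a split called `train`;
    both are assumptions, not facts stated on this page.
    """
    from datasets import load_dataset  # imported lazily; needs network access

    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return list(islice(stream, k))


if __name__ == "__main__":
    # Offline check: the listed row count should fall in the listed size bucket.
    print(size_bucket(LISTED_ROWS))
```

Streaming is used so that inspecting a few records does not require fetching all of the Parquet files first. Note that the cc-by-nc-sa-4.0 license restricts commercial use.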
