internlm/Lean-Github
General NLP | English | apache-2.0
internlm/Lean-Github is a General NLP dataset in English, published by internlm in Parquet format. It is distributed under the apache-2.0 license, falls in the 100K<n<1M size category, and has been downloaded 213 times.
About internlm/Lean-Github
We release Lean-Github and InternLM2-Step-Prover: 29K theorems compiled from 100+ Lean 4 repos, and a 7B model fine-tuned on Lean-Github and Lean-Workbook with SOTA performance on MiniF2F-test (54.5%), ProofNet (18.1%), and Putnam (5 problems)...
Details
- Task: General NLP
- Language: English
- Format: Parquet
- Rows / instances: N/A
- Size: 100K<n<1M
- Creator: internlm
- Year: 2024
- License: apache-2.0
- Downloads: 213
- Likes: 38
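
The details above give a dataset identifier and a Parquet format, so loading it could look roughly like the sketch below. It assumes the dataset is hosted on the Hugging Face Hub under the id `internlm/Lean-Github` and that the `datasets` package is installed. The card does not confirm the host, and it does not list split or column names, so the sketch requests every split and only prints what it finds. The helper name `load_lean_github` is hypothetical.

```python
"""Sketch: load the dataset described by this card.

Assumption: the dataset lives on the Hugging Face Hub under DATASET_ID.
"""

DATASET_ID = "internlm/Lean-Github"  # taken from the card title


def load_lean_github():
    """Return every available split of the dataset.

    Assumption: the `datasets` package is installed and the repository
    is reachable under DATASET_ID.
    """
    # Imported lazily so this module can be imported without the dependency.
    from datasets import load_dataset

    # No split is named: the card does not list split names, so ask for all.
    return load_dataset(DATASET_ID)


if __name__ == "__main__":
    splits = load_lean_github()
    for name, split in splits.items():
        # Column names are not documented on this card, so just report them.
        print(name, split.num_rows, split.column_names)
```

Running the script needs network access. Since the card reports the row count as N/A, the printed `num_rows` is a simple way to check where the dataset actually falls within the 100K<n<1M size category.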