internlm/Lean-Github

General NLP | English | apache-2.0

internlm/Lean-Github is a General NLP dataset in English, published by internlm in Parquet format. It is distributed under the apache-2.0 license, falls in the 100K<n<1M size category, and has been downloaded 213 times.

About internlm/Lean-Github

We release Lean-Github and InternLM2-Step-Prover: a dataset of 29K theorems compiled from 100+ Lean 4 repos, and a 7B model fine-tuned on Lean-Github and Lean-Workbook with SOTA performance on MiniF2F-test (54.5%), ProofNet (18.1%), and Putnam (5 problems)...
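
For readers who want to inspect the data, the following is a minimal sketch of one way to load it. It assumes the dataset is hosted on the Hugging Face Hub under the ID `internlm/Lean-Github` and that the third-party `datasets` package is installed. The helper names `load_lean_github` and `describe` are hypothetical, and no split or column names are assumed, since this listing does not state them.

```python
# Assumption: the dataset lives on the Hugging Face Hub under this ID and
# the third-party `datasets` package is installed (pip install datasets).
DATASET_ID = "internlm/Lean-Github"


def load_lean_github():
    """Download the dataset and return every available split."""
    # Imported lazily so this module can be imported without `datasets`.
    from datasets import load_dataset

    # No split name is passed because the listing does not say which
    # splits exist; the call returns a DatasetDict keyed by split name.
    return load_dataset(DATASET_ID)


def describe(dataset_dict):
    """Return {split_name: (row_count, column_names)} for each split."""
    return {
        name: (split.num_rows, list(split.column_names))
        for name, split in dataset_dict.items()
    }
```

Calling `describe(load_lean_github())` requires network access and reports the splits, row counts, and columns that the download actually contains, which is a quick way to check the real schema before relying on any field name.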

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Size: 100K<n<1M
Creator: internlm
Year: 2024
License: apache-2.0
Downloads: 213
Likes: 38