Contributing to formal theorem proving in GitHub repos
- Contributing theorem proving to Lean-Matrix-Cookbook
- Contributing theorem proving in PrimeNumberTheoremAnd
- Basic architecture: 2) Lean-dojo + searching + enhanced LLM for lean theorem proving, verify it on simple benchmarks: minif2f 3) a solution to translate informal theorem into formal one (in Lean 4)
I proposed a reinforcement learning framework to iteratively optimize the pretrained LLM by rolling out next tactics and comparing them with the expected ones. The experiment results show that it helps to achieve a higher accuracy compared with directly fine-tuned LLM.
My model proved 13 theorems of 30 in minif2f. 👏🏻
#Positive sample on MiniF2F | Acc on MiniF2F | Acc on Trainset | |
---|---|---|---|
Lean-qwen0.5B-sft | 11 | 36% | 49% |
Lean-qwen0.5B-rl | 13 | 43% | 61% |
@article{lean-reinforced-llm,
title={Reinforced Large Language Model is a formal theorem prover},
author={Zhiling Luo},
year={2025},
journal={arXiv preprint arXiv:2502.08908},
url={https://arxiv.org/abs/2502.08908},
primaryClass={cs.AI}
}