Skip to content

zhilingluo/theorem_prover

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

theorem_prover

Contributing to formal theorem proving in GitHub repos

Goal

Roadmap

  1. Basic architecture: 2) Lean-dojo + searching + enhanced LLM for lean theorem proving, verify it on simple benchmarks: minif2f 3) a solution to translate informal theorem into formal one (in Lean 4)

Process

I proposed a reinforcement learning framework to iteratively optimize the pretrained LLM by rolling out next tactics and comparing them with the expected ones. The experiment results show that it helps to achieve a higher accuracy compared with directly fine-tuned LLM.

My model proved 13 theorems of 30 in minif2f. 👏🏻

#Positive sample on MiniF2F Acc on MiniF2F Acc on Trainset
Lean-qwen0.5B-sft 11 36% 49%
Lean-qwen0.5B-rl 13 43% 61%

tech report

Models

Lean-qwen0.5b-sft

Lean-qwen0.5b-rl

Citation

@article{lean-reinforced-llm,
      title={Reinforced Large Language Model is a formal theorem prover}, 
      author={Zhiling Luo},
      year={2025},
      journal={arXiv preprint arXiv:2502.08908},
      url={https://arxiv.org/abs/2502.08908},
      primaryClass={cs.AI}
}

Ref

Ref

About

contributing to formal theorem proving in github

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published