OpenCodePapers

automated-theorem-proving-on-minif2f-valid

Mathematical ProofsAutomated Theorem Proving
Dataset Link
Results over time
Click legend items to toggle metrics. Hover points for model names.
Leaderboard
PaperCodePass@8Pass@64Pass@1Pass@100ModelNameReleaseDate
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics✓ Link29.323.9Lean GPT-f2021-08-31
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics✓ Link21Metamath GPT-f2021-08-31
HyperTree Proof Search for Neural Theorem Proving58.6Evariste2022-05-23
HyperTree Proof Search for Neural Theorem Proving47.5Evariste-7d2022-05-23
HyperTree Proof Search for Neural Theorem Proving47.3GPT-f2022-05-23
HyperTree Proof Search for Neural Theorem Proving46.7Evariste-1d2022-05-23
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics✓ Link16.8Lean tidy2021-08-31
LEGO-Prover: Neural Theorem Proving with Growing Libraries✓ Link57.0LEGO-Prover ChatGPT2023-10-01
Lyra: Orchestrating Dual Correction in Automated Theorem Proving✓ Link52.0Lyra + GPT-42023-09-27
Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs✓ Link43.9DSP (62B Minerva informal)2022-10-21