Survey: AI Agents for Formal Mathematics (External Landscape, 2026)

Sat, 07 Mar 2026 00:00:00 +0000

¶AI Agents for Formal Mathematics: External Landscape Survey

Survey of external tools, research, and practices for LLM agents doing formal mathematics with proof assistants. Focused on what exists outside this repository that could improve agent mathematical capability here.

¶1. LLM-Based Mathematical Reasoning Agents

¶DeepSeek-Prover-V2 (DeepSeek, 2025)

Formal theorem proving in Lean 4. Key innovation: subgoal decomposition — the model writes an informal proof sketch first, decomposes into formal subgoals, then proves each subgoal. Trained with GRPO (not PPO) reinforcement learning. Achieved 88.9% on MiniF2F-test, 49 out of 658 problems on FormalMATH. Open-weight (7B and 671B MoE variants).

Formalization on emsenn.net

Survey: AI Agents for Formal Mathematics (External Landscape, 2026)

¶AI Agents for Formal Mathematics: External Landscape Survey

¶1. LLM-Based Mathematical Reasoning Agents

¶DeepSeek-Prover-V2 (DeepSeek, 2025)