Achieving Expert-Level Agent from Foundation Model via Complexity Curriculum Reinforcement Learning with Synthetic Data
Overview
Taxonomy
Research Landscape Overview
Claimed Contributions
The authors introduce InternGeometry, an LLM-based agent that solves IMO-level geometry problems by iteratively proposing propositions and auxiliary constructions, verifying them with a symbolic engine, and reflecting on feedback. A dynamic memory mechanism enables the agent to conduct over 200 interactions per problem.
The authors propose CBRL, a multi-stage curriculum reinforcement learning framework that progressively increases the difficulty of synthesized geometry problems during training. This approach accelerates learning by adapting problem complexity to the current model capability.
The authors develop InternGeometry-DDAR, an enhanced interactive geometric proof engine based on the open-source DDAR system. It includes advanced definition strategies and a rich theorem library whose search space theoretically covers complete solutions for most IMO geometry problems.
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[2] Aristotle: IMO-level Automated Theorem Proving PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
InternGeometry: a medalist-level LLM agent for geometry problem solving
The authors introduce InternGeometry, an LLM-based agent that solves IMO-level geometry problems by iteratively proposing propositions and auxiliary constructions, verifying them with a symbolic engine, and reflecting on feedback. A dynamic memory mechanism enables the agent to conduct over 200 interactions per problem.
[4] Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving PDF
[8] LeanGeo: Formalizing Competitional Geometry problems in Lean PDF
[51] GeoSketch: A Neural-Symbolic Approach to Geometric Multimodal Reasoning with Auxiliary Line Construction and Affine Transformation PDF
[52] Enhancing the geometric problem-solving ability of multimodal llms via symbolic-neural integration PDF
[53] GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language PDF
[54] Geox: Geometric problem solving through unified formalized vision-language pre-training PDF
[55] Towards Reliable Proof Generation with LLMs: A Neuro-Symbolic Approach PDF
[56] Towards Geometry Problem Solving in the Large Model Era: A Survey PDF
[57] Machine assisted proof PDF
[58] Knowledge crosswords: Geometric reasoning over structured knowledge with large language models PDF
Complexity-Boosting Reinforcement Learning (CBRL)
The authors propose CBRL, a multi-stage curriculum reinforcement learning framework that progressively increases the difficulty of synthesized geometry problems during training. This approach accelerates learning by adapting problem complexity to the current model capability.
[67] Efficient reinforcement finetuning via adaptive curriculum learning PDF
[70] Self-Evolving Curriculum for LLM Reasoning PDF
[66] Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning PDF
[68] Vl-cogito: Progressive curriculum reinforcement learning for advanced multimodal reasoning PDF
[69] Light-r1: Curriculum sft, dpo and rl for long cot from scratch and beyond PDF
[71] Formal mathematics statement curriculum learning PDF
[72] Learning like humans: Advancing llm reasoning capabilities via adaptive difficulty curriculum learning and expert-guided self-reformulation PDF
[73] Ghpo: Adaptive guidance for stable and efficient llm reinforcement learning PDF
[74] SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning PDF
InternGeometry-DDAR: an interactive geometric proof engine
The authors develop InternGeometry-DDAR, an enhanced interactive geometric proof engine based on the open-source DDAR system. It includes advanced definition strategies and a rich theorem library whose search space theoretically covers complete solutions for most IMO geometry problems.