Abstract:

Engineering design operates through hierarchical abstraction from system specifications to component implementations, requiring visual understanding coupled with mathematical reasoning at each level. While Multi-modal Large Language Models (MLLMs) excel at natural image tasks, their ability to extract mathematical models from technical diagrams remains unexplored. We present \textbf{CircuitSense}, a comprehensive benchmark evaluating circuit understanding across this hierarchy through 8,006 problems spanning component-level schematics to system-level block diagrams. Our benchmark uniquely examines the complete engineering workflow of Perception, Analysis, and Design, with particular emphasis on the critical but underexplored capability of deriving symbolic equations from visual inputs. We introduce a hierarchical synthetic generation pipeline consisting of a grid-based schematic generator and a block diagram generator with auto-derived symbolic equation labels. Evaluation of eight state-of-the-art MLLMs, both closed- and open-source, reveals fundamental limitations in visual-to-mathematical reasoning. Closed-source models achieve over 85% accuracy on perception tasks involving component recognition and topology identification, yet their performance on symbolic derivation and analytical reasoning falls below 19%, exposing a critical gap between visual parsing and symbolic reasoning. Models with stronger symbolic reasoning consistently achieve higher design-task accuracy, confirming the fundamental role of mathematical understanding in circuit synthesis and establishing symbolic reasoning as a key metric for engineering competence. Our synthetic pipeline code is available at \href{https://anonymous.4open.science/r/CircuitSense-8AC7/README.md}{URL}.

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (a scholarly search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

CircuitSense introduces a benchmark for evaluating visual-to-mathematical reasoning in circuit understanding, spanning component-level schematics to system-level block diagrams with over 8,000 problems. The paper resides in the Circuit and Diagram Visual Parsing leaf, which contains only three papers total, including two siblings. This represents a relatively sparse research direction within the broader taxonomy of sixteen papers across ten leaf nodes, suggesting the specific focus on hierarchical circuit understanding with mathematical derivation is not yet densely populated.

The taxonomy reveals neighboring work in Symbolic and Geometric Primitive Extraction and Visual-to-Symbolic Equation Derivation, which address related but distinct challenges. While sibling papers like Neural Circuit Diagrams and CircInspect focus on diagram interpretation, they do not emphasize the complete engineering workflow from perception through symbolic equation derivation. The Mathematical Reasoning branch contains only two papers, indicating that visual-to-symbolic translation remains underexplored compared to pure visual parsing or hierarchical modeling approaches found in other branches.

Of the twenty-eight candidate papers examined across the three contributions, the eighteen compared against the benchmark and synthetic-generation-pipeline contributions yielded no clear refutation. The systematic-evaluation contribution, however, had two refutable candidates among its ten, suggesting prior work has already explored MLLM limitations in technical reasoning tasks. Because these findings reflect top-K semantic matches rather than exhaustive coverage, the sparse refutation pattern indicates that the specific combination of hierarchical circuit understanding with mathematical derivation may offer incremental novelty over existing evaluation frameworks.

The analysis suggests moderate novelty given the sparse taxonomy leaf and limited prior work on complete visual-to-mathematical workflows in circuits. However, the evaluation component overlaps with existing MLLM capability studies, and the twenty-eight-candidate scope leaves open questions about broader literature coverage. The hierarchical emphasis and symbolic-equation focus appear to differentiate this work within the constrained search space examined.

Taxonomy

Core-task Taxonomy Papers: 16
Claimed Contributions: 3
Contribution Candidate Papers Compared: 28
Refutable Papers: 2

Research Landscape Overview

Core task: visual-to-mathematical reasoning in circuit understanding across hierarchical abstraction levels.

The field structure suggested by the taxonomy reflects a multifaceted challenge spanning visual perception, symbolic reasoning, hierarchical modeling, and pedagogical applications. The first branch, Visual Perception and Structural Parsing in Technical Domains, encompasses methods that extract structured representations from diagrams and schematics, often drawing on insights from neuroscience and computer vision to parse complex visual layouts. The second branch, Mathematical Reasoning and Symbolic Derivation, focuses on translating parsed structures into formal equations and constraint systems, bridging perceptual input with symbolic computation. The third branch, Hierarchical Abstraction Frameworks and Architectures, addresses the need to reason at multiple levels of granularity, from low-level component interactions to high-level system behavior, often leveraging hierarchical Bayesian models or reinforcement learning strategies. The fourth branch, Pedagogical and Multimodal Visualization Systems, explores interactive tools that support learning and explanation, making abstract circuit concepts accessible through dynamic visual feedback.

A particularly active line of work centers on integrating visual parsing with hierarchical reasoning, where methods must simultaneously recognize circuit topology and infer mathematical relationships at varying abstraction levels. CircuitSense[0] sits squarely within the Circuit and Diagram Visual Parsing cluster, emphasizing the extraction of structured circuit representations from visual input. It shares common ground with Neural Circuit Diagrams[6] and CircInspect[7], both of which tackle diagram interpretation, yet CircuitSense[0] places stronger emphasis on bridging visual parsing with mathematical derivation across hierarchical layers. This contrasts with works like Hierarchical Quantum Circuits[1] and Hierarchical Process Rewards[10], which prioritize abstraction mechanisms over the initial visual-to-symbolic translation. The interplay between perceptual fidelity and symbolic rigor remains an open question, as does the scalability of these approaches to real-world circuit complexity and diverse abstraction schemes.

Claimed Contributions

CircuitSense benchmark for hierarchical visual-to-mathematical reasoning

The authors introduce CircuitSense, a benchmark comprising 8,006 problems organized across six hierarchy levels (from resistor networks to system-level block diagrams) and three task categories (Perception, Analysis, and Design). The benchmark uniquely emphasizes symbolic equation derivation from visual circuit representations, combining curated textbook problems with synthetically generated circuits.

8 retrieved papers
Hierarchical synthetic generation pipeline with ground-truth symbolic equations

The authors develop a two-part synthetic generation pipeline: a grid-based circuit schematic generator that produces component-level circuits with guaranteed symbolic ground-truth equations, and a block diagram generator for system-level architectures with transfer function ground-truth. This pipeline enables unbiased evaluation while preventing dataset contamination.
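The core idea of such a generator, pairing each synthetic circuit with an automatically derived symbolic ground truth, can be illustrated with a minimal toy sketch. The code below is hypothetical and is not the authors' pipeline: it builds a random series/parallel resistor network and emits a symbolic formula for its equivalent resistance as the ground-truth label.

```python
import itertools
import random

def random_network(depth, ids=None):
    """Toy stand-in for a synthetic circuit generator (not the authors'
    code): build a random series/parallel resistor tree and return a
    symbolic formula string for its equivalent resistance."""
    if ids is None:
        ids = itertools.count(1)
    if depth == 0:
        return f"R{next(ids)}"             # leaf: a fresh resistor symbol
    a = random_network(depth - 1, ids)
    b = random_network(depth - 1, ids)
    if random.random() < 0.5:
        return f"({a} + {b})"              # series: R = Ra + Rb
    return f"(({a}*{b})/({a} + {b}))"      # parallel: R = Ra*Rb/(Ra+Rb)

random.seed(0)                             # reproducible sampling
print(random_network(2))                   # a formula over R1..R4
```

A real pipeline of this kind renders each generated circuit as an image and pairs it with the derived equation, giving an exact symbolic reference against which a model's output can be scored, which is what makes contamination-free evaluation possible.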

10 retrieved papers
Systematic evaluation revealing visual-to-mathematical reasoning gap in MLLMs

Through extensive experiments on eight state-of-the-art MLLMs, the authors demonstrate that while models excel at visual perception tasks (over 85% accuracy for closed-source models), they fail catastrophically at symbolic equation derivation (below 19% accuracy). The study establishes that stronger symbolic reasoning correlates with better design-task performance, confirming mathematical understanding as a prerequisite for engineering competence.

10 retrieved papers
Can Refute

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

CircuitSense benchmark for hierarchical visual-to-mathematical reasoning

The authors introduce CircuitSense, a benchmark comprising 8,006 problems organized across six hierarchy levels (from resistor networks to system-level block diagrams) and three task categories (Perception, Analysis, and Design). The benchmark uniquely emphasizes symbolic equation derivation from visual circuit representations, combining curated textbook problems with synthetically generated circuits.

Contribution

Hierarchical synthetic generation pipeline with ground-truth symbolic equations

The authors develop a two-part synthetic generation pipeline: a grid-based circuit schematic generator that produces component-level circuits with guaranteed symbolic ground-truth equations, and a block diagram generator for system-level architectures with transfer function ground-truth. This pipeline enables unbiased evaluation while preventing dataset contamination.

Contribution

Systematic evaluation revealing visual-to-mathematical reasoning gap in MLLMs

Through extensive experiments on eight state-of-the-art MLLMs, the authors demonstrate that while models excel at visual perception tasks (over 85% accuracy for closed-source models), they fail catastrophically at symbolic equation derivation (below 19% accuracy). The study establishes that stronger symbolic reasoning correlates with better design-task performance, confirming mathematical understanding as a prerequisite for engineering competence.