Monitoring LLM-based Multi-Agent Systems Against Corruption Attacks via Node Evaluation

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 6.0 Download Report PDF

Multi-Agent SystemsLarge Language Models

Large Language Model (LLM)-based Multi-Agent Systems (MAS) have become a popular focus of contemporary research, with extensive studies demonstrating their effectiveness in enhancing the performance of individual agents. However, trustworthiness issues in MAS remain a critical concern. Unlike challenges in single-agent systems, MAS involve more complex communication processes, making them susceptible to corruption attacks. To mitigate this issue, several defense mechanisms have been developed based on the graph representation of MAS, where agents represent nodes and communications form edges. Nevertheless, these methods predominantly focus on static graph defense, attempting to either detect attacks in a fixed graph structure or optimize a static topology with certain defensive capabilities. To address this limitation, we propose a dynamic defense paradigm for MAS graph structures, which continuously monitors communication within the MAS graph, then dynamically adjusts the graph topology, accurately disrupts malicious communications, and effectively defends against evolving and diverse dynamic attacks. Experimental results in increasingly complex and dynamic MAS environments demonstrate that our method significantly outperforms existing MAS defense mechanisms as well as single-agent defense approaches.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a dynamic defense paradigm for LLM-based multi-agent systems that continuously monitors communication graphs and adjusts topology in real-time to disrupt malicious interactions. Within the taxonomy, it resides in the 'Dynamic Graph Topology Defense' leaf under 'LLM-Based Multi-Agent System Defense', sharing this leaf with only one sibling paper. This represents a relatively sparse research direction within a seven-paper taxonomy, suggesting the specific focus on dynamic topology adjustment for LLM-based MAS is not yet heavily explored in the examined literature.

The taxonomy reveals neighboring work in 'System-Level Anomaly Detection' and 'Temporal Graph Propagation Modeling' within the same parent branch, alongside domain-specific applications in connected vehicles, drone networks, and reinforcement learning communication defense. The paper's emphasis on continuous topology adjustment distinguishes it from static detection frameworks and propagation modeling approaches. The taxonomy's scope notes explicitly exclude static graph defenses and detection-only methods from this leaf, positioning the work at the intersection of real-time monitoring and structural intervention rather than passive observation or fixed-topology optimization.

Among twenty candidates examined across three contributions, no clearly refutable prior work was identified. The 'Dynamic defense paradigm' contribution examined ten candidates with zero refutations, while 'Backward propagation method for agent contribution evaluation' similarly found no overlapping prior work among ten candidates. The 'MAS Graph Backpropagation technique' was not evaluated against any candidates. This limited search scope—twenty papers from semantic matching—suggests the analysis captures highly relevant neighbors but cannot confirm exhaustive novelty. The absence of refutations within this constrained set indicates the specific combination of dynamic topology adjustment and continuous monitoring may be underexplored.

Based on the limited search scope, the work appears to occupy a relatively novel position within LLM-based multi-agent defense, particularly in its emphasis on real-time structural adaptation rather than static detection. However, the small taxonomy size and twenty-candidate search limit confidence in this assessment. A broader literature review covering static graph defenses, general adversarial robustness in multi-agent systems, and non-LLM dynamic topology methods would provide stronger validation of the claimed novelty.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: Defending multi-agent systems against corruption attacks through dynamic graph monitoring. The field addresses how to protect collaborative agent networks from adversarial manipulation by continuously analyzing their evolving communication and interaction structures. The taxonomy divides into two main branches: LLM-Based Multi-Agent System Defense, which focuses on protecting language-model-driven agents through mechanisms like dynamic graph topology monitoring and sentinel-based oversight, and Domain-Specific Multi-Agent Defense Applications, which tailors defenses to particular operational contexts such as drone swarms or cyber-physical systems. Within the LLM-based branch, works like GUARDIAN[3] and SentinelAgent[2] exemplify approaches that embed specialized monitoring agents or adaptive policy frameworks to detect and mitigate corrupted nodes, while the domain-specific branch explores how threat detection in UAV networks (e.g., Threat Detection Drones[5], Quantum UAV[7]) or temporal graph anomaly methods (Temporal Graph Anomaly[6]) can be adapted to multi-agent defense scenarios. A particularly active line of work centers on real-time node evaluation and topology-based defenses, where systems must distinguish legitimate agent behavior from adversarial corruption as the interaction graph evolves. Node Evaluation Monitoring[0] sits squarely within this dynamic graph topology defense cluster, emphasizing continuous assessment of individual agents' integrity through graph-structural signals. This contrasts slightly with Node Evaluation Corruptions[1], which investigates the nature and propagation of corruption itself, providing complementary insights into attack vectors. Compared to broader frameworks like GUARDIAN[3] that integrate multiple defense layers, Node Evaluation Monitoring[0] appears more narrowly focused on the monitoring mechanism itself, while Adaptive Policy Learning[4] explores how defenses can evolve over time. The central tension across these works involves balancing detection sensitivity against computational overhead and false-positive rates, especially as agent networks scale and adversaries adapt their strategies.

Claimed Contributions

Dynamic defense paradigm for MAS graph structures

10 retrieved papers

The authors introduce a defense approach that continuously monitors agent communications in Multi-Agent Systems and dynamically adjusts the graph topology to disrupt malicious communications, rather than relying on static graph defenses. This enables adaptation to evolving attack strategies.

10 retrieved papers

MAS Graph Backpropagation technique

0 retrieved papers

The authors develop a backpropagation method that models MAS communication as information propagation over a signed graph, using the chain rule to efficiently compute each agent's influence on final decisions. This enables accurate identification of harmful nodes or edges.

0 retrieved papers

Backward propagation method for agent contribution evaluation

10 retrieved papers

The authors propose a backward propagation algorithm that evaluates each agent's contribution to the system by combining local message scores with global propagation effects, enabling reliable detection of malicious agents in Multi-Agent Systems.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[1] Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation PDF

Zhang Zhixin, Xu Ming-qian, Wei, Zeming, Sun Meng (2025)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Dynamic defense paradigm for MAS graph structures

[18] Dynamic event-triggered control for leader-following consensus of nonlinear multi-agent systems against malicious attacks PDF

Cannot Refute

[19] Dynamic event-based prescribed-time practical consensus for nonlinear multi-agent systems under DoS and deception attacks PDF

Cannot Refute

[20] Anti-attack fuzzy tracking control for nonlinear multi-agent systems with topology switching PDF

Cannot Refute

[21] Deep Learning-Based Adaptive Network Intrusion Detection System (DL-ANIDS) for 5G Mobile Network Security PDF

Cannot Refute

[22] Robust Defensive Cyber Agent for Multi-Adversary Defense PDF

Cannot Refute

[23] Resilient Output Formation-Tracking of Heterogeneous Multi-Agent Systems Against Composite Attacks: A Fully-Distributed Event-Triggered Framework PDF

Cannot Refute

[24] From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks PDF

Cannot Refute

[25] A defense strategy for false data injection attacks in multi-agent systems PDF

Cannot Refute

[26] GNN-enabled Multi-Agent DRL for Adaptive Path Selection in Multi-Network Domains PDF

Cannot Refute

[27] Multi-group consensus of multi-agent systems subject to semi-Markov jump topologies against hybrid cyber-attacks PDF

Cannot Refute

Contribution

MAS Graph Backpropagation technique

Contribution

Backward propagation method for agent contribution evaluation

[8] Sensorimotor intelligent systems PDF

Cannot Refute

[9] Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation PDF

Cannot Refute

[10] A multi-agent intrusion detection system optimized by a deep reinforcement learning approach with a dataset enlarged using a generative model to reduce the â¦ PDF

Cannot Refute

[11] A deep learning-based multi-agent system for intrusion detection PDF

Cannot Refute

[12] Adversarial attacks on heterogeneous multi-agent deep reinforcement learning system with time-delayed data transmission PDF

Cannot Refute

[13] BadMDA: Towards Backdoor Injection during Domain Adaptation to Collapse Multi-Agent Perception PDF

Cannot Refute

[14] A generative multi-agent network for open world intrusion detection: a dissertation in Engineering and Applied Science PDF

Cannot Refute

[15] Enhancing multi-agent communication through credibility and reward-based optimisation PDF

Cannot Refute

[16] Online Multi-Agent Control with Adversarial Disturbances PDF

Cannot Refute

[17] Modifying Neural Networks in Adversarial Agents of Multi-agent Reinforcement Learning Systems PDF

Cannot Refute

Monitoring LLM-based Multi-Agent Systems Against Corruption Attacks via Node Evaluation

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[1] Monitoring LLM-based Multi-Agent Systems Against Corruptions via Node Evaluation PDF

Contribution Analysis

Dynamic defense paradigm for MAS graph structures

[18] Dynamic event-triggered control for leader-following consensus of nonlinear multi-agent systems against malicious attacks PDF

[19] Dynamic event-based prescribed-time practical consensus for nonlinear multi-agent systems under DoS and deception attacks PDF

[20] Anti-attack fuzzy tracking control for nonlinear multi-agent systems with topology switching PDF

[21] Deep Learning-Based Adaptive Network Intrusion Detection System (DL-ANIDS) for 5G Mobile Network Security PDF

[22] Robust Defensive Cyber Agent for Multi-Adversary Defense PDF

[23] Resilient Output Formation-Tracking of Heterogeneous Multi-Agent Systems Against Composite Attacks: A Fully-Distributed Event-Triggered Framework PDF

[24] From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks PDF

[25] A defense strategy for false data injection attacks in multi-agent systems PDF

[26] GNN-enabled Multi-Agent DRL for Adaptive Path Selection in Multi-Network Domains PDF

[27] Multi-group consensus of multi-agent systems subject to semi-Markov jump topologies against hybrid cyber-attacks PDF

MAS Graph Backpropagation technique

Backward propagation method for agent contribution evaluation

[8] Sensorimotor intelligent systems PDF

[9] Adversarially Robust Anomaly Detection through Spurious Negative Pair Mitigation PDF

[10] A multi-agent intrusion detection system optimized by a deep reinforcement learning approach with a dataset enlarged using a generative model to reduce the â¦ PDF

[11] A deep learning-based multi-agent system for intrusion detection PDF

[12] Adversarial attacks on heterogeneous multi-agent deep reinforcement learning system with time-delayed data transmission PDF

[13] BadMDA: Towards Backdoor Injection during Domain Adaptation to Collapse Multi-Agent Perception PDF

[14] A generative multi-agent network for open world intrusion detection: a dissertation in Engineering and Applied Science PDF

[15] Enhancing multi-agent communication through credibility and reward-based optimisation PDF

[16] Online Multi-Agent Control with Adversarial Disturbances PDF

[17] Modifying Neural Networks in Adversarial Agents of Multi-agent Reinforcement Learning Systems PDF

Table of Contents

[10] A multi-agent intrusion detection system optimized by a deep reinforcement learning approach with a dataset enlarged using a generative model to reduce the â¦ PDF