Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 5.5 Download Report PDF

Adversarial ProtectionPrivacy ProtectionMulti-Modal Large Language ModelsHierarchical ReasoningGeographic Inference

Multi-modal large reasoning models (MLRMs) pose significant privacy risks by inferring precise geographic locations from personal images through hierarchical chain-of-thought reasoning. Existing privacy protection techniques, primarily designed for perception-based models, prove ineffective against MLRMs' sophisticated multi-step reasoning processes that analyze environmental cues. We introduce ReasonBreak, a novel adversarial framework specifically designed to disrupt hierarchical reasoning in MLRMs through concept-aware perturbations. Our approach is founded on the key insight that effective disruption of geographic reasoning requires perturbations aligned with conceptual hierarchies rather than uniform noise. ReasonBreak strategically targets critical conceptual dependencies within reasoning chains, generating perturbations that invalidate specific inference steps and cascade through subsequent reasoning stages. To facilitate this approach, we contribute GeoPrivacy-6K, a comprehensive dataset comprising 6,341 ultra-high-resolution images ( $\geq$ 2K) with hierarchical concept annotations. Extensive evaluation across seven state-of-the-art MLRMs (including GPT-o3, GPT-5, Gemini 2.5 Pro) demonstrates ReasonBreak's superior effectiveness, achieving a 14.4% improvement in tract-level protection (33.8% vs 19.4%) and nearly doubling block-level protection (33.5% vs 16.8%). This work establishes a new paradigm for privacy protection against reasoning-based threats.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces ReasonBreak, an adversarial framework targeting hierarchical reasoning in multi-modal large reasoning models for geographic privacy protection, alongside the GeoPrivacy-6K dataset. Within the taxonomy, this work occupies a uniquely sparse position: it is the sole paper in the 'Concept-Aware Adversarial Disruption of Hierarchical Reasoning' leaf under 'Privacy-Preserving Adversarial Perturbations for Geographic Reasoning'. This isolation suggests the paper addresses a nascent research direction with minimal direct prior work in the specific intersection of hierarchical reasoning disruption and geographic privacy.

The taxonomy reveals neighboring branches focused on adversarial defenses (e.g., defensive training against multimodal attacks, robustness probing in vision-language transformers) and spatial representation learning (adversarial methods for urban analysis, domain-adversarial spatiotemporal prediction). These directions emphasize strengthening models or learning spatial patterns, contrasting with the paper's privacy-centric goal of intentionally degrading geographic inference. The scope notes clarify that privacy-focused adversarial methods are explicitly excluded from general robustness categories, reinforcing the paper's distinct positioning within a less-explored privacy-protection niche rather than the more crowded adversarial defense landscape.

Among 25 candidates examined across three contributions, none were identified as clearly refuting the work. The ReasonBreak framework examined 10 candidates with zero refutable matches, the GeoPrivacy-6K dataset examined 5 with zero refutations, and the empirical validation examined 10 with zero refutations. This absence of overlapping prior work within the limited search scope suggests the concept-aware disruption of hierarchical reasoning chains represents a novel approach. However, the modest search scale (25 total candidates) means the analysis captures top semantic matches rather than exhaustive coverage of adversarial privacy or geographic reasoning literature.

Given the sparse taxonomy position and zero refutations among examined candidates, the work appears to introduce a genuinely new direction within geographic privacy protection. The limited search scope (top-25 semantic matches) provides reasonable confidence for the specific contributions but cannot rule out related work in adjacent domains (e.g., broader adversarial privacy methods not focused on hierarchical reasoning). The taxonomy structure suggests this represents an early exploration of a distinct research gap rather than an incremental refinement within an established subfield.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: Disrupting hierarchical geographic reasoning in multimodal models through adversarial perturbations. This emerging field sits at the intersection of adversarial machine learning and geographic information systems, exploring how vision-language models handle spatial hierarchies and how these capabilities can be systematically disrupted or protected. The taxonomy reveals several distinct research directions: one branch focuses on adversarial defenses and robustness mechanisms that protect multimodal models from attacks, exemplified by works like Hierarchical Defense Multimodal[2] and Probing Vision Language Robustness[6]. A second branch emphasizes privacy-preserving perturbations that intentionally obscure geographic information while maintaining utility. Additional branches explore spatial representation learning through adversarial and semantic techniques, geo-augmented analytics for trajectory data (as seen in Bike Sharing Demand[1] and Land Use POIs[3]), and multimodal geo-localization methods such as Frame Aggregation Geolocalization[5]. These branches collectively address the tension between model capability and information protection in geographic reasoning tasks. Particularly active lines of work contrast defensive robustness against intentional disruption for privacy. While some studies strengthen models against adversarial attacks to preserve accurate geographic inference, others deliberately craft perturbations to prevent location disclosure—a fundamental trade-off between utility and privacy. Hierarchical Geographic Privacy[0] occupies a distinctive position within the privacy-preserving branch, specifically targeting concept-aware disruption of hierarchical reasoning structures. Unlike broader robustness work such as Hierarchical Defense Multimodal[2], which aims to fortify models against general adversarial threats, this approach strategically exploits the layered nature of geographic hierarchies (e.g., city-state-country) to selectively degrade reasoning at specific granularity levels. This contrasts with frame-based localization methods like Frame Aggregation Geolocalization[5], which aggregate visual information to improve geographic precision rather than obscure it, highlighting the dual-use nature of advances in spatial understanding.

Claimed Contributions

ReasonBreak adversarial framework for disrupting hierarchical reasoning

10 retrieved papers

The authors propose ReasonBreak, an adversarial framework that generates concept-aware perturbations to disrupt the hierarchical chain-of-thought reasoning processes used by multi-modal large reasoning models for geographic inference. The framework strategically targets critical conceptual dependencies within reasoning chains to invalidate specific inference steps.

10 retrieved papers

GeoPrivacy-6K dataset with hierarchical concept annotations

5 retrieved papers

The authors introduce GeoPrivacy-6K, a specialized dataset containing 6,341 ultra-high-resolution images with detailed hierarchical concept annotations and spatial bounding boxes. This dataset is specifically designed to enable reasoning-aware privacy defense research against geographic inference.

5 retrieved papers

Empirical validation establishing new state-of-the-art in privacy protection

10 retrieved papers

The authors demonstrate through extensive evaluation across seven state-of-the-art MLRMs (including GPT-o3, GPT-5, and Gemini 2.5 Pro) that ReasonBreak achieves superior privacy protection, with 14.4% improvement in tract-level protection and nearly doubling block-level protection compared to existing methods.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Within the taxonomy built over the current TopK core-task papers, the original paper is assigned to a leaf with no direct siblings and no cousin branches under the same grandparent topic. In this retrieved landscape, it appears structurally isolated, which is one partial signal of novelty, but still constrained by search coverage and taxonomy granularity.

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

ReasonBreak adversarial framework for disrupting hierarchical reasoning

[8] Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving PDF

Cannot Refute

[9] When alignment fails: Multimodal adversarial attacks on vision-language-action models PDF

Cannot Refute

[10] ADVEDM:Fine-grained Adversarial Attack against VLM-based Embodied Agents PDF

Cannot Refute

[11] Adashield: Safeguarding multimodal large language models from structure-based attack via adaptive shield prompting PDF

Cannot Refute

[12] VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models PDF

Cannot Refute

[13] Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning PDF

Cannot Refute

[14] Jailbreak vision language models via bi-modal adversarial prompt PDF

Cannot Refute

[15] From Coarse to Fine: A Training-Free Framework for Hierarchical Traceability of Adversarial Attacks in Remote Sensing Systems PDF

Cannot Refute

[16] Prism: Programmatic reasoning with image sequence manipulation for lvlm jailbreaking PDF

Cannot Refute

[17] Adversarial Attacks on Vision-Language Model-Empowered Chatbots in Consumer Electronics PDF

Cannot Refute

Contribution

GeoPrivacy-6K dataset with hierarchical concept annotations

[18] Brain imaging generation with latent diffusion models PDF

Cannot Refute

[19] A short review on different clustering techniques and their applications PDF

Cannot Refute

[20] NAFlora-1M: continental-scale high-resolution fine-grained plant classification dataset PDF

Cannot Refute

[21] Object discovery in high-resolution remote sensing images: a semantic perspective PDF

Cannot Refute

[22] A Framework for Fashion Data Gathering, Hierarchical-Annotation and Analysis for Social Media and Online Shop: TOOLKIT FOR DETAILED STYLE ANNOTATIONS â¦ PDF

Cannot Refute

Contribution

Empirical validation establishing new state-of-the-art in privacy protection

[23] Location-Privacy-Aware Service Migration Against Inference Attacks in Multiuser MEC Systems PDF

Cannot Refute

[24] Human activity recognition (har) using deep learning: Review, methodologies, progress and future research directions PDF

Cannot Refute

[25] Granular privacy control for geolocation with vision language models PDF

Cannot Refute

[26] Privacy Protection in Collaborative Geographically Distributed Co-Simulation of Multimodal Energy Systems PDF

Cannot Refute

[27] Image-based geolocation using large vision-language models PDF

Cannot Refute

[28] LocGuard: A location privacy defender for image sharing PDF

Cannot Refute

[29] Geoshield: Safeguarding geolocation privacy from vision-language models via adversarial perturbations PDF

Cannot Refute

[30] BCRA: bidirectional cross-modal implicit relation reasoning and aligning for text-to-image person retrieval PDF

Cannot Refute

[31] Cross-modal film and television content generation and dynamic modeling of ethical risks based on diffusion model-federated learning collaborative optimization PDF

Cannot Refute

[32] GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents PDF

Cannot Refute

Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

Contribution Analysis

ReasonBreak adversarial framework for disrupting hierarchical reasoning

[8] Black-Box Adversarial Attack on Vision Language Models for Autonomous Driving PDF

[9] When alignment fails: Multimodal adversarial attacks on vision-language-action models PDF

[10] ADVEDM:Fine-grained Adversarial Attack against VLM-based Embodied Agents PDF

[11] Adashield: Safeguarding multimodal large language models from structure-based attack via adaptive shield prompting PDF

[12] VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models PDF

[13] Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning PDF

[14] Jailbreak vision language models via bi-modal adversarial prompt PDF

[15] From Coarse to Fine: A Training-Free Framework for Hierarchical Traceability of Adversarial Attacks in Remote Sensing Systems PDF

[16] Prism: Programmatic reasoning with image sequence manipulation for lvlm jailbreaking PDF

[17] Adversarial Attacks on Vision-Language Model-Empowered Chatbots in Consumer Electronics PDF

GeoPrivacy-6K dataset with hierarchical concept annotations

[18] Brain imaging generation with latent diffusion models PDF

[19] A short review on different clustering techniques and their applications PDF

[20] NAFlora-1M: continental-scale high-resolution fine-grained plant classification dataset PDF

[21] Object discovery in high-resolution remote sensing images: a semantic perspective PDF

[22] A Framework for Fashion Data Gathering, Hierarchical-Annotation and Analysis for Social Media and Online Shop: TOOLKIT FOR DETAILED STYLE ANNOTATIONS â¦ PDF

Empirical validation establishing new state-of-the-art in privacy protection

[23] Location-Privacy-Aware Service Migration Against Inference Attacks in Multiuser MEC Systems PDF

[24] Human activity recognition (har) using deep learning: Review, methodologies, progress and future research directions PDF

[25] Granular privacy control for geolocation with vision language models PDF

[26] Privacy Protection in Collaborative Geographically Distributed Co-Simulation of Multimodal Energy Systems PDF

[27] Image-based geolocation using large vision-language models PDF

[28] LocGuard: A location privacy defender for image sharing PDF

[29] Geoshield: Safeguarding geolocation privacy from vision-language models via adversarial perturbations PDF

[30] BCRA: bidirectional cross-modal implicit relation reasoning and aligning for text-to-image person retrieval PDF

[31] Cross-modal film and television content generation and dynamic modeling of ethical risks based on diffusion model-federated learning collaborative optimization PDF

[32] GEO-Detective: Unveiling Location Privacy Risks in Images with LLM Agents PDF

Table of Contents

[22] A Framework for Fashion Data Gathering, Hierarchical-Annotation and Analysis for Social Media and Online Shop: TOOLKIT FOR DETAILED STYLE ANNOTATIONS â¦ PDF