DeLeaker: Dynamic Inference-Time Reweighting For Semantic Leakage Mitigation in Text-to-Image Models

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: text-to-image, semantic leakage, computer vision, automatic evaluation, multimodal
Abstract:

Text-to-Image (T2I) models have advanced rapidly, yet they remain vulnerable to semantic leakage, the unintended transfer of semantically related features between distinct entities. Existing mitigation strategies are often optimization-based or dependent on external inputs. We introduce DeLeaker, a lightweight, optimization-free inference-time approach that mitigates leakage by directly intervening on the model’s attention maps. Throughout the diffusion process, DeLeaker dynamically reweights attention maps to suppress excessive cross-entity interactions while strengthening the identity of each entity. To support systematic evaluation, we introduce SLIM (Semantic Leakage in IMages), the first dataset dedicated to semantic leakage, comprising 1,130 human-verified samples spanning diverse scenarios, together with a novel automatic evaluation framework. Experiments demonstrate that DeLeaker consistently outperforms all baselines, even when they are provided with external information, achieving effective leakage mitigation without compromising fidelity or quality. These results underscore the value of attention control and pave the way for more semantically precise T2I models.

Disclaimer
This report is AI-generated using large language models and WisPaper (a scholar search engine). It analyzes a paper's tasks and contributions against retrieved prior work. While the system identifies potential overlaps and novel directions, its coverage is not exhaustive and its judgments are approximate. These results are intended to assist human reviewers and should not be relied upon as a definitive verdict on novelty.
Note that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces DeLeaker, a lightweight inference-time method that mitigates semantic leakage by dynamically reweighting attention maps during diffusion. It resides in the 'Inference-Time Attention Reweighting' leaf, which contains five papers total, including the original work. This leaf sits within the broader 'Attention Mechanism Intervention' branch, one of nine major research directions in the taxonomy. The relatively small cluster suggests this specific approach—dynamic reweighting without optimization or external inputs—represents a focused but not overcrowded research direction within the larger semantic leakage mitigation landscape.

The taxonomy reveals neighboring leaves addressing related attention-based strategies: 'Attention Map Alignment and Control' enforces spatial constraints using layouts or energy objectives, while 'Text Self-Attention and Syntactic Guidance' leverages text encoder structure. These sibling categories share the attention intervention philosophy but differ in mechanism. Beyond attention methods, parallel branches explore embedding manipulation, concept erasure, and bias mitigation, indicating that the field pursues semantic control through diverse complementary pathways. DeLeaker's focus on cross-entity interaction suppression distinguishes it from spatial alignment methods and positions it as a refinement of dynamic attention control.

Among the thirty candidates examined in total, the DeLeaker method (Contribution A) had two of its ten candidates judged refutable, suggesting that some prior work addresses inference-time attention reweighting for semantic control. The SLIM dataset (Contribution B) and the automated evaluation framework (Contribution C) were each compared against ten candidates with zero refutations, indicating that these evaluation contributions appear more distinctive within the limited search scope. These statistics reflect a targeted literature search rather than exhaustive coverage: the analysis captures top semantic matches and immediate citations but may not encompass all relevant prior work in this evolving subfield.

Based on the limited search scope of thirty candidates, the method contribution appears to build incrementally on existing attention reweighting strategies, while the evaluation contributions show stronger novelty signals. The taxonomy context reveals a moderately populated research direction with clear boundaries separating dynamic reweighting from spatial alignment and embedding-based approaches. The analysis provides a snapshot of the immediate research neighborhood but does not claim comprehensive coverage of all semantic leakage mitigation techniques.

Taxonomy

Core-task taxonomy papers: 50
Claimed contributions: 3
Contribution candidate papers compared: 30
Refutable papers: 2

Research Landscape Overview

Core task: Semantic leakage mitigation in text-to-image generation addresses the challenge of preventing unintended information transfer or attribute mixing when generating images from textual descriptions. The field's taxonomy reveals a diverse landscape organized around nine major branches. Attention Mechanism Intervention focuses on inference-time reweighting and dynamic modulation strategies to steer cross-attention maps, as exemplified by works like Attend and Excite[2] and Be Yourself[3]. Embedding and Latent Space Manipulation explores how to disentangle or constrain representations to prevent unwanted semantic bleed, while Concept Erasure and Safety Filtering (surveyed in Concept Erasure Survey[4]) targets the removal of harmful or copyrighted content. Bias Detection and Mitigation examines fairness issues in generated outputs, and Multi-Subject and Multi-Attribute Composition tackles the challenge of faithfully rendering multiple entities without attribute confusion. Style Preservation and Content Leakage Prevention aims to separate stylistic elements from content, Cross-Modal and Temporal Consistency ensures coherence across modalities and frames, Semantic Alignment and Data Augmentation refines training signals, and Specialized Applications addresses domain-specific leakage problems.

Recent work highlights contrasting strategies for controlling semantic flow during generation. Attention-based methods like Temporal Adaptive Attention[11] and Attention Modulation[34] dynamically adjust cross-attention weights to prevent attribute leakage, while embedding-level approaches manipulate latent codes or token representations to enforce semantic boundaries. DeLeaker[0] operates within the Inference-Time Attention Reweighting cluster, sharing the attention intervention philosophy of Attend and Excite[2] and Be Yourself[3], yet it emphasizes mitigating leakage through targeted reweighting rather than broad excitation or identity preservation. This positions DeLeaker[0] as a refinement of attention control strategies, addressing scenarios where subtle semantic drift occurs despite standard guidance. Meanwhile, works like InstantStyle[1] and Only Style[18] tackle the related but distinct problem of style-content separation, illustrating how different branches converge on the shared goal of preventing unintended information mixing through complementary mechanisms.

Claimed Contributions

DeLeaker: Dynamic Inference-Time Reweighting Method

DeLeaker is a novel inference-time method that mitigates semantic leakage in text-to-image models by dynamically reweighting attention maps. It suppresses cross-entity interactions while strengthening each entity's self-identity, without requiring external inputs or costly optimization.
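The description above can be made concrete with a small sketch. The paper's actual reweighting rule is not reproduced here; the following is only a minimal illustration of the general idea, assuming a per-step cross-attention map of shape (pixels, tokens). The `suppress` and `boost` factors and the pixel-to-entity assignment heuristic are illustrative choices, not the paper's.

```python
import numpy as np

def reweight_attention(attn, entity_tokens, suppress=0.5, boost=1.5):
    """Hypothetical sketch: dampen cross-entity attention columns and
    boost each entity's own tokens, then renormalize rows.

    attn:          (n_pixels, n_tokens) cross-attention map, rows sum to 1
    entity_tokens: list of token-index lists, one per entity
    suppress, boost: illustrative scale factors (not the paper's values)
    """
    out = attn.copy()
    # Assign each pixel to the entity whose tokens it attends to most.
    entity_mass = np.stack(
        [attn[:, idx].sum(axis=1) for idx in entity_tokens], axis=1
    )
    owner = entity_mass.argmax(axis=1)
    for e, idx in enumerate(entity_tokens):
        mine = owner == e
        # Strengthen each entity's self-identity on its own pixels...
        out[np.ix_(mine, idx)] *= boost
        # ...and suppress its attention to every other entity's tokens.
        for other_idx in (t for j, t in enumerate(entity_tokens) if j != e):
            out[np.ix_(mine, other_idx)] *= suppress
    # Renormalize so each pixel's attention still sums to 1.
    return out / out.sum(axis=1, keepdims=True)
```

In a real diffusion pipeline such a function would be applied inside the cross-attention layers at selected timesteps; here it is shown standalone on a plain array for clarity.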

10 retrieved papers
Can Refute
SLIM Dataset for Semantic Leakage Evaluation

SLIM (Semantic Leakage in IMages) is the first dataset dedicated to evaluating semantic leakage in text-to-image models. It contains 1,130 human-verified samples organized into five subsets covering diverse leakage scenarios, including visually similar entities, spatial interactions, and multi-entity compositions.
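For illustration, a SLIM-style sample might be represented as the following record. The field names and example values are hypothetical; the released dataset's actual schema is not described here.

```python
from dataclasses import dataclass

@dataclass
class SLIMSample:
    """Hypothetical record layout for a SLIM-style sample; the field
    names and values are illustrative, not taken from the dataset."""
    prompt: str                 # the text-to-image prompt
    entities: list              # distinct entities named in the prompt
    subset: str                 # one of the five leakage scenarios
    leak_prone_features: list   # features expected to leak across entities

# Illustrative example in the "visually similar entities" scenario,
# where a zebra's stripes may leak onto the horse.
sample = SLIMSample(
    prompt="a zebra standing next to a white horse",
    entities=["zebra", "horse"],
    subset="visually similar entities",
    leak_prone_features=["stripes"],
)
```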

10 retrieved papers
Automated Evaluation Framework for Semantic Leakage

The authors introduce a comprehensive automated evaluation pipeline for assessing semantic leakage mitigation. The framework performs comparative evaluation, decomposing the assessment into discrete logical steps: leakage detection, ranking of mitigation success, and preservation of image quality. Its judgments are validated through an extensive human study.
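As a rough illustration of such a stepwise comparative pipeline, the sketch below assumes a generic `judge` callable (e.g. a wrapper around a vision-language model) that answers a question about one or two images. The function names, question phrasings, and return values are all hypothetical, not the paper's actual protocol.

```python
from dataclasses import dataclass

@dataclass
class StepResult:
    leakage_detected: bool
    mitigation_rank: int      # 1 = better mitigation of the two images
    quality_preserved: bool

def evaluate_pair(judge, prompt, baseline_img, mitigated_img):
    """Hypothetical decomposition of a comparative leakage evaluation
    into discrete steps; `judge` is an assumed external callable."""
    # Step 1: does the mitigated image still show feature leakage?
    leak = judge(
        f"Do distinct entities in this image share features that the "
        f"prompt assigns separately: '{prompt}'?", mitigated_img)
    # Step 2: rank which of the two images mitigates leakage better.
    rank = 1 if judge("Which image keeps the entities more distinct?",
                      baseline_img, mitigated_img) == "second" else 2
    # Step 3: check that mitigation did not sacrifice image quality.
    quality = judge("Is this image of comparable overall quality?",
                    mitigated_img)
    return StepResult(bool(leak), rank, bool(quality))
```

Breaking the evaluation into independent yes/no and ranking queries, as above, makes each judgment easier to validate against human annotations than a single holistic score.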

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution
DeLeaker: Dynamic Inference-Time Reweighting Method

Contribution
SLIM Dataset for Semantic Leakage Evaluation

Contribution
Automated Evaluation Framework for Semantic Leakage