Learning Heterogeneous Degradation Representation for Real-World Super-Resolution

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 5.6 Download Report PDF

Real-World Super-ResolutionRepresentation Learning.

Real-World Super-Resolution (RWSR) aims to reconstruct high-resolution images from low-resolution inputs captured under complex, real-life conditions, where diverse distortions result in significant degradation heterogeneity. Many methods rely on degradation representations, yet they struggle with the lack of spatially variant degradation modeling and degradation-content entanglement. We propose Spatially Amortized Variational Learning (SAVL), an implicit framework that models per-pixel degradations as spatially varying Gaussians inferred from local neighborhoods. SAVL couples a conditional likelihood lane (SAVL-LM) with a mutual information suppression lane (SAVL-MIS) to filter out degradation-irrelevant signals, yielding a well-constrained solution space. Both our qualitative visualizations and quantitative analyses confirm that the learned representations effectively capture the spatial distribution of complex degradations while being highly discriminative of diverse underlying degradation factors. Building on these representations, we design a degradation-aware SR network with channel-wise guidance and spatial attention modulation for adaptive reconstruction under heterogeneous degradations. Extensive experiments on real-world datasets demonstrate consistent gains over prior methods.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes Spatially Amortized Variational Learning (SAVL) for real-world super-resolution, modeling per-pixel degradations as spatially varying Gaussians inferred from local neighborhoods. According to the taxonomy, this work resides in the 'Spatially Variant and Heterogeneous Degradation Modeling' leaf under 'Degradation Modeling and Representation Learning'. Notably, this leaf contains only the original paper itself with no sibling papers, indicating a relatively sparse research direction within the broader field of 50 surveyed papers across 36 topics.

The taxonomy reveals that neighboring leaves address related but distinct approaches: 'Implicit Degradation Representation Learning' contains three subcategories (contrastive, generative, and diffusion-based methods totaling nine papers), while 'Explicit Degradation Modeling via Synthetic Pipelines' and 'Degradation Estimation and Kernel Modeling' focus on explicit parameter estimation rather than spatially variant implicit representations. The scope note for the original paper's leaf explicitly excludes 'uniform or global degradation modeling approaches', positioning this work as addressing finer-grained spatial heterogeneity compared to methods assuming homogeneous degradations across image regions.

Among 29 candidates examined across three contributions, no clearly refuting prior work was identified. The SAVL framework examined nine candidates with zero refutable matches, the mutual information suppression mechanism examined ten candidates with zero refutable matches, and the degradation-aware SR network examined ten candidates with zero refutable matches. This suggests that within the limited search scope of top-K semantic matches and citation expansion, the combination of spatially amortized variational inference, mutual information suppression for degradation-content decoupling, and dual guidance mechanisms appears relatively unexplored in the examined literature.

Based on the limited search of 29 candidates, the work appears to occupy a distinct position addressing spatially variant degradation modeling through variational inference. The absence of sibling papers in its taxonomy leaf and the lack of refuting candidates suggest novelty within the examined scope, though this assessment is constrained by the search methodology and does not constitute an exhaustive literature review. The approach's emphasis on per-pixel degradation modeling differentiates it from neighboring methods focused on global or uniform degradation representations.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: Real-world image super-resolution with heterogeneous degradation modeling. The field addresses the challenge of restoring high-resolution images from low-quality inputs that exhibit diverse and spatially varying degradations, unlike the simplified uniform blur or noise assumptions of classical methods. The taxonomy reveals a rich structure organized around several complementary themes. Degradation Modeling and Representation Learning focuses on capturing the complex, often spatially variant degradation processes that occur in real-world scenarios, with works like Heterogeneous Degradation Representation[0] and Mixed Probabilistic Degradation[2] exploring how to represent and learn these heterogeneous patterns. Super-Resolution Network Architectures and Adaptation Mechanisms emphasizes designing flexible models that can adapt to varying degradation types, while Domain-Specific Real-World Super-Resolution targets particular application areas such as license plates, remote sensing, and faces. Parallel branches address Video Super-Resolution with Real-World Degradation[5], Frequency Domain and Iterative Degradation Modeling[3], GAN-Based Degradation Learning and Restoration[6][34], and Multi-Task frameworks that jointly handle multiple degradation orders or restoration objectives. A particularly active line of work centers on learning expressive degradation representations that can guide adaptive restoration. Some approaches leverage probabilistic or contrastive frameworks to disentangle degradation from content, while others incorporate semantic priors from vision-language models[11] or employ test-time adaptation strategies[16]. Heterogeneous Degradation Representation[0] sits within the spatially variant modeling cluster, emphasizing the need to handle non-uniform degradations that vary across image regions—a contrast to earlier works like Real-World Benchmark Model[4] or Practical Degradation Model[25], which often assume more homogeneous or globally parameterized degradation processes. Compared to neighboring efforts such as Elaborate Degradation Modeling[9] or Degradation-Adaptive Network[45], the original paper's focus on heterogeneity suggests a finer-grained treatment of local degradation variations, addressing scenarios where different image patches experience distinct degradation characteristics.

Claimed Contributions

Spatially Amortized Variational Learning (SAVL) framework

9 retrieved papers

The authors introduce SAVL, a framework that learns spatially varying Gaussian distributions for per-pixel degradations. It uses amortized inference networks to predict local posteriors from image neighborhoods, avoiding per-pixel optimization while capturing spatial heterogeneity in real-world image degradations.

9 retrieved papers

Mutual information suppression mechanism for degradation-content decoupling

10 retrieved papers

The method incorporates a mutual information suppression strategy that explicitly constrains the dependence between degradation representations and image content. This dual-lane approach ensures the learned representations capture degradation-specific factors while filtering out content-related signals.

10 retrieved papers

Degradation-aware SR network with dual guidance mechanism

10 retrieved papers

The authors propose a super-resolution network that leverages the learned degradation representation through a dual-modulation strategy: the posterior mode provides channel-wise guidance while the variance enables spatial feature modulation, enabling adaptive reconstruction under diverse degradation conditions.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Within the taxonomy built over the current TopK core-task papers, the original paper is assigned to a leaf with no direct siblings and no cousin branches under the same grandparent topic. In this retrieved landscape, it appears structurally isolated, which is one partial signal of novelty, but still constrained by search coverage and taxonomy granularity.

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Spatially Amortized Variational Learning (SAVL) framework

[71] Deep variational network toward blind image restoration PDF

Cannot Refute

[72] Neural Posterior Estimation for Cataloging Astronomical Images with Spatially Varying Backgrounds and Point Spread Functions PDF

Cannot Refute

[73] Semi-Unbalanced Optimal Transport for Reference-Based Image Restoration and Synthesis PDF

Cannot Refute

[74] Variational Deep Atmospheric Turbulence Correction for Video PDF

Cannot Refute

[75] Patch-based learning of space-variant hyperparameters in variational image restoration. PDF

Cannot Refute

[76] Fast Bayesian Estimation Using Location-Type Variational Representations PDF

Cannot Refute

[77] Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration PDF

Cannot Refute

[78] Video Variational Deep Atmospheric Turbulence Correction PDF

Cannot Refute

[79] Variational bayesian image restoration with a product of spatially weighted total variation image priors. PDF

Cannot Refute

Contribution

Mutual information suppression mechanism for degradation-content decoupling

[51] Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization PDF

Cannot Refute

[52] Avoiding shortcut-learning by mutual information minimization in deep learning-based image processing PDF

Cannot Refute

[53] DRFormer: Learning Disentangled Representation for Pan-Sharpening via Mutual Information- Based Transformer PDF

Cannot Refute

[54] Learning disentangled representations via mutual information estimation PDF

Cannot Refute

[55] CDG: Conditional Domain Generalization for Hyperspectral Imagery Classification with Convergence and Constrained-risk Theories PDF

Cannot Refute

[56] An interpretable image denoising framework via dual disentangled representation learning PDF

Cannot Refute

[57] Mutual information regularized feature-level frankenstein for discriminative recognition PDF

Cannot Refute

[58] Learning degradation-invariant representation for robust real-world person re-identification PDF

Cannot Refute

[59] Style-Based Attentive Network for Real-World Face Hallucination PDF

Cannot Refute

[60] MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation PDF

Cannot Refute

Contribution

Degradation-aware SR network with dual guidance mechanism

[61] Image super-resolution reconstruction based on feature map attention mechanism PDF

Cannot Refute

[62] Single image super-resolution via a holistic attention network PDF

Cannot Refute

[63] Activating More Pixels in Image Super-Resolution Transformer PDF

Cannot Refute

[64] Degradation-Aware Feature Perturbation for All-in-One Image Restoration PDF

Cannot Refute

[65] A study on the super resolution combining spatial attention and channel attention PDF

Cannot Refute

[66] Dual Aggregation Transformer for Image Super-Resolution PDF

Cannot Refute

[67] Enhanced Autoencoders With Attention-Embedded Degradation Learning for Unsupervised Hyperspectral Image Super-Resolution PDF

Cannot Refute

[68] Super-resolution reconstruction of medical images based on deep residual attention network PDF

Cannot Refute

[69] Squeeze & Excitation joint with Combined Channel and Spatial Attention for pathology image super-resolution PDF

Cannot Refute

[70] A Method for Image Super-Resolution Reconstruction with Attention Mechanism in Generative Adversarial Networks PDF

Cannot Refute

Learning Heterogeneous Degradation Representation for Real-World Super-Resolution

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

Contribution Analysis

Spatially Amortized Variational Learning (SAVL) framework

[71] Deep variational network toward blind image restoration PDF

[72] Neural Posterior Estimation for Cataloging Astronomical Images with Spatially Varying Backgrounds and Point Spread Functions PDF

[73] Semi-Unbalanced Optimal Transport for Reference-Based Image Restoration and Synthesis PDF

[74] Variational Deep Atmospheric Turbulence Correction for Video PDF

[75] Patch-based learning of space-variant hyperparameters in variational image restoration. PDF

[76] Fast Bayesian Estimation Using Location-Type Variational Representations PDF

[77] Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration PDF

[78] Video Variational Deep Atmospheric Turbulence Correction PDF

[79] Variational bayesian image restoration with a product of spatially weighted total variation image priors. PDF

Mutual information suppression mechanism for degradation-content decoupling

[51] Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization PDF

[52] Avoiding shortcut-learning by mutual information minimization in deep learning-based image processing PDF

[53] DRFormer: Learning Disentangled Representation for Pan-Sharpening via Mutual Information- Based Transformer PDF

[54] Learning disentangled representations via mutual information estimation PDF

[55] CDG: Conditional Domain Generalization for Hyperspectral Imagery Classification with Convergence and Constrained-risk Theories PDF

[56] An interpretable image denoising framework via dual disentangled representation learning PDF

[57] Mutual information regularized feature-level frankenstein for discriminative recognition PDF

[58] Learning degradation-invariant representation for robust real-world person re-identification PDF

[59] Style-Based Attentive Network for Real-World Face Hallucination PDF

[60] MIGA: Mutual Information-Guided Attack on Denoising Models for Semantic Manipulation PDF

Degradation-aware SR network with dual guidance mechanism

[61] Image super-resolution reconstruction based on feature map attention mechanism PDF

[62] Single image super-resolution via a holistic attention network PDF

[63] Activating More Pixels in Image Super-Resolution Transformer PDF

[64] Degradation-Aware Feature Perturbation for All-in-One Image Restoration PDF

[65] A study on the super resolution combining spatial attention and channel attention PDF

[66] Dual Aggregation Transformer for Image Super-Resolution PDF

[67] Enhanced Autoencoders With Attention-Embedded Degradation Learning for Unsupervised Hyperspectral Image Super-Resolution PDF

[68] Super-resolution reconstruction of medical images based on deep residual attention network PDF

[69] Squeeze & Excitation joint with Combined Channel and Spatial Attention for pathology image super-resolution PDF

[70] A Method for Image Super-Resolution Reconstruction with Attention Mechanism in Generative Adversarial Networks PDF

Table of Contents