You Point, I Learn: Online Adaptation of Interactive Segmentation Models for Handling Distribution Shifts in Medical Imaging

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 6.0 Download Report PDF

Medical Image SegmentationInteractiveOnline Adaptation

Interactive segmentation uses real-time user inputs, such as mouse clicks, to iteratively refine model predictions. Although not originally designed to address distribution shifts, this paradigm naturally lends itself to such challenges. In medical imaging, where distribution shifts are common, interactive methods can use user inputs to guide models towards improved predictions. Moreover, once a model is deployed, user corrections can be used to adapt the network parameters to the new data distribution, mitigating distribution shift. Based on these insights, we aim to develop a practical, effective method for improving the adaptive capabilities of interactive segmentation models to new data distributions in medical imaging. Firstly, we found that strengthening the model's responsiveness to clicks is important for the initial training process. Moreover, we show that by treating the post-interaction user-refined model output as pseudo-ground-truth, we can design a lean, practical online adaptation method that enables a model to learn effectively across sequential test images. The framework includes two components: (i) a Post-Interaction adaptation process, updating the model after the user has completed interactive refinement of an image, and (ii) a Mid-Interaction adaptation process, updating incrementally after each click. Both processes include a Click-Centered Gaussian loss that strengthens the model's reaction to clicks and enhances focus on user-guided, clinically relevant regions. Experiments on 5 fundus and 4 brain‑MRI databases show that our approach consistently outperforms existing methods under diverse distribution shifts, including unseen imaging modalities and pathologies. Code and pretrained models will be released upon publication.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes a framework for online adaptation of interactive segmentation models under distribution shifts in medical imaging, introducing a Click-Centered Gaussian loss to strengthen click responsiveness and a post-interaction adaptation method using user-refined outputs as pseudo ground-truth. It resides in the 'Direct Parameter Update Methods' leaf under 'Continual and Online Adaptation Frameworks', alongside two sibling papers that also update parameters directly from user corrections. This leaf represents a focused research direction within the broader taxonomy of thirty papers across multiple adaptation paradigms, indicating a moderately populated but not overcrowded niche addressing real-time parameter updates without explicit forgetting prevention mechanisms.

The taxonomy reveals neighboring research directions that contextualize this work. The sibling leaf 'Teacher-Student and Knowledge Retention Architectures' contains one paper employing distillation to prevent catastrophic forgetting, while 'Reinforcement-Based Interactive Learning' houses one work using reinforcement signals for noisy feedback. Adjacent branches include 'Test-Time Adaptation and Domain Generalization' with five papers exploring self-supervised objectives and foundation model refinement, and 'Domain Adaptation Methods' with six papers addressing feature alignment and active learning. The scope note for the paper's leaf explicitly excludes teacher-student frameworks and reinforcement approaches, positioning this work as a direct update strategy distinct from more complex retention architectures.

Among twenty-seven candidates examined, none clearly refute the three proposed contributions. The Click-Centered Gaussian loss examined nine candidates with zero refutations, the post-interaction adaptation method examined eight with zero refutations, and the mid-interaction process examined ten with zero refutations. This suggests that within the limited search scope, the specific combination of click-focused loss design and dual-stage adaptation appears underexplored. However, the sibling papers 'Learning from Corrections' and 'Continuous Online Adaptation' likely share conceptual overlap in using user corrections for parameter updates, though the contribution-level analysis did not identify direct refutations among the examined candidates.

Based on the limited literature search covering top-K semantic matches and citation expansion, the work appears to occupy a distinct position within direct parameter update methods. The absence of refutations across all contributions suggests novelty in the specific technical approach, though the small number of sibling papers and the focused scope of the search mean this assessment reflects only the examined subset of the field rather than an exhaustive comparison.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: online adaptation of interactive segmentation models under distribution shifts. The field addresses how segmentation systems can continuously improve when deployed in real-world environments where data distributions differ from training conditions. The taxonomy reveals several complementary research directions: Continual and Online Adaptation Frameworks focus on methods that update model parameters incrementally during deployment, often leveraging user corrections as supervision signals; Test-Time Adaptation and Domain Generalization explore techniques that adjust models at inference time without extensive retraining; Domain Adaptation Methods tackle cross-domain transfer through alignment strategies; Supervised Fine-Tuning Strategies investigate how labeled data can refine pre-trained models; Application-Specific Interactive Segmentation targets particular domains like medical imaging or aerial imagery; while Surveys and Conceptual Frameworks provide broader perspectives on the landscape. Works such as Interactive Segmentation Review[3] synthesize these diverse threads, and systems like nnInteractive[4] demonstrate practical implementations across multiple branches. Recent efforts reveal a tension between adaptation speed and stability under continuous distribution shifts. A small cluster of works emphasizes direct parameter updates from user feedback, including Learning from Corrections[2] and Continuous Online Adaptation[17], which refine models incrementally as annotators provide corrective clicks. You Point I Learn[0] sits squarely within this branch, proposing mechanisms to learn from interactive corrections in real time. Nearby approaches like Teacher Student Interactive[1] and Self-supervised Interactive[6] explore alternative supervision paradigms, balancing the need for rapid adaptation against the risk of catastrophic forgetting. Meanwhile, methods such as Reinforced Interactive Continual[12] and Continual Hippocampus[13] address longer-term continual learning scenarios with smoother shifts. The central challenge remains designing update rules that are both responsive to immediate user input and robust to the non-stationary data streams characteristic of deployment environments.

Claimed Contributions

Click-Centered Gaussian (CCG) loss for interactive segmentation

9 retrieved papers

A novel loss function that strengthens the model's responsiveness to user clicks by applying spatially-weighted penalties in regions surrounding each click. The loss uses a Gaussian kernel and is class-limited, applying only to pixels that should share the same class as the click.

9 retrieved papers

Post-Interaction online adaptation method using pseudo ground-truth

8 retrieved papers

A two-stage online adaptation approach that updates the model after user completes interactive refinement of an image. It treats the user-corrected final segmentation as pseudo ground-truth and includes fine-tuning with localization clicks and multiple correction clicks generated from erroneous regions.

8 retrieved papers

Mid-Interaction online adaptation process

10 retrieved papers

An online adaptation mechanism that updates model parameters incrementally after each individual user click during the interactive refinement process. It uses the model output before and after each click as pseudo ground-truth, combined with the CCG loss to focus learning on click-centered regions.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[2] Continuous adaptation for interactive object segmentation by learning from corrections PDF

Kontogianni, Theodora, Theodora Kontogianni, Gygli, Michael, Michael Gygli, Uijlings, Jasper, Jasper Uijlings, Ferrari, Vittorio, Vittorio Ferrari, J. Uijlings, V. Ferrari (2020)

[17] Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation PDF

XU Wentian, Liang Zi-Yun, Anthony Harry, Ibrahim Yasin, Yang Guang, Whitehouse, Daniel, Menon David, Newcombe, Virginia, Kamnitsas, Konstantinos (2025)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Click-Centered Gaussian (CCG) loss for interactive segmentation

[31] Rethinking RoI Strategy in Interactive 3D Segmentation for Medical Images PDF

Cannot Refute

[32] Guiding the Guidance: A Comparative Analysis of User Guidance Signals for Interactive Segmentation of Volumetric Images PDF

Cannot Refute

[33] AeroClick: An advanced single-click interactive framework for aeroengine defect segmentation PDF

Cannot Refute

[34] Structured click control in transformer-based interactive segmentation PDF

Cannot Refute

[35] A dual-stream framework guided by adaptive gaussian maps for interactive image segmentation PDF

Cannot Refute

[36] Improving Click-based Interactive Image Segmentation by Click Simulation and Triangle Encoding PDF

Cannot Refute

[37] FIST: fast interactive segmentation of tumors PDF

Cannot Refute

[38] Improving Interactive Segmentation Techniques in Medical Imaging PDF

Cannot Refute

[39] Interactive segmentation using U-Net with weight map and dynamic user interactions PDF

Cannot Refute

Contribution

Post-Interaction online adaptation method using pseudo ground-truth

[5] Leveraging AI Predicted and Expert Revised Annotations in Interactive Segmentation: Continual Tuning or Full Training? PDF

Cannot Refute

[40] A dynamic interactive learning framework for automated 3D medical image segmentation PDF

Cannot Refute

[41] DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images PDF

Cannot Refute

[42] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion PDF

Cannot Refute

[43] SqueezeSAM: User friendly mobile interactive segmentation PDF

Cannot Refute

[44] Weakly-supervised semantic segmentation via online pseudo-mask correcting PDF

Cannot Refute

[45] Interactive Video Object Mask Annotation PDF

Cannot Refute

[46] Addressing Intermediate Verification Latency in Online Learning Through Immediate Pseudo-labeling and Oriented Synthetic Correction PDF

Cannot Refute

Contribution

Mid-Interaction online adaptation process

[1] Continuous adaptation for interactive segmentation using teacher-student architecture PDF

Cannot Refute

[3] Deep interactive segmentation of medical images: A systematic review and taxonomy PDF

Cannot Refute

[40] A dynamic interactive learning framework for automated 3D medical image segmentation PDF

Cannot Refute

[47] An interactive segmentation-based method for seismic facies annotation and segmentation PDF

Cannot Refute

[48] Click prompt learning with optimal transport for interactive segmentation PDF

Cannot Refute

[49] Iteratively trained interactive segmentation PDF

Cannot Refute

[50] Scale-aware test-time click adaptation for pulmonary nodule and mass segmentation PDF

Cannot Refute

[51] AdaptiveClick: Click-Aware Transformer With Adaptive Focal Loss for Interactive Image Segmentation PDF

Cannot Refute

[52] Sequential interactive image segmentation PDF

Cannot Refute

[53] AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation PDF

Cannot Refute

You Point, I Learn: Online Adaptation of Interactive Segmentation Models for Handling Distribution Shifts in Medical Imaging

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[2] Continuous adaptation for interactive object segmentation by learning from corrections PDF

[17] Continuous Online Adaptation Driven by User Interaction for Medical Image Segmentation PDF

Contribution Analysis

Click-Centered Gaussian (CCG) loss for interactive segmentation

[31] Rethinking RoI Strategy in Interactive 3D Segmentation for Medical Images PDF

[32] Guiding the Guidance: A Comparative Analysis of User Guidance Signals for Interactive Segmentation of Volumetric Images PDF

[33] AeroClick: An advanced single-click interactive framework for aeroengine defect segmentation PDF

[34] Structured click control in transformer-based interactive segmentation PDF

[35] A dual-stream framework guided by adaptive gaussian maps for interactive image segmentation PDF

[36] Improving Click-based Interactive Image Segmentation by Click Simulation and Triangle Encoding PDF

[37] FIST: fast interactive segmentation of tumors PDF

[38] Improving Interactive Segmentation Techniques in Medical Imaging PDF

[39] Interactive segmentation using U-Net with weight map and dynamic user interactions PDF

Post-Interaction online adaptation method using pseudo ground-truth

[5] Leveraging AI Predicted and Expert Revised Annotations in Interactive Segmentation: Continual Tuning or Full Training? PDF

[40] A dynamic interactive learning framework for automated 3D medical image segmentation PDF

[41] DeepEdit: Deep Editable Learning for Interactive Segmentation of 3D Medical Images PDF

[42] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion PDF

[43] SqueezeSAM: User friendly mobile interactive segmentation PDF

[44] Weakly-supervised semantic segmentation via online pseudo-mask correcting PDF

[45] Interactive Video Object Mask Annotation PDF

[46] Addressing Intermediate Verification Latency in Online Learning Through Immediate Pseudo-labeling and Oriented Synthetic Correction PDF

Mid-Interaction online adaptation process

[1] Continuous adaptation for interactive segmentation using teacher-student architecture PDF

[3] Deep interactive segmentation of medical images: A systematic review and taxonomy PDF

[40] A dynamic interactive learning framework for automated 3D medical image segmentation PDF

[47] An interactive segmentation-based method for seismic facies annotation and segmentation PDF

[48] Click prompt learning with optimal transport for interactive segmentation PDF

[49] Iteratively trained interactive segmentation PDF

[50] Scale-aware test-time click adaptation for pulmonary nodule and mass segmentation PDF

[51] AdaptiveClick: Click-Aware Transformer With Adaptive Focal Loss for Interactive Image Segmentation PDF

[52] Sequential interactive image segmentation PDF

[53] AdaptiveClick: Clicks-aware Transformer with Adaptive Focal Loss for Interactive Image Segmentation PDF

Table of Contents