COSA: Context-aware Output-Space Adapter for Test-Time Adaptation in Time Series Forecasting
Overview
Overall Novelty Assessment
The paper introduces COSA, a context-aware output-space adapter that directly corrects predictions of a frozen forecasting model using residual adjustments and gating. It resides in the Output-Space Correction Approaches leaf, which contains only two papers including COSA itself. This leaf sits within the broader Test-Time Adaptation Mechanisms branch, indicating a relatively sparse research direction focused on lightweight, prediction-level corrections rather than full model updates. The small sibling count suggests this specific approach—direct output correction with minimal parameter overhead—remains underexplored compared to heavier adaptation strategies.
The taxonomy reveals neighboring leaves such as Input-Space and Full-Model Adaptation, which houses six papers employing dual adapters or full parameter updates, and Foundation Model Adaptation, containing four papers on parameter-efficient tuning of pre-trained models. COSA diverges from these by avoiding input transformations or extensive parameter modifications, instead operating solely in output space. The Continuous Online Adaptation branch, with ten papers on incremental learning and ensembling, represents a related but distinct paradigm emphasizing streaming updates without the frozen-model constraint. COSA's design philosophy aligns more closely with efficiency-focused methods than with detection-driven approaches in the Concept Drift Detection branch.
Among twenty-nine candidates examined across three contributions, none were flagged as clearly refuting COSA's novelty. For the core adapter mechanism, nine candidates were examined with zero refutations; for the gating-based residual correction, ten with none found refutable; and for the adaptive learning schedule, ten with no overlaps found. This suggests that within the limited search scope (top-K semantic matches plus citation expansion), no prior work directly anticipates COSA's combination of output-space correction, context-aware gating, and adaptive learning. However, the search scale is modest, and the sparse Output-Space Correction leaf means fewer directly comparable prior methods exist.
Based on the limited literature search, COSA appears to occupy a relatively novel position within test-time adaptation for time series forecasting. The sparse sibling count and absence of refutable candidates among twenty-nine examined papers suggest the specific design—frozen base model with lightweight output correction—has not been extensively explored. Nonetheless, the analysis covers a focused subset of the field, and broader surveys or domain-specific venues may reveal additional related work not captured here.
Taxonomy
Research Landscape Overview
Claimed Contributions
COSA is a single output-space adapter that directly corrects predictions from a frozen base forecaster using residual correction modulated by gating. It utilizes the original prediction and a lightweight context vector summarizing statistics from recently observed ground truth, avoiding the dual input-output adapter design of prior methods.
The adapter performs linear residual correction by concatenating base predictions with a context vector (summarizing recent ground truth statistics) and applying a learnable gating mechanism to modulate the correction strength, enabling adaptive output adjustment.
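The correction described above (a linear residual head and a learnable gate, both fed the concatenation of the base prediction and a context vector) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the specific summary statistics in the context vector, the sigmoid gate, and the initialization scales are all assumptions.

```python
import numpy as np

def context_vector(recent_truth):
    """Hypothetical context vector: summary statistics of recently observed
    ground truth (mean, std, last value, mean first difference)."""
    y = np.asarray(recent_truth, dtype=float)
    return np.array([y.mean(), y.std(), y[-1], np.diff(y).mean()])

class GatedResidualAdapter:
    """Minimal sketch of an output-space adapter: a linear residual head and
    a sigmoid gate, each applied to [base_prediction, context]."""

    def __init__(self, horizon, ctx_dim, rng=None):
        rng = np.random.default_rng(rng)
        d = horizon + ctx_dim
        self.W_r = rng.normal(scale=0.01, size=(horizon, d))  # residual weights
        self.b_r = np.zeros(horizon)
        self.W_g = np.zeros((horizon, d))  # gate weights; gate starts near 0.5
        self.b_g = np.zeros(horizon)

    def __call__(self, y_hat, ctx):
        z = np.concatenate([y_hat, ctx])            # concatenate prediction + context
        residual = self.W_r @ z + self.b_r          # linear residual correction
        gate = 1.0 / (1.0 + np.exp(-(self.W_g @ z + self.b_g)))  # sigmoid in (0, 1)
        return y_hat + gate * residual              # frozen base output + gated correction
```

Because the base forecaster stays frozen, only the adapter's few weight matrices need updating at test time, which is what keeps the parameter overhead minimal.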
COSA employs a cosine-adaptive learning rate (CALR) schedule that adjusts the learning rate online based on short-horizon loss trends, enabling faster convergence within limited adaptation steps while maintaining stability through early stopping and gradient clipping.
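One plausible reading of the CALR schedule is a cosine decay over the adaptation budget whose magnitude is rescaled by the short-horizon loss trend. The sketch below illustrates that reading only; the boost/damp factors, the trend definition, and the constants are assumptions, not the paper's specification.

```python
import math

def calr_step(base_lr, step, total_steps, recent_losses,
              min_lr=1e-5, boost=1.5, damp=0.5):
    """Hypothetical cosine-adaptive learning rate: cosine decay over the
    adaptation budget, scaled up while the short-horizon loss trend is
    falling and scaled down when it rises."""
    cos_lr = min_lr + 0.5 * (base_lr - min_lr) * (
        1 + math.cos(math.pi * step / total_steps))
    if len(recent_losses) >= 2:
        trend = recent_losses[-1] - recent_losses[0]  # short-horizon loss change
        scale = boost if trend < 0 else damp          # falling loss -> speed up
    else:
        scale = 1.0
    return max(min_lr, cos_lr * scale)
```

In an adaptation loop this would pair with the stability measures the paper names: stop early once the loss trend flattens, and clip gradients before each adapter update.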
Core Task Comparisons
Comparisons with papers in the same taxonomy category
[22] Accurate Parameter-Efficient Test-Time Adaptation for Time Series Forecasting PDF
Contribution Analysis
Detailed comparisons for each claimed contribution
Context-aware Output-Space Adapter (COSA)
COSA is a single output-space adapter that directly corrects predictions from a frozen base forecaster using residual correction modulated by gating. It utilizes the original prediction and a lightweight context vector summarizing statistics from recently observed ground truth, avoiding the dual input-output adapter design of prior methods.
[29] Calibration of time-series forecasting: Detecting and adapting context-driven distribution shift PDF
[71] Personalized adapter for large meteorology model on devices: Towards weather foundation models PDF
[72] Neutral residues: revisiting adapters for model extension PDF
[73] Improving zero-shot generalization for clip with variational adapter PDF
[74] Feature fusion and enhancement for lightweight visible-thermal infrared tracking via multiple adapters PDF
[75] Residual Adapters for Targeted Updates in RNN-Transducer Based Speech Recognition System PDF
[76] GRASP: Guided Residual Adapters with Sample-wise Partitioning PDF
[77] A Provable Quantile Regression Adapter via Transfer Learning PDF
[78] Adaptive Forecasting of EV Aggregator Loads and Price Elasticities: A KAN-enhanced Foundation Model Adapter PDF
Context-aware linear residual with gating mechanism
The adapter performs linear residual correction by concatenating base predictions with a context vector (summarizing recent ground truth statistics) and applying a learnable gating mechanism to modulate the correction strength, enabling adaptive output adjustment.
[51] Gated Linear Attention Transformers with Hardware-Efficient Training PDF
[52] ReGLA: Refining Gated Linear Attention PDF
[53] A gate-aware GRU model with trend-residual decomposition and quantile regression for remaining useful life prediction of IGBT PDF
[54] Residual Gated Graph ConvNets PDF
[55] HGRN2: Gated Linear RNNs with State Expansion PDF
[56] Automatic building extraction from high-resolution aerial images and LiDAR data using gated residual refinement network PDF
[57] Improving multi-step dissolved oxygen prediction in aquaculture using adaptive temporal convolution and optimized transformer PDF
[58] Realizing linear synaptic plasticity in electric double layer-gated transistors for improved predictive accuracy and efficiency in neuromorphic computing PDF
[59] A hybrid squeeze excitation gate recurrent unit-autoregressive integrated moving average model for long-term state of health estimation of lithium-ion batteries with … PDF
[60] Holistic Transmission Performance Prediction of Balise System With Gate-Steered Residual Interweave Networks PDF
Adaptive learning rate schedule for fast adaptation
COSA employs a cosine-adaptive learning rate (CALR) schedule that adjusts the learning rate online based on short-horizon loss trends, enabling faster convergence within limited adaptation steps while maintaining stability through early stopping and gradient clipping.