COSA: Context-aware Output-Space Adapter for Test-Time Adaptation in Time Series Forecasting

ICLR 2026 Conference SubmissionAnonymous Authors
Test-Time AdaptationTime-Series ForecastingSimple Adapter
Abstract:

Deployed time-series forecasters suffer performance degradation under non-stationarity and distribution shifts. Test-time adaptation (TTA) for time-series forecasting differs from vision TTA because ground truth becomes observable shortly after prediction. Existing time-series TTA methods typically employ dual input/output adapters that indirectly modify data distributions, making their effect on the frozen model difficult to analyze. We introduce the Context-aware Output-Space Adapter (COSA), a minimal, plug-and-play adapter that directly corrects predictions of a frozen base model. COSA performs residual correction modulated by gating, utilizing the original prediction and a lightweight context vector that summarizes statistics from recently observed ground truth. At test time, only the adapter parameters (linear layer and gating) are updated under a leakage-free protocol, using observed ground truth with an adaptive learning rate schedule for faster adaptation. Across diverse scenarios, COSA demonstrates substantial performance gains versus baselines without TTA (13.91\sim17.03%) and SOTA TTA methods (10.48\sim13.05%), with particularly large improvements at long horizons, while adding a reasonable level of parameters and negligible computational overhead. The simplicity of COSA makes it architecture-agnostic and deployment-friendly. Source code: https://anonymous.4open.science/r/linear-adapter-A720

Disclaimer
This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.
NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper introduces COSA, a context-aware output-space adapter that directly corrects predictions of a frozen forecasting model using residual adjustments and gating. It resides in the Output-Space Correction Approaches leaf, which contains only two papers including COSA itself. This leaf sits within the broader Test-Time Adaptation Mechanisms branch, indicating a relatively sparse research direction focused on lightweight, prediction-level corrections rather than full model updates. The small sibling count suggests this specific approach—direct output correction with minimal parameter overhead—remains underexplored compared to heavier adaptation strategies.

The taxonomy reveals neighboring leaves such as Input-Space and Full-Model Adaptation, which houses six papers employing dual adapters or full parameter updates, and Foundation Model Adaptation, containing four papers on parameter-efficient tuning of pre-trained models. COSA diverges from these by avoiding input transformations or extensive parameter modifications, instead operating solely in output space. The Continuous Online Adaptation branch, with ten papers on incremental learning and ensembling, represents a related but distinct paradigm emphasizing streaming updates without the frozen-model constraint. COSA's design philosophy aligns more closely with efficiency-focused methods than with detection-driven approaches in the Concept Drift Detection branch.

Among twenty-nine candidates examined across three contributions, none were flagged as clearly refuting COSA's novelty. The core adapter mechanism examined nine candidates with zero refutations, the gating-based residual correction examined ten with none refutable, and the adaptive learning schedule also examined ten with no overlaps found. This suggests that within the limited search scope—top-K semantic matches plus citation expansion—no prior work directly anticipates COSA's combination of output-space correction, context-aware gating, and adaptive learning. However, the search scale is modest, and the sparse Output-Space Correction leaf indicates fewer benchmarks exist for comparison.

Based on the limited literature search, COSA appears to occupy a relatively novel position within test-time adaptation for time series forecasting. The sparse sibling count and absence of refutable candidates among twenty-nine examined papers suggest the specific design—frozen base model with lightweight output correction—has not been extensively explored. Nonetheless, the analysis covers a focused subset of the field, and broader surveys or domain-specific venues may reveal additional related work not captured here.

Taxonomy

Core-task Taxonomy Papers
50
3
Claimed Contributions
29
Contribution Candidate Papers Compared
0
Refutable Paper

Research Landscape Overview

Core task: test-time adaptation in time series forecasting. The field addresses how forecasting models can adjust to evolving data distributions without retraining from scratch. The taxonomy reveals several major branches: Test-Time Adaptation Mechanisms explore how models update at inference (e.g., output-space corrections, parameter-efficient tuning as in Parameter Efficient TTA[22]), while Concept Drift Detection and Handling focuses on identifying and responding to distributional shifts (e.g., Detect then Adapt[11], Battling Nonstationarity[3]). Continuous Online Adaptation emphasizes streaming scenarios where models learn incrementally (Lightweight Online Adaption[10], Active Adaptation Streaming[21]), and Domain Adaptation and Transfer Learning tackle cross-domain generalization (AdaRNN[26], MemDA[47]). Specialized Adaptation Contexts address domain-specific challenges like traffic (Traffic Flow TTA[6]) or climate (Climate Resilience Forecasting[5]), while Uncertainty Quantification and Robustness ensure reliable predictions under shift (Adaptive Conformal[18]). Architectural Innovations (Informer[7], Time LlaMA[17]) and Methodological Foundations provide the technical substrate, with Application-Specific Forecasting targeting real-world deployments. A central tension emerges between lightweight, output-space corrections and deeper model updates. COSA[0] exemplifies the former, residing in Output-Space Correction Approaches alongside Parameter Efficient TTA[22], both prioritizing minimal computational overhead by adjusting predictions or a small parameter subset rather than full model retraining. This contrasts with heavier adaptation strategies like Proactive Model Adaptation[4] or Foundational LoRA[13], which modify internal representations more extensively. COSA[0] shares the efficiency ethos of Parameter Efficient TTA[22] but focuses on correcting outputs directly, making it particularly suitable for resource-constrained or latency-sensitive settings. Meanwhile, works like Battling Nonstationarity[3] and Shift Aware TTA[48] emphasize detecting when adaptation is needed, complementing correction methods by triggering updates selectively. The interplay between detection, correction, and continuous learning remains an active area, with COSA[0] contributing a streamlined correction pathway within this broader landscape.

Claimed Contributions

Context-aware Output-Space Adapter (COSA)

COSA is a single output-space adapter that directly corrects predictions from a frozen base forecaster using residual correction modulated by gating. It utilizes the original prediction and a lightweight context vector summarizing statistics from recently observed ground truth, avoiding the dual input-output adapter design of prior methods.

9 retrieved papers
Context-aware linear residual with gating mechanism

The adapter performs linear residual correction by concatenating base predictions with a context vector (summarizing recent ground truth statistics) and applying a learnable gating mechanism to modulate the correction strength, enabling adaptive output adjustment.

10 retrieved papers
Adaptive learning rate schedule for fast adaptation

COSA employs a cosine-adaptive learning rate (CALR) schedule that adjusts the learning rate online based on short-horizon loss trends, enabling faster convergence within limited adaptation steps while maintaining stability through early stopping and gradient clipping.

10 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Context-aware Output-Space Adapter (COSA)

COSA is a single output-space adapter that directly corrects predictions from a frozen base forecaster using residual correction modulated by gating. It utilizes the original prediction and a lightweight context vector summarizing statistics from recently observed ground truth, avoiding the dual input-output adapter design of prior methods.

Contribution

Context-aware linear residual with gating mechanism

The adapter performs linear residual correction by concatenating base predictions with a context vector (summarizing recent ground truth statistics) and applying a learnable gating mechanism to modulate the correction strength, enabling adaptive output adjustment.

Contribution

Adaptive learning rate schedule for fast adaptation

COSA employs a cosine-adaptive learning rate (CALR) schedule that adjusts the learning rate online based on short-horizon loss trends, enabling faster convergence within limited adaptation steps while maintaining stability through early stopping and gradient clipping.