Oracle-efficient Hybrid Learning with Constrained Adversaries

ICLR 2026 Conference SubmissionAnonymous Authors

OpenReview Score: 6.5 Download Report PDF

hybrid online learningoracle-efficiencyERMrademacher complexity

The Hybrid Online Learning Problem, where features are drawn i.i.d. from an unknown distribution but labels are generated adversarially, is a well-motivated setting positioned between statistical and fully-adversarial online learning. Prior work has presented a dichotomy: algorithms that are statistically-optimal, but computationally intractable \citep{wu2023expected}, and algorithms that are computationally-efficient (given an ERM oracle), but statistically-suboptimal \citep{pmlr-v247-wu24a}.

This paper takes a significant step towards achieving statistical optimality and computational efficiency \emph{simultaneously} in the Hybrid Learning setting. To do so, we consider a structured setting, where the Adversary is constrained to pick labels from an expressive, but fixed, class of functions $\mathcal{R}$ . Our main result is a new learning algorithm, which runs efficiently given an ERM oracle and obtains regret scaling with the Rademacher complexity of a class derived from the Learner's hypothesis class $\mathcal{H}$ and the Adversary's label class $\mathcal{R}$ . As a key corollary, we give an oracle-efficient algorithm for computing equilibria in stochastic zero-sum games when action sets may be high-dimensional but the payoff function exhibits a type of low-dimensional structure. Technically, we develop a number of novel tools for the design and analysis of our learning algorithm, including a novel Frank-Wolfe reduction with "truncated entropy regularizer" and a new tail bound for sums of "hybrid'' martingale difference sequences.

Abstract:

Disclaimer

This report is AI-GENERATED using Large Language Models and WisPaper (A scholar search engine). It analyzes academic papers' tasks and contributions against retrieved prior work. While this system identifies POTENTIAL overlaps and novel directions, ITS COVERAGE IS NOT EXHAUSTIVE AND JUDGMENTS ARE APPROXIMATE. These results are intended to assist human reviewers and SHOULD NOT be relied upon as a definitive verdict on novelty.

NOTE that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.

If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Overall Novelty Assessment

The paper proposes an oracle-efficient algorithm for hybrid online learning where adversaries select labels from a fixed expressive function class, achieving regret bounds scaling with Rademacher complexity. It resides in the 'Constrained Adversarial Label Classes' leaf, which contains only two papers including this one. This represents a sparse research direction within the broader taxonomy of eight papers across three main branches, suggesting the specific combination of oracle efficiency and structured adversarial constraints remains relatively unexplored compared to adjacent areas like smoothed adversarial models or robustness-focused approaches.

The taxonomy reveals neighboring work in smoothed adversarial models that exploit loss smoothness or uniform feature distributions rather than explicit adversarial constraints. The broader 'Oracle-Efficient Algorithms with Structured Adversaries' branch encompasses both constrained label classes and smoothness-based methods, indicating two parallel strategies for achieving computational tractability. Outside this branch, robustness-focused work addresses Byzantine attacks and replicability under distribution drift, while feedback graph extensions explore richer observation structures—directions orthogonal to this paper's focus on constrained adversarial label generation with standard full-information feedback.

Among fifteen candidates examined, the first contribution (oracle-efficient algorithm for constrained adversaries) shows one refutable candidate among eight examined, suggesting some prior overlap in this specific algorithmic direction. The second contribution (Frank-Wolfe reduction with truncated entropy) examined five candidates with none refutable, indicating greater technical novelty in the reduction mechanism. The third contribution (tail bound for hybrid martingales) examined only two candidates with no refutations, though the limited search scope makes it difficult to assess whether this reflects true novelty or simply sparse coverage of this technical tool.

Based on top-fifteen semantic matches, the work appears to occupy a relatively underexplored niche combining oracle efficiency with structured adversarial constraints. The limited taxonomy size and sparse leaf population suggest this specific problem formulation has received less attention than adjacent smoothness-based or robustness-focused approaches. However, the restricted search scope means potentially relevant work in broader online learning or game theory literatures may not be fully captured in this analysis.

Taxonomy

Core-task Taxonomy Papers

Claimed Contributions

Contribution Candidate Papers Compared

Refutable Paper

Research Landscape Overview

Core task: hybrid online learning with i.i.d. features and adversarial labels. This setting blends stochastic and adversarial elements, where feature vectors arrive independently from a fixed distribution but labels may be chosen adversarially. The taxonomy organizes the field into three main branches. The first, Oracle-Efficient Algorithms with Structured Adversaries, focuses on designing computationally tractable methods that exploit constraints or structure in the adversary's label choices, enabling efficient learning without exhaustive enumeration. The second branch, Robustness and Reliability in Adversarial Online Learning, emphasizes algorithmic stability under worst-case perturbations, including Byzantine attacks and replicability requirements. The third, Feedback Graph Extensions, broadens the framework to settings where learners observe outcomes beyond their own actions, leveraging richer feedback structures to improve regret guarantees. Within the Oracle-Efficient branch, several lines of work explore different forms of adversarial structure. Some studies impose smoothness or distributional assumptions on adversaries, as in Smoothed Adversaries[1] and Smoothness Improved Regret[6], which relax pure adversarial models to achieve tighter bounds. Others, like Hybrid Constrained Adversaries[0] and Hybrid Unknown Distribution[8], investigate scenarios where the adversary's label-generating mechanism is constrained or partially unknown, requiring learners to adapt without full knowledge of the constraint class. Meanwhile, works such as Online Platt Scaling[3] and Calfat[4] address calibration and probabilistic prediction under adversarial feedback. Hybrid Constrained Adversaries[0] sits squarely in this constrained-adversary cluster, sharing with Hybrid Unknown Distribution[8] an emphasis on limited adversarial power, yet differing in whether the constraint structure is known or must be inferred during learning.

Claimed Contributions

Oracle-efficient algorithm for Hybrid Online Learning with constrained adversaries

Can Refute

8 retrieved papers

The authors develop a computationally efficient learning algorithm for the Hybrid Online Learning problem where the adversary is constrained to choose labels from a fixed function class R. The algorithm achieves near-optimal regret bounds that scale with the Rademacher complexity of the composite class while requiring only oracle access to linear optimization over the hypothesis class H.

8 retrieved papers

Can Refute

Novel Frank-Wolfe reduction with truncated entropy regularizer

5 retrieved papers

The authors introduce a new technical tool consisting of a Frank-Wolfe reduction combined with a truncated entropy regularization scheme. This enables efficient implementation of their learning algorithm using only linear optimization oracles, avoiding the need for computationally expensive operations over the full hypothesis class.

5 retrieved papers

Tail bound for hybrid martingale difference sequences

2 retrieved papers

The authors prove a novel uniform convergence bound (Proposition 1.3) that handles concentration for function classes evaluated on i.i.d. data where the functions themselves are chosen adaptively based on previous samples. This addresses the challenge of bounding martingale difference sequences in the hybrid setting where features are stochastic but labels are adversarial.

2 retrieved papers

Core Task Comparisons

Comparisons with papers in the same taxonomy category

[8] Oracle-Efficient Hybrid Online Learning with Unknown Distribution PDF

Wu Changlong, Changlong Wu, Sima, Jin, Jin Sima, Szpankowski, Wojciech, Wojciech Szpankowski, Wojtek Szpankowski (2024)

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution

Oracle-efficient algorithm for Hybrid Online Learning with constrained adversaries

[11] Online learning: Stochastic, constrained, and smoothed adversaries PDF

Can Refute

[1] Oracle-efficient online learning for smoothed adversaries PDF

Cannot Refute

[9] Multiclass online learning and uniform convergence PDF

Cannot Refute

[10] Regret guarantees for online deep control PDF

Cannot Refute

[12] Online Learning: Stochastic and Constrained Adversaries PDF

Cannot Refute

[13] Hierarchies of relaxations for online prediction problems with evolving constraints PDF

Cannot Refute

[14] A Characterization of Online Multiclass Learnability PDF

Cannot Refute

[15] Rademacher complexity of stationary sequences PDF

Cannot Refute

Contribution

Novel Frank-Wolfe reduction with truncated entropy regularizer

[18] A conditional gradient algorithm for distributed online optimization in networks PDF

Cannot Refute

[19] Provably efficient maximum entropy exploration PDF

Cannot Refute

[20] Statistical inference of convex order by Wasserstein projection PDF

Cannot Refute

[21] Entropic optimal transport with congestion aversion Application to relocation of drones PDF

Cannot Refute

[22] Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond PDF

Cannot Refute

Contribution

Tail bound for hybrid martingale difference sequences

[16] Toward optimal adaptive online shortest path routing with acceleration under jamming attack PDF

Cannot Refute

[17] Minimizing regret with label efficient prediction PDF

Cannot Refute

Oracle-efficient Hybrid Learning with Constrained Adversaries

Overview

Overall Novelty Assessment

Taxonomy

Research Landscape Overview

Claimed Contributions

Core Task Comparisons

[8] Oracle-Efficient Hybrid Online Learning with Unknown Distribution PDF

Contribution Analysis

Oracle-efficient algorithm for Hybrid Online Learning with constrained adversaries

[11] Online learning: Stochastic, constrained, and smoothed adversaries PDF

[1] Oracle-efficient online learning for smoothed adversaries PDF

[9] Multiclass online learning and uniform convergence PDF

[10] Regret guarantees for online deep control PDF

[12] Online Learning: Stochastic and Constrained Adversaries PDF

[13] Hierarchies of relaxations for online prediction problems with evolving constraints PDF

[14] A Characterization of Online Multiclass Learnability PDF

[15] Rademacher complexity of stationary sequences PDF

Novel Frank-Wolfe reduction with truncated entropy regularizer

[18] A conditional gradient algorithm for distributed online optimization in networks PDF

[19] Provably efficient maximum entropy exploration PDF

[20] Statistical inference of convex order by Wasserstein projection PDF

[21] Entropic optimal transport with congestion aversion Application to relocation of drones PDF

[22] Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond PDF

Tail bound for hybrid martingale difference sequences

[16] Toward optimal adaptive online shortest path routing with acceleration under jamming attack PDF

[17] Minimizing regret with label efficient prediction PDF

Table of Contents