On the O(1/T) Convergence of Alternating Gradient Descent–Ascent in Bilinear Games

ICLR 2026 Conference Submission
Anonymous Authors
Keywords: Two-player zero-sum games; Alternating gradient descent-ascent; Performance estimation programming
Abstract:

We study the alternating gradient descent-ascent (AltGDA) algorithm in two-player zero-sum games. Alternating methods, in which players take turns updating their strategies, have long been recognized as simple and practical approaches for learning in games, exhibiting much better numerical performance than their simultaneous counterparts. However, our theoretical understanding of alternating algorithms remains limited, and results are mostly restricted to the unconstrained setting. We show that for two-player zero-sum games that admit an interior Nash equilibrium, AltGDA converges at an O(1/T) ergodic rate when employing a small constant stepsize. This is the first result showing that alternation improves over the simultaneous counterpart of GDA in the constrained setting. For games without an interior equilibrium, we show an O(1/T) local convergence rate with a constant stepsize that is independent of any game-specific constants. In a more general setting, we develop a performance estimation programming (PEP) framework to jointly optimize the AltGDA stepsize along with its worst-case convergence rate. The PEP results indicate that AltGDA may achieve an O(1/T) convergence rate over a finite horizon T, whereas its simultaneous counterpart appears limited to an O(1/√T) rate.
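As a concrete illustration of the behavior the abstract describes, the following sketch compares simultaneous and alternating GDA on the unconstrained scalar bilinear game f(x, y) = xy. This is only a toy stand-in (the paper's constrained setting and stepsize choices are not reproduced here): simultaneous GDA spirals outward, while AltGDA stays bounded and its ergodic (time-averaged) iterate approaches the equilibrium (0, 0).

```python
import math

def sim_gda(x, y, eta, T):
    """Simultaneous GDA on f(x, y) = x*y: both players use the stale iterate."""
    for _ in range(T):
        x, y = x - eta * y, y + eta * x  # y's ascent step sees the OLD x
    return x, y

def alt_gda(x, y, eta, T):
    """Alternating GDA: the y-player responds to the freshly updated x."""
    sum_x = sum_y = 0.0
    for _ in range(T):
        x = x - eta * y
        y = y + eta * x              # ascent step sees the NEW x
        sum_x += x
        sum_y += y
    return x, y, sum_x / T, sum_y / T  # last iterate and ergodic average

eta, T = 0.2, 1000
xs, ys = sim_gda(1.0, 1.0, eta, T)
xa, ya, xbar, ybar = alt_gda(1.0, 1.0, eta, T)

print(f"simultaneous last-iterate norm: {math.hypot(xs, ys):.2e}")   # blows up
print(f"alternating  last-iterate norm: {math.hypot(xa, ya):.2e}")   # bounded
print(f"alternating  averaged norm:     {math.hypot(xbar, ybar):.2e}")  # near 0
```

Note that the last iterate of AltGDA does not converge here (it cycles around the equilibrium); it is the averaged iterate whose distance to the equilibrium shrinks at the O(1/T) rate the abstract refers to.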

Disclaimer
This report is AI-generated using large language models and WisPaper (a scholar search engine). It analyzes a paper's tasks and contributions against retrieved prior work. While this system identifies potential overlaps and novel directions, its coverage is not exhaustive and its judgments are approximate. These results are intended to assist human reviewers and should not be relied upon as a definitive verdict on novelty.
Note that some papers exist in multiple, slightly different versions (e.g., with different titles or URLs). The system may retrieve several versions of the same underlying work. The current automated pipeline does not reliably align or distinguish these cases, so human reviewers will need to disambiguate them manually.
If you have any questions, please contact: mingzhang23@m.fudan.edu.cn

Overview

Taxonomy

Core-task Taxonomy Papers: 40
Claimed Contributions: 3
Contribution Candidate Papers Compared: 17
Refutable Papers: 0

Research Landscape Overview

Core task: convergence analysis of alternating gradient descent-ascent in bilinear games. The field organizes around several complementary perspectives on minimax optimization. Alternating update schemes form a central branch, examining how players take turns updating their strategies and the resulting convergence guarantees in bilinear and broader game settings. Simultaneous and optimistic gradient methods explore synchronous updates and lookahead mechanisms that can stabilize dynamics. Nonconvex-nonconcave minimax optimization tackles settings beyond the classical convex-concave regime, while Markov games and sequential decision settings extend the analysis to multi-agent reinforcement learning. Specialized algorithmic variants introduce momentum, variance reduction, and acceleration techniques, and theoretical frameworks develop tools such as performance estimation problems and co-coercivity conditions. Application-driven branches address constrained optimization and robust formulations that arise in practice.
Within alternating update schemes, a particularly active line of work investigates convergence rates in bilinear settings, where the minimax objective decomposes into a simple bilinear form. Alternating GDA Bilinear[0] contributes to this cluster by analyzing how alternating updates behave in such structured games, complementing earlier studies like Local Alternating GDA[3] that examined local convergence properties and Alternating Updates Benefit[4] that highlighted advantages of the alternating schedule. These works contrast with optimistic methods such as Optimistic GDA Bilinear[7] and extragradient approaches like Unified Extragradient Analysis[5], which use lookahead or correction steps to achieve faster or more stable convergence.
A recurring theme across these branches is the trade-off between algorithmic simplicity and convergence speed: alternating schemes are often straightforward to implement but may exhibit oscillatory behavior, whereas optimistic and simultaneous methods can offer improved rates at the cost of additional gradient evaluations or memory.

Claimed Contributions

O(1/T) ergodic convergence rate for AltGDA with interior Nash equilibrium

The authors prove that alternating gradient descent-ascent achieves an O(1/T) convergence rate in bilinear games with an interior Nash equilibrium using a constant stepsize. This is the first result showing that alternation improves over simultaneous GDA in the constrained setting.

2 retrieved papers

Local O(1/T) convergence rate with game-independent stepsize

The authors establish a local O(1/T) convergence rate for AltGDA in general bilinear games without requiring an interior equilibrium. The constant stepsize used does not depend on game-specific parameters.

10 retrieved papers

Performance estimation programming framework for joint stepsize and convergence rate optimization

The authors develop a PEP-based framework that simultaneously optimizes both the stepsize and worst-case convergence rate for AltGDA. This is the first instance of stepsize optimization for performance estimation problems involving primal-dual algorithms with linear operators.

5 retrieved papers
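The PEP framework itself solves semidefinite programs; as a loose numerical analogue (not the paper's method), one can estimate the worst-case size of AltGDA's averaged iterate on the scalar bilinear game f(x, y) = xy over all unit-norm starting points, and grid-search the stepsize that minimizes it. Because AltGDA on a bilinear game is a linear dynamical system z_{t+1} = M z_t, this worst case is exactly the largest singular value of the averaged iteration matrix.

```python
import numpy as np

def alt_gda_matrix(eta):
    # One AltGDA step on f(x, y) = x*y, written as z_{t+1} = M z_t:
    # x+ = x - eta*y;  y+ = y + eta*x+  =>  y+ = eta*x + (1 - eta^2)*y
    return np.array([[1.0, -eta],
                     [eta, 1.0 - eta**2]])

def worst_case_avg(eta, T):
    """Largest singular value of (1/T) * sum_{t=1}^{T} M^t, i.e. the
    worst-case norm of the averaged iterate over all unit-norm starts."""
    M = alt_gda_matrix(eta)
    S, P = np.zeros((2, 2)), np.eye(2)
    for _ in range(T):
        P = P @ M
        S += P
    return np.linalg.norm(S / T, 2)  # spectral norm

T = 50
grid = np.linspace(0.01, 0.9, 90)
rates = [worst_case_avg(eta, T) for eta in grid]
best = grid[int(np.argmin(rates))]

print(f"worst case at eta=0.01: {worst_case_avg(0.01, T):.3f}")
print(f"best stepsize on grid:  {best:.2f} (worst case {min(rates):.3f})")
```

The exhaustive search over a stepsize grid stands in for the joint stepsize/rate optimization the contribution describes; the actual PEP formulation certifies such worst cases over problem classes rather than a single fixed instance.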

Core Task Comparisons

Comparisons with papers in the same taxonomy category

Contribution Analysis

Detailed comparisons for each claimed contribution

Contribution 1

O(1/T) ergodic convergence rate for AltGDA with interior Nash equilibrium

The authors prove that alternating gradient descent-ascent achieves an O(1/T) convergence rate in bilinear games with an interior Nash equilibrium using a constant stepsize. This is the first result showing that alternation improves over simultaneous GDA in the constrained setting.

Contribution 2

Local O(1/T) convergence rate with game-independent stepsize

The authors establish a local O(1/T) convergence rate for AltGDA in general bilinear games without requiring an interior equilibrium. The constant stepsize used does not depend on game-specific parameters.

Contribution 3

Performance estimation programming framework for joint stepsize and convergence rate optimization

The authors develop a PEP-based framework that simultaneously optimizes both the stepsize and worst-case convergence rate for AltGDA. This is the first instance of stepsize optimization for performance estimation problems involving primal-dual algorithms with linear operators.