Convergence of single-timescale mean-field Langevin descent-ascent for two-player zero-sum games
Posed by Guillaume Wang et al. (2024)
§ Problem Statement
Setup
Let be the flat -torus with Lebesgue measure, and let be the set of Borel probability measures on . Fix and a smooth payoff . Define, for ,
where if has density w.r.t. Lebesgue measure and otherwise. In this setting has a unique saddle point (the entropy-regularized mixed Nash equilibrium).
Consider the single-timescale Wasserstein gradient descent-ascent (GDA) flow associated with : follows the Wasserstein gradient flow that decreases while follows the Wasserstein gradient flow that increases , using the same time parameter . For instance, when and have smooth positive densities, writing and , the formal PDE system is
with gradients and Laplacians on .
Unsolved Problem
For every smooth and every , do trajectories of this single-timescale Wasserstein GDA flow converge as (e.g. weakly in for each marginal) to the unique saddle point ?
§ Discussion
§ Significance & Implications
This asks for a qualitative long-time convergence result for a coupled descent-ascent flow in Wasserstein space that models the mean-field (infinite-particle) limit of Langevin descent-ascent in entropy-regularized two-player zero-sum games. A proof (or a counterexample) would clarify whether the natural single-timescale min-max dynamics is intrinsically stabilizing at the PDE/measure level, beyond regimes where one can enforce convergence by separating ascent and descent timescales.
§ Known Partial Results
Wang et al. (2024): The functional is entropy-regularized (via and ) and admits a unique saddle point , interpreted as the entropy-regularized mixed Nash equilibrium.
Wang et al. (2024): The associated Wasserstein gradient descent-ascent flow corresponds to the mean-field limit of a Langevin descent-ascent particle dynamics.
Wang et al. (2024): Convergence can be ensured by using different timescales for descent and ascent (a timescale-separated variant), but the single-timescale convergence question remains open for general smooth and .
Wang et al. (2024): The core difficulty is establishing (or refuting) global asymptotic convergence for this coupled min-max Wasserstein flow with currently available tools for long-time analysis in Wasserstein geometry.
§ References
Guillaume Wang, Lénaïc Chizat (2024)
Conference on Learning Theory (COLT), PMLR 247
📍 Open-problem note in COLT proceedings.
Guillaume Wang, Lénaïc Chizat (2024)
Conference on Learning Theory (COLT), PMLR 247
📍 Proceedings PDF.