Minimax Rate for Wasserstein Distance Estimation in High Dimensions
§ Problem Statement
Setup
Fix , , and sample sizes . Let be the set of all Borel probability measures on . For , define the -Wasserstein distance
where is the set of all couplings of and .
Assume we observe independent samples and , with the -sample independent of the -sample. An estimator of is any measurable map . Its worst-case squared-error risk over is
Define the minimax risk
Unsolved Problem
Determine the sharp dependence of on (up to constants, and logarithmic factors where unavoidable) for the full class . A key distinction is between: (1) rates for empirical-measure approximation ( and ), and (2) rates for direct estimation of the functional from samples. In high dimension (notably regimes like ), does the empirical plug-in estimator
achieve minimax-optimal squared risk over unrestricted , or can one do strictly better (possibly under additional smoothness/separation assumptions)? Current literature does not resolve this for the fully nonparametric class.
§ Discussion
§ Significance & Implications
Optimal transport and Wasserstein distances are central in statistics and machine learning. For unrestricted high-dimensional distributions, empirical Wasserstein convergence shows strong dimensional effects (with regime changes around ), but those results do not by themselves settle minimax optimality for direct estimation of under squared loss. Clarifying this gap is important both theoretically and for practical sample-complexity guidance.
§ Known Partial Results
Weed et al. (2019): Empirical-measure estimation (not direct functional estimation): Weed & Bach (2019) gives sharp benchmark rates for over broad classes on , with regime split at : typically for , at , and for (for unsquared loss; squared-loss exponents double).
Niles-Weed et al. (2022): Plug-in implications for : triangle-inequality arguments transfer empirical-measure upper bounds to , but these are indirect and do not by themselves prove minimax optimality for direct functional estimation under squared risk.
Niles-Weed et al. (2022): Direct Wasserstein-functional estimation: Niles-Weed & Rigollet (2022) provides model-based lower/upper bounds (spiked transport model) showing nontrivial gaps and that structure can help, without resolving the unrestricted minimax rate over all .
Manole et al. (2024): Smooth-cost / separation regimes: Manole & Niles-Weed (2024) establishes sharp empirical OT rates under smooth-cost regularity assumptions; these results clarify favorable structured regimes but do not close the general worst-case two-sample functional-estimation question.
Niles-Weed et al. (2022): Current status: the full minimax characterization for estimating over unrestricted remains open, especially in high-dimensional regimes such as .
§ References
Estimation of Wasserstein distances in the spiked transport model
Jonathan Niles-Weed, Philippe Rigollet (2022)
Bernoulli
📍 Section 3.3 (Lower bounds), paragraph after Theorem 3 (states that closing the rate gap for Wasserstein-distance estimation is a fundamental open question).
Sharp convergence rates for empirical optimal transport with smooth costs
Tudor Manole, Jonathan Niles-Weed (2024)
Annals of Applied Probability
Jonathan Weed, Francis Bach (2019)
Bernoulli
📍 Sections 1 and 3 (dimension-dependent rates for empirical-measure convergence in $W_p$, including low-dimensional boundary-log behavior).