Unsolved

Generalization-error estimation beyond Gaussian designs

Sourced from the work of Pierre C Bellec, Kai Tan

§ Problem Statement

Setup

Let (xi,yi)i=1n(x_i,y_i)_{i=1}^n be i.i.d. training data with xiRpx_i\in\mathbb R^p and

yi=xiβ0+εi,y_i=x_i^\top\beta_0+\varepsilon_i,

where β0Rp\beta_0\in\mathbb R^p is deterministic (or independent of the sample), E[εixi]=0\mathbb E[\varepsilon_i\mid x_i]=0, and E[εi2xi]<\mathbb E[\varepsilon_i^2\mid x_i]<\infty. Write XRn×pX\in\mathbb R^{n\times p} for the design matrix with rows xix_i^\top, and y=(y1,,yn)y=(y_1,\dots,y_n)^\top.

Assume a high-dimensional regime n,pn,p\to\infty with p/nγ(0,)p/n\to\gamma\in(0,\infty) under non-Gaussian random designs (e.g., sub-Gaussian or elliptical families with conditions sufficient for asymptotic normality arguments). For fixed iteration index tNt\in\mathbb N, let β^t=β^t(X,y)\hat\beta^t=\hat\beta^t(X,y) be the tt-th iterate of a first-order method (such as GD, proximal GD, or an accelerated variant) applied to least squares, possibly with a proper closed convex penalty. With independent test covariate xnewxix_{\mathrm{new}}\sim x_i, define

Rt:=E ⁣[(xnewβ^txnewβ0)2X,y].R_t:=\mathbb E\!\left[\left(x_{\mathrm{new}}^\top\hat\beta^t-x_{\mathrm{new}}^\top\beta_0\right)^2\mid X,y\right].

Unsolved Problem

For fixed tt, construct a data-driven estimator R^t\hat R_t of RtR_t that admits a valid n\sqrt n-scale limit law under non-Gaussian designs, with finite nondegenerate asymptotic variance and conditions for consistent variance estimation.

§ Discussion

Loading discussion…

§ Significance & Implications

The cited work studies uncertainty quantification for fixed-time iterates in high-dimensional linear models under Gaussian-design assumptions. A non-Gaussian extension remains a natural and practically important direction, but the exact scope of what is already proved beyond Gaussian settings should be treated cautiously pending a dedicated 2025-2026 literature check.

§ Known Partial Results

  • Bellec et al. (2024): Under Gaussian-design assumptions, the cited paper establishes n\sqrt n-scale uncertainty quantification for risk estimation at fixed iterate tt for several first-order algorithms. Open-status assessment for non-Gaussian designs requires dedicated literature verification.

§ References

[1]

Uncertainty quantification for iterative algorithms in linear models with application to early stopping

Pierre C Bellec, Kai Tan (2024)

Annals of Statistics (to appear)

📍 Section 5 (Discussion), exact paragraph citation to be pinned after direct full-text verification of the final published version.

Primary source motivating this formalization; non-Gaussian extension remains an open direction.

§ Tags