Outlier characterization for unbounded link functions (e.g., phase retrieval)
Sourced from the work of Gerard Ben Arous, Reza Gheissari, Jiaoyang Huang, Aukosh Jagannath
§ Problem Statement
Setup
Let with . For each , let be i.i.d. samples from a -component Gaussian mixture
where are deterministic class means with . Let be deterministic with , and define the projected feature .
This setup follows Arous et al. (2025).
Consider random matrices of Hessian/information type
where is measurable and may be unbounded (for example, growing polynomially; in phase retrieval-type models one gets in the single-index case ). Let and . Assume the empirical spectral distribution of converges almost surely to a deterministic law whose support is not bounded above.
Unsolved Problem
In bounded-support settings, outliers are characterized by eigenvalues separating to the right of the finite bulk edge. Here no finite right edge exists. Motivated by the open-direction discussion in the source paper, formulate a replacement theory in this unbounded-support regime:
-
Give necessary and sufficient conditions, in terms of (equivalently the finite-dimensional Gram data of signal directions plus the link-induced moments), for existence of finitely many spike-generated eigenvalues of that are spectrally distinguishable from the background spectrum despite .
-
Prove deterministic asymptotic formulas for those eigenvalues and for eigenvector alignment with , i.e. limits of quantities like for corresponding unit eigenvectors .
-
Identify an appropriate notion of "isolation" when the bulk has unbounded support (for example, separation from the unspiked comparison model at the relevant extreme-value scale) and establish a BBP-type phase transition criterion under that notion.
See Local geometry of high-dimensional mixture models: Effective spectral theory and dynamical transitions (arXiv:2502.15655v3) for context.
§ Discussion
§ Significance & Implications
Unbounded-link models (including phase-retrieval-type examples) arise naturally, while rigorous outlier characterizations in this line of work mainly rely on a finite right bulk edge. Extending those results to unbounded-support regimes would broaden the currently analyzable model class.
§ Known Partial Results
Arous et al. (2025): The paper develops effective bulk analysis beyond uniformly bounded links, but its outlier arguments are tied to separation past a finite right bulk edge; a full unbounded-support outlier theory is left open (as of February 25, 2025, in arXiv:2502.15655v3).
§ References
Gerard Ben Arous, Reza Gheissari, Jiaoyang Huang, Aukosh Jagannath (2025)
Annals of Statistics (to appear)
📍 arXiv v3, Section 1.5.2 (Parametric regression for the multi-index model), Remark 1.15
Source paper; Section 1.5.2 and Remark 1.15 motivate the unbounded-link outlier characterization as an open direction.