Theory for Debiased PCR Under General Covariate Models
Sourced from the work of Yufan Li, Pragya Sur
§ Problem Statement
Setup
Let be independent observations from the linear model
where is unknown, satisfies and (not assumed to be right-rotationally invariant), and is independent of with , . Write for the design matrix (rows ) and for the response vector.
Let the singular value decomposition of be , with rank , singular values , and right singular vectors . For a fixed truncation level , define , , , and the rank- principal components regression estimator
equivalently the least-squares estimator constrained to .
Consider a spectrum-aware debiased PCR estimator of the form
where is a data-dependent matrix built from the empirical spectrum/eigenstructure of to correct truncation and regularization bias.
For a contrast vector , define the conditional centering and scale
The
Unsolved Problem
Give sharp, verifiable conditions on , dimension growth , and contrast classes (for example, deterministic or random with bounded norm/sparsity) under which, for general non-rotationally-invariant designs,
and to construct plug-in estimators (and, if needed, ) that are consistent so that asymptotically valid confidence intervals for follow uniformly over the stated contrast class.
§ Discussion
§ Significance & Implications
PCR is widely used in low-rank/high-collinearity settings; inference is often the bottleneck. A general theory would convert debiased PCR from a first construction into a robust inference tool across modern correlated designs. See Li & Sur (2023) for details.
§ Known Partial Results
Li et al. (2023): The problem remains open in the cited source. The abstract claims the first debiased PCR estimator in high dimensions as a by-product of spectrum-aware debiasing.
§ References
Yufan Li, Pragya Sur (2023)
Annals of Statistics (to appear)
📍 Appendix I (Conjectures for general covariate models), Conjecture I.1
Source paper where this open problem is discussed.