Structure-Agnostic Minimax Risk for Partial Linear Model
Posed by Yihong Gu (2025)
§ Problem Statement
Setup
Let be i.i.d., with and taking values in a measurable space . Assume the partial linear model
where is the target and is an unknown nuisance. Define
and the residualized treatment . Assume finite second moments and a nondegeneracy/conditioning bound
A cross-fitted DML estimator is: split indices into folds; for each , fit nuisance predictors and using data excluding (or excluding 's fold), evaluate them at , and compute
Structure-agnostic learnability assumption: there exist such cross-fitted predictors constructed from the samples for which, for an independent draw (independent of the training sample),
where the expectation averages over the sample and any algorithmic randomness.
Let be the set of all distributions over satisfying the model, the moment and bounds above, and for which there exist cross-fitted predictors achieving the stated errors from samples. Define the structure-agnostic minimax mean-squared error
where the infimum is over all estimators measurable w.r.t. the sample.
Unsolved Problem
Characterize sharply (up to universal constants and, if unavoidable, logarithmic factors) as a function of . In particular, determine whether attains the minimax rate uniformly over all regimes of under only the structure-agnostic learnability assumption; if not, determine the minimax rate and exhibit an estimator achieving it, clarifying how variance/conditioning through constrains what is achievable without additional structural assumptions on or .
§ Discussion
§ Significance & Implications
The problem asks for an information-theoretic benchmark for estimating the scalar coefficient in a partial linear model when the only quantitative control on nuisance learning is out-of-sample mean-squared prediction error bounds , with no smoothness/sparsity/parametric structure assumed. A sharp characterization of would pin down the best possible tradeoff between sample size, nuisance prediction accuracy, and treatment-residual conditioning , and would decide whether cross-fitted orthogonal/DML estimation is uniformly rate-optimal in this purely structure-agnostic regime. This directly impacts when black-box prediction guarantees alone justify the commonly used residualization-and-regression pipeline for semiparametric/causal effect estimation, versus when additional assumptions are necessary to control variance-driven limitations tied to the residualized treatment.
§ Known Partial Results
Gu (2025): The orthogonal estimating equation underlying DML is first-order insensitive to nuisance errors, so analyses typically reduce MSE control for to bounding higher-order remainder terms involving the nuisance estimation errors.
Chernozhukov et al. (2018): In many standard DML analyses, achieving -type accuracy (or sharper bounds on ) requires product-rate conditions on nuisance estimation (informally, that the combined effect of and is small enough) together with nondegeneracy of the residualized treatment.
Gu (2025): The COLT 2025 note (Gu, 2025) highlights a specific gap for structure-agnostic minimax lower bounds in this model: existing techniques do not cleanly capture how variance/conditioning, as reflected by and its second moment bounds, may limit (or allow) improvements beyond generic DML rates, leaving the sharp minimax risk characterization open even for estimating .
§ References
Open Problem: Structure-Agnostic Minimax Risk for Partial Linear Model
Yihong Gu (2025)
Conference on Learning Theory (COLT), PMLR 291
📍 Open-problem note in COLT proceedings.
Open Problem: Structure-Agnostic Minimax Risk for Partial Linear Model (PDF)
Yihong Gu (2025)
Conference on Learning Theory (COLT), PMLR 291
📍 Proceedings PDF.
Root-N-Consistent Semiparametric Regression
Peter M. Robinson (1988)
Econometrica
📍 Classical partially linear model results (root-n estimation under regularity conditions).
Double/debiased machine learning for treatment and structural parameters
Victor Chernozhukov, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey, James Robins (2018)
The Econometrics Journal
📍 Canonical DML framework with orthogonal scores and cross-fitting.