Optimal Distribution-Free Prediction Intervals
§ Problem Statement
Setup
Let , , and let be the class of all Borel probability distributions on . For any , draw i.i.d. pairs , where and . A (possibly randomized) prediction-interval procedure is a measurable map sending and a test covariate to an interval , with length .
Finite-sample marginal validity under exchangeability at level is
Define
For unrestricted , the objective is degenerate: (equivalently, no finite uniform expected-length guarantee over all Borel laws).
Unsolved Problem
Pose and solve the minimax question on a restricted class (e.g., moment/tail, noise, smoothness, or shape constraints):
Determine sharp rates/constants of in and whether computationally efficient procedures attain them.
Now separate conditional targets:
- Exact conditional coverage (known impossible distribution-free):
- Relaxed conditional coverage, e.g. -approximate conditional coverage:
Characterize minimax-optimal length under such relaxed conditional criteria (or other precisely specified local/averaged variants) and compare conformal-type procedures to minimax lower bounds.
See Vovk et al. (2005) for conformal prediction, Lei & Wasserman (2014), Romano et al. (2019), Barber et al. (2021), and Gibbs & Candès (2021).
§ Discussion
§ Significance & Implications
Distribution-free predictive inference is central in statistics and machine learning. Conformal methods provide finite-sample marginal validity under exchangeability, but practical guarantees depend on finite-sample score calibration choices (typically conservative at the scale due to quantile discretization/tie handling). Exact conditional coverage is impossible without additional assumptions, so the key frontier is sharp efficiency and computational optimality under explicit structural restrictions and under well-defined relaxed conditional targets.
§ Known Partial Results
Lei et al. (2014): Unrestricted minimax objective under only distribution-free marginal validity is degenerate: the worst-case expected length over all Borel laws is infinite, so meaningful minimax analysis requires restricting .
Vovk et al. (2005): conformal prediction gives finite-sample marginal validity under exchangeability (with finite-sample calibration/discretization caveats).
Lei & Wasserman (2014): early distribution-free prediction-band constructions and analysis for nonparametric regression settings.
Romano et al. (2019): conformalized quantile regression with finite-sample marginal validity under exchangeability.
Barber et al. (2021): exact conditional coverage is impossible distribution-free except with vacuous/very wide intervals.
Gibbs & Candès (2021): adaptive conformal methods for certain distribution-shift regimes, outside exact conditional guarantees.
§ References
Distribution-free prediction bands for non-parametric regression
Jing Lei, Larry Wasserman (2014)
Journal of the Royal Statistical Society: Series B (Statistical Methodology)
📍 JRSSB 76(1):71-96; methodology and guarantees in Sections 2-3, discussion/open questions in Section 6.
The limits of distribution-free conditional predictive inference
Rina Foygel Barber, Emmanuel Candès, Aaditya Ramdas, Ryan Tibshirani (2021)
Information and Inference: A Journal of the IMA
Conformalized Quantile Regression
Yaniv Romano, Evan Patterson, Emmanuel Candès (2019)
Advances in Neural Information Processing Systems (NeurIPS 32)
📍 NeurIPS 2019 paper; Algorithm 1 and Theorem 1 (finite-sample marginal validity under exchangeability).
Algorithmic Learning in a Random World
Vladimir Vovk, Alexander Gammerman, Glenn Shafer (2005)
Springer
📍 Monograph introducing conformal prediction and finite-sample validity under exchangeability (early chapters).
Adaptive Conformal Inference Under Distribution Shift
Isaac Gibbs, Emmanuel Candès (2021)
Advances in Neural Information Processing Systems (NeurIPS 34)
📍 Adaptive conformal calibration for covariate/distribution shift settings.