Unsolved

Sharp boundary for attainability in finite-sample location models

Sourced from the work of Spencer Compton, Gregory Valiant

§ Problem Statement

Setup

Let P0P_0 be a Borel probability distribution on R\mathbb{R} with finite first moment and centered so that xdP0(x)=0\int x\,dP_0(x)=0. For each shift parameter θR\theta\in\mathbb{R}, define the location family member PθP_\theta by Pθ(A)=P0(Aθ)P_\theta(A)=P_0(A-\theta) for all Borel sets ARA\subseteq\mathbb{R}. Given nn i.i.d. observations X1,,XnPθX_1,\dots,X_n\sim P_{\theta^\star} with unknown θ\theta^\star, an estimator is any measurable map θ^n:RnR\hat\theta_n:\mathbb{R}^n\to\mathbb{R}.

Define worst-case absolute-error risk

Rn(θ^n;P0)=supθREPθn ⁣[θ^nθ],R_n(\hat\theta_n;P_0)=\sup_{\theta\in\mathbb{R}}\mathbb{E}_{P_\theta^{\otimes n}}\!\left[\,|\hat\theta_n-\theta|\,\right],

and minimax risk Rn(P0)=infθ^nRn(θ^n;P0)R_n^\star(P_0)=\inf_{\hat\theta_n}R_n(\hat\theta_n;P_0).

For probability measures P,QP,Q on R\mathbb{R}, define squared Hellinger distance

H2(P,Q)=1dPdμdQdμdμ,H^2(P,Q)=1-\int \sqrt{\frac{dP}{d\mu}\frac{dQ}{d\mu}}\,d\mu,

where μ\mu is any dominating measure (this value is independent of μ\mu). Define the location-model Hellinger modulus for real effective sample size t1t\ge 1 by

ωP0(t)=sup{δ: H2(P0,Pδ)1t}.\omega_{P_0}(t)=\sup\left\{|\delta|:\ H^2(P_0,P_\delta)\le \frac{1}{t}\right\}.

(For integer t=mt=m, this matches the usual sample-size-mm two-point-testing benchmark up to universal constants via Le Cam lower bounds.)

Say the two-point rate is near-attainable for this fixed base distribution P0P_0 if there exist constants C>0C>0, a,b0a,b\ge 0, and estimators {θ^n}n2\{\hat\theta_n\}_{n\ge 2} such that for all n2n\ge 2,

Rn(θ^n;P0)C(logn)aωP0 ⁣(n(logn)b).R_n(\hat\theta_n;P_0)\le C(\log n)^a\,\omega_{P_0}\!\left(\frac{n}{(\log n)^b}\right).

Equivalently: for this P0P_0, estimation error matches the Hellinger-modulus lower bound up to polylogarithmic factors and a O~(n)\tilde O(n) effective sample-size loss.

Unsolved Problem

Determine a necessary-and-sufficient condition on P0P_0 for this near-attainability property to hold.

§ Discussion

Loading discussion…

§ Significance & Implications

Compton & Valiant (arXiv:2502.05730, v3 dated 2026-01-04) give positive and negative finite-sample attainability results in related shape-constrained settings, but do not provide a full iff characterization over base laws P0P_0 for this location-model criterion. A sharp criterion would complete the one-dimensional finite-sample picture and identify exactly which distributional geometries permit near-attainment of the two-point benchmark. This problem remains open under currently cited sources.

§ Known Partial Results

  • Compton et al. (2025): For the known-P0P_0 location setting, the paper proves near-attainability guarantees for unimodal base distributions under its stated theorem assumptions (up to polylog factors). Separately, its symmetric-distribution lower bound is existential/non-uniform over the class (for each sample size, there exist symmetric examples with large gaps from the two-point benchmark), so it should not be read as saying every symmetric P0P_0 fails near-attainability. A complete necessary-and-sufficient characterization for fixed P0P_0 is not provided there and remains open.

§ References

[1]

Attainability of Two-Point Testing Rates for Finite-Sample Location Estimation

Spencer Compton, Gregory Valiant (2025)

Annals of Statistics (to appear)

📍 Section 6 (Discussion), avenue "Adaptive location estimation for more general distributions", p. 56 (arXiv v3, 2026-01-04)

Primary source paper discussing this open direction.

§ Tags