Polynomial-time minimax robust mean estimation under star-shaped constraints
Sourced from the work of Akshay Prasadan, Matey Neykov
§ Problem Statement
Setup
Fix integers , a contamination level , a scale parameter , and a star-shaped set (that is, there exists such that for every and ). Let
For and , let be the maximal cardinality of a -packing of in Euclidean norm (pairwise distances ). For an absolute constant , define the local entropy
and define
This setup follows Prasadan & Neykov (2024).
Data model: there are unobserved clean samples , , with unknown . An adversary, after seeing all clean samples and the estimation procedure, outputs observed samples by changing at most coordinates arbitrarily. Denote by the class of all such contamination mechanisms. The estimator is any measurable function of .
Noise regimes:
-
Gaussian: .
-
Known-or-sign-symmetric sub-Gaussian: are iid mean-zero sub-Gaussian vectors with parameter at most , and either the full noise law is known or .
Unsolved Problem
- Unknown sub-Gaussian: are iid mean-zero sub-Gaussian vectors with parameter at most , with otherwise unknown law.
§ Discussion
§ Known Partial Results
Prasadan et al. (2024): This paper proves the above minimax rates information-theoretically and gives matching (but computationally hard) procedures in the stated small-contamination regime. Prior efficient methods either need stronger assumptions (e.g., symmetry/known structure) or are rate-suboptimal in general unknown-noise settings.
§ References
Information Theoretic Limits of Robust Sub-Gaussian Mean Estimation Under Star-Shaped Constraints
Akshay Prasadan, Matey Neykov (2024)
Annals of Statistics (to appear)
📍 Section 6 (Discussion and Future Work), first paragraph, which asks for "computationally efficient algorithms achieving the same performance under various constraints for the mean".
Primary source for this problem. Year convention here uses the initial arXiv preprint year (2024), not a later revision/acceptance publication year.