Estimating minimum effect with outlier selection

Alexandra Carpentier; Sylvain Delattre; Etienne Roquain; Nicolas Verzelen

doi:10.1214/20-AOS1956

Article Dans Une Revue Annals of Statistics Année : 2021

Estimating minimum effect with outlier selection

(1) , (2) , (3) , (4)

1
2
3
4

Alexandra Carpentier

Fonction : Auteur
PersonId : 910455

Otto-von-Guericke-Universität Magdeburg = Otto-von-Guericke University [Magdeburg]

Sylvain Delattre

Fonction : Auteur

Laboratoire de Probabilités et Modèles Aléatoires

Etienne Roquain

Fonction : Auteur
PersonId : 861126
IdHAL : etienne-roquain
IdRef : 121165418

Laboratoire de Probabilités, Statistique et Modélisation

Nicolas Verzelen

Fonction : Auteur
PersonId : 737715
IdHAL : nicolas-verzelen
ORCID : 0009-0009-3411-0076
IdRef : 137391293

Mathématiques, Informatique et STatistique pour l'Environnement et l'Agronomie

Résumé

We introduce one-sided versions of Huber's contamination model, in which corrupted samples tend to take larger values than uncorrupted ones. Two intertwined problems are addressed: estimation of the mean of the uncorrupted samples (minimum effect) and selection of the corrupted samples (outliers). Regarding estimation of the minimum effect, we derive the minimax risks and introduce estimators that are adaptive with respect to the unknown number of contaminations. The optimal convergence rates differ from the ones in the classical Huber contamination model. This fact uncovers the effect of the one-sided structural assumption of the contaminations. As for the problem of selecting the outliers, we formulate the problem in a multiple testing framework for which the location and scaling of the null hypotheses are unknown. We rigorously prove that estimating the null hypothesis while maintaining a theoretical guarantee on the amount of the falsely selected outliers is possible, both through false discovery rate (FDR) and through post hoc bounds. As a by-product, we address a long-standing open issue on FDR control under equi-correlation, which reinforces the interest of removing dependency in such a setting.

Mots clés

Contamination equicorrelation false discovery rate Hermite polynomials minimax rate moment matching multiple testing post hoc selective inference sparsity

Domaines

Mathématiques [math] Probabilités [math.PR]

Fichier principal

Postprint estimating mnimum effect 2021.pdf (1.2 Mo)

Origine	Fichiers éditeurs autorisés sur une archive ouverte

Christelle Raynaud : Connectez-vous pour contacter le contributeur

https://hal.inrae.fr/hal-03173293

Soumis le : mercredi 1 juin 2022-13:34:25

Dernière modification le : mercredi 30 octobre 2024-13:33:58

Archivage à long terme le : vendredi 2 septembre 2022-19:07:46

Dates et versions

hal-03173293 , version 1 (01-06-2022)

Identifiants

HAL Id : hal-03173293 , version 1
ARXIV : 1809.08330
DOI : 10.1214/20-AOS1956
WOS : 000614187400012

Citer

Alexandra Carpentier, Sylvain Delattre, Etienne Roquain, Nicolas Verzelen. Estimating minimum effect with outlier selection. Annals of Statistics, 2021, 49 (1), pp.272-294. ⟨10.1214/20-AOS1956⟩. ⟨hal-03173293⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PMA CNRS INSMI LPSM SORBONNE-UNIVERSITE SU-SCIENCES INSTITUT-AGRO-MONTPELLIER INRAE UP-SCIENCES INRAEOCCITANIEMONTPELLIER MISTEA ANR MATHNUM RESEAU-EAU

255 Consultations

90 Téléchargements

Estimating minimum effect with outlier selection

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager