Open Access
2019 Semiparametric density testing in the contamination model
Denys Pommeret, Pierre Vandekerkhove
Electron. J. Statist. 13(2): 4743-4793 (2019). DOI: 10.1214/19-EJS1650

Abstract

In this paper we investigate a semiparametric testing approach to answer if the parametric family allocated to the unknown density of a two-component mixture model with one known component is correct or not. Based on a semiparametric estimation of the Euclidean parameters of the model (free from the null assumption), our method compares pairwise the Fourier’s type coefficients of the model estimated directly from the data with the ones obtained by plugging the estimated parameters into the mixture model. These comparisons are incorporated into a sum of square type statistic which order is controlled by a penalization rule. We prove under mild conditions that our test statistic is asymptotically $\chi^{2}_{1}$-distributed and study its behavior, both numerically and theoretically, under different types of alternatives including contiguous nonparametric alternatives. We discuss the counterintuitive, from the practitioner point of view, lack of power of the maximum likelihood version of our test in a neighborhood of challenging non-identifiable situations. Several level and power studies are numerically conducted on models close to those considered in the literature, such as in McLachlan et al. [21], to validate the suitability of our approach. We also implement our testing procedure on the Carina galaxy real dataset which low luminosity mixes with the one of its companion Milky Way. Finally we discuss possible extensions of our work to a wider class of contamination models.

Citation

Download Citation

Denys Pommeret. Pierre Vandekerkhove. "Semiparametric density testing in the contamination model." Electron. J. Statist. 13 (2) 4743 - 4793, 2019. https://doi.org/10.1214/19-EJS1650

Information

Received: 1 April 2019; Published: 2019
First available in Project Euclid: 28 November 2019

zbMATH: 07147364
MathSciNet: MR4038725
Digital Object Identifier: 10.1214/19-EJS1650

Subjects:
Primary: 62F03
Secondary: 62G10

Keywords: Chi-squared test , contamination model , maximum likelihood estimator , nonparametric contiguous alternative , semiparametric estimator

Vol.13 • No. 2 • 2019
Back to Top