Open Access
2013 Semiparametric estimation of a two-component mixture of linear regressions in which one component is known
L. Bordes, I. Kojadinovic, P. Vandekerkhove
Electron. J. Statist. 7: 2603-2644 (2013). DOI: 10.1214/13-EJS858

Abstract

A new estimation method for the two-component mixture model introduced in [29] is proposed. This model consists of a two-component mixture of linear regressions in which one component is entirely known while the proportion, the slope, the intercept and the error distribution of the other component are unknown. In spite of good performance for datasets of reasonable size, the method proposed in [29] suffers from a serious drawback when the sample size becomes large as it is based on the optimization of a contrast function whose pointwise computation requires $O(n^{2})$ operations. The range of applicability of the method derived in this work is substantially larger as it relies on a method-of-moments estimator free of tuning parameters whose computation requires $O(n)$ operations. From a theoretical perspective, the asymptotic normality of both the estimator of the Euclidean parameter vector and of the semiparametric estimator of the c.d.f. of the error is proved under weak conditions not involving zero-symmetry assumptions. In addition, an approximate confidence band for the c.d.f. of the error can be computed using a weighted bootstrap whose asymptotic validity is proved. The finite-sample performance of the resulting estimation procedure is studied under various scenarios through Monte Carlo experiments. The proposed method is illustrated on three real datasets of size $n=150$, 51 and 176,343, respectively. Two extensions of the considered model are discussed in the final section: a model with an additional scale parameter for the first component, and a model with more than one explanatory variable.

Citation

Download Citation

L. Bordes. I. Kojadinovic. P. Vandekerkhove. "Semiparametric estimation of a two-component mixture of linear regressions in which one component is known." Electron. J. Statist. 7 2603 - 2644, 2013. https://doi.org/10.1214/13-EJS858

Information

Published: 2013
First available in Project Euclid: 23 October 2013

zbMATH: 1294.62151
MathSciNet: MR3121625
Digital Object Identifier: 10.1214/13-EJS858

Subjects:
Primary: 62J05
Secondary: 62G08

Keywords: asymptotic normality , Identifiability , Linear regression , method of moments , mixture , multiplier central limit theorem , weighted bootstrap

Rights: Copyright © 2013 The Institute of Mathematical Statistics and the Bernoulli Society

Back to Top