Open Access
December 2004 Model selection for Gaussian regression with random design
Lucien Birgé
Bernoulli 10(6): 1039-1051 (December 2004). DOI: 10.3150/bj/1106314849

Abstract

This paper is concerned with Gaussian regression with random design, where the observations are independent and identically distributed. It is known from work by Le Cam that the rate of convergence of optimal estimators is closely connected to the metric structure of the parameter space with respect to the Hellinger distance. In particular, this metric structure essentially determines the risk when the loss function is a power of the Hellinger distance. For random design regression, one typically uses as loss function the squared L2-distance between the estimator and the parameter. If the parameter space is bounded with respect to the L∞-norm, both distances are equivalent. Without this assumption, it may happen that there is a large distortion between the two distances, resulting in some unusual rates of convergence for the squared L2-risk, as noticed by Baraud. We explain this phenomenon and then show that the use of the Hellinger distance instead of the L2-distance allows us to recover the usual rates and to carry out model selection in great generality. An extension to the L2-risk is given under a boundedness assumption similar to that given by Wegkamp and by Yang.
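
The relation between the two distances mentioned in the abstract can be illustrated by a short computation. What follows is a sketch under stated assumptions, not the paper's own derivation: the notation (regression functions s and t, design distribution \mu, noise variance \sigma^2, sup-norm bound B) and the normalization h^2(P,Q) = \frac{1}{2}\int(\sqrt{dP}-\sqrt{dQ})^2 are choices made here for illustration.

```latex
% Assumed model: Y = s(X) + \varepsilon, with X \sim \mu and \varepsilon \sim N(0, \sigma^2) independent of X.
% P_s denotes the joint law of (X, Y). The Hellinger affinity of N(a, \sigma^2) and N(b, \sigma^2)
% is \exp(-(a-b)^2 / (8\sigma^2)), so that
\[
  h^2(P_s, P_t) \;=\; \int \Bigl[\, 1 - \exp\Bigl( -\frac{(s(x) - t(x))^2}{8\sigma^2} \Bigr) \Bigr] \, d\mu(x).
\]
% Upper bound: 1 - e^{-u} \le u for all u \ge 0, hence
\[
  h^2(P_s, P_t) \;\le\; \frac{\| s - t \|_2^2}{8\sigma^2},
  \qquad \| s - t \|_2^2 = \int (s - t)^2 \, d\mu .
\]
% Lower bound under the assumption \| s - t \|_\infty \le B: by concavity of u \mapsto 1 - e^{-u},
% one has 1 - e^{-u} \ge u \, (1 - e^{-U}) / U for 0 \le u \le U, applied with U = B^2 / (8\sigma^2):
\[
  h^2(P_s, P_t) \;\ge\; \frac{1 - e^{-U}}{U} \cdot \frac{\| s - t \|_2^2}{8\sigma^2},
  \qquad U = \frac{B^2}{8\sigma^2}.
\]
```

Under these assumptions the two quantities are equivalent up to a constant depending only on B and \sigma. Without a sup-norm bound, h^2 never exceeds 1 while the squared L2-distance can be arbitrarily large, which is one way to see how a large distortion between the two distances can arise.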

Citation

Lucien Birgé. "Model selection for Gaussian regression with random design." Bernoulli 10 (6) 1039 - 1051, December 2004. https://doi.org/10.3150/bj/1106314849

Information

Published: December 2004
First available in Project Euclid: 21 January 2005

zbMATH: 1064.62030
MathSciNet: MR2108042
Digital Object Identifier: 10.3150/bj/1106314849

Keywords: Besov spaces, Hellinger distance, minimax risk, model selection, random design regression

Rights: Copyright © 2004 Bernoulli Society for Mathematical Statistics and Probability
