Quantifying the cost of simultaneous non-parametric approximation of several samples

P.L. Davies; A. Kovac

doi:10.1214/08-EJS298

Electron. J. Statist. 3: 747-780 (2009). DOI: 10.1214/08-EJS298

Abstract

We consider the standard non-parametric regression model with Gaussian errors but where the data consist of different samples. The question to be answered is whether the samples can be adequately represented by the same regression function. To do this we define for each sample a universal, honest and non-asymptotic confidence region for the regression function. Any subset of the samples can be represented by the same function if and only if the intersection of the corresponding confidence regions is non-empty. If the empirical supports of the samples are disjoint then the intersection of the confidence regions is always non-empty and a negative answer can only be obtained by placing shape or quantitative smoothness conditions on the joint approximation, or by making additional assumptions about the support points. Alternatively, a simplest joint approximation function can be calculated which gives a measure of the cost of the joint approximation, for example, the number of extra peaks required.
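The role of the empirical supports can be illustrated with a small Python sketch. It is not the authors' procedure: the abstract does not state the form of the confidence region, so the sketch assumes a multiscale residual condition in which, for every interval I of consecutive observations, the absolute sum of residuals is at most sigma * sqrt(tau * log(n) * |I|). The functions in_region and count_peaks, the constant tau = 3.0 and the toy data are all hypothetical. The sketch tests only a few candidate functions, so a failure in the shared-support case is suggestive rather than a proof that the intersection is empty.

```python
import math


def in_region(y, g, sigma, tau=3.0):
    """Check an ASSUMED multiscale condition, not the paper's exact region:
    |sum over I of (y_i - g_i)| <= sigma * sqrt(tau * log(n) * |I|)
    for every interval I of consecutive observations."""
    n = len(y)
    bound = sigma * math.sqrt(tau * math.log(n))
    # Prefix sums of residuals give every interval sum as a difference.
    prefix = [0.0]
    for yi, gi in zip(y, g):
        prefix.append(prefix[-1] + (yi - gi))
    for i in range(n):
        for j in range(i + 1, n + 1):
            if abs(prefix[j] - prefix[i]) > bound * math.sqrt(j - i):
                return False
    return True


def count_peaks(g):
    """Count strict interior local maxima of a sequence of function values."""
    return sum(1 for k in range(1, len(g) - 1) if g[k - 1] < g[k] > g[k + 1])


# Toy data (hypothetical): two samples with levels 0 and 1 and small
# deterministic "noise" so that the example is reproducible.
sigma = 0.1
noise = [0.1 * (-1) ** i for i in range(10)]
y1 = [0.0 + e for e in noise]
y2 = [1.0 + e for e in noise]

# Case 1: both samples observed at the same ten design points.
# A joint function takes one value per point, so try a few flat candidates.
candidates = [[0.0] * 10, [0.5] * 10, [1.0] * 10]
same_support_ok = any(
    in_region(y1, g, sigma) and in_region(y2, g, sigma) for g in candidates
)

# Case 2: disjoint supports. Sample 1 sits at even t, sample 2 at odd t,
# so a joint function may oscillate between the two levels.
joint = [0.0 if t % 2 == 0 else 1.0 for t in range(20)]
disjoint_ok = in_region(y1, joint[0::2], sigma) and in_region(y2, joint[1::2], sigma)

print("same support, some candidate in both regions:", same_support_ok)
print("disjoint support, oscillating function in both regions:", disjoint_ok)
print("peaks of the oscillating joint function:", count_peaks(joint))
```

In this toy setting the oscillating function lies in both assumed regions, but only by introducing many peaks, which is the kind of cost the abstract proposes to measure.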

Citation

P.L. Davies, A. Kovac. "Quantifying the cost of simultaneous non-parametric approximation of several samples." Electron. J. Statist. 3: 747-780, 2009. https://doi.org/10.1214/08-EJS298