A Polynomial Algorithm for Density Estimation
Grace Wahba
Ann. Math. Statist. 42(6): 1870-1886 (December, 1971). DOI: 10.1214/aoms/1177693053

Abstract

An algorithm for density estimation based on ordinary polynomial (Lagrange) interpolation is studied. Let $F_n(x)$ be $n/(n + 1)$ times the sample c.d.f. based on $n$ order statistics, $t_1, t_2, \cdots, t_n$, from a population with density $f(x)$. It is assumed that $f^{(v)}$ is continuous for $v = 0, 1, 2, \cdots, r$, where $r = m - 1$, and that $f^{(m)} \in L_2(-\infty, \infty)$. $F_n(x)$ is first locally interpolated by the $m$th degree polynomial passing through $F_n(t_{ik_n}), F_n(t_{(i+1)k_n}), \cdots, F_n(t_{(i+m)k_n})$, where $k_n$ is a suitably chosen number depending on $n$. The density estimate is then, locally, the derivative of this interpolating polynomial. If $k_n = O(n^{(2m-1)/(2m)})$, then it is shown that the mean square convergence rate of the estimate to the true density is $O(n^{-(2m-1)/(2m)})$. Thus these convergence rates are slightly better than those obtained by the Parzen kernel-type estimates for densities with $r$ continuous derivatives. If it is assumed that $f^{(m)}$ is bounded, and $k_n = O(n^{2m/(2m+1)})$, then it is shown that the mean square convergence rates are $O(n^{-2m/(2m+1)})$, which are the same as those of the Parzen estimates for $m$ continuous derivatives. An interesting theorem about Lagrange interpolation, concerning how well a function can be interpolated knowing only its integral at nearby points, is also demonstrated.
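
The construction described in the abstract can be sketched in a few lines of Python. This is an illustrative reading of the abstract, not the paper's own implementation: the function name `polynomial_density_estimate`, the rule for choosing which $m + 1$ knots surround a given point, and the convention of returning zero outside the knot range are all assumptions made here. The spacing `k` (the abstract's $k_n$) is left to the caller, since the abstract gives only its order of growth in $n$.

```python
import numpy as np

def polynomial_density_estimate(sample, x, m, k):
    """Hypothetical sketch: differentiate a local Lagrange interpolant of F_n.

    sample : 1-D data; x : point(s) at which to estimate the density;
    m : polynomial degree; k : spacing between the order statistics used as knots.
    """
    t = np.sort(np.asarray(sample, dtype=float))    # order statistics t_1 <= ... <= t_n
    n = t.size
    ranks = np.arange(k, n + 1, k)                  # 1-based ranks k, 2k, 3k, ...
    knots = t[ranks - 1]                            # t_{ik}
    values = ranks / (n + 1.0)                      # F_n(t_j) = j / (n + 1)
    if knots.size < m + 1:
        raise ValueError("need at least m + 1 knots; decrease k or m")

    xs = np.atleast_1d(np.asarray(x, dtype=float))
    out = np.zeros_like(xs)                         # assumption: zero outside the knot range
    for idx, xv in enumerate(xs):
        if xv < knots[0] or xv > knots[-1]:
            continue
        # Assumption: use a window of m + 1 consecutive knots roughly centred on xv.
        j = np.searchsorted(knots, xv, side="right") - 1
        start = int(np.clip(j - m // 2, 0, knots.size - (m + 1)))
        sel = slice(start, start + m + 1)
        center = knots[sel].mean()                  # shift abscissae for numerical stability
        # A degree-m fit through exactly m + 1 distinct points is the interpolant.
        coeffs = np.polyfit(knots[sel] - center, values[sel], m)
        out[idx] = np.polyval(np.polyder(coeffs), xv - center)
    return out

# Usage: an evenly spaced "sample" on (0, 1), for which F_n is linear at the knots.
sample = np.arange(1, 100) / 100.0
estimate = polynomial_density_estimate(sample, [0.5, 2.0], m=2, k=10)
```

In this usage the knots fall where $F_n$ coincides with the identity, so the interpolant is linear and the estimate inside the knot range recovers the uniform density. Because the estimate is the derivative of a polynomial, it may take negative values for real data when $m \geq 2$.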

Citation

Grace Wahba. "A Polynomial Algorithm for Density Estimation." Ann. Math. Statist. 42 (6) 1870 - 1886, December, 1971. https://doi.org/10.1214/aoms/1177693053

Information

Published: December, 1971
First available in Project Euclid: 27 April 2007

zbMATH: 0226.62038
MathSciNet: MR303652
Digital Object Identifier: 10.1214/aoms/1177693053

Rights: Copyright © 1971 Institute of Mathematical Statistics

Vol.42 • No. 6 • December, 1971