The Annals of Statistics

Statistical inferences for functional data

Jin-Ting Zhang and Jianwei Chen

Full-text: Open access

Abstract

With modern technology development, functional data are being observed frequently in many scientific fields. A popular method for analyzing such functional data is “smoothing first, then estimation.” That is, statistical inference such as estimation and hypothesis testing about functional data is conducted based on the substitution of the underlying individual functions by their reconstructions obtained by one smoothing technique or another. However, little is known about this substitution effect on functional data analysis. In this paper this problem is investigated when the local polynomial kernel (LPK) smoothing technique is used for individual function reconstructions. We find that under some mild conditions, the substitution effect can be ignored asymptotically. Based on this, we construct LPK reconstruction-based estimators for the mean, covariance and noise variance functions of a functional data set and derive their asymptotics. We also propose a GCV rule for selecting good bandwidths for the LPK reconstructions. When the mean function also depends on some time-independent covariates, we consider a functional linear model where the mean function is linearly related to the covariates but the covariate effects are functions of time. The LPK reconstruction-based estimators for the covariate effects and the covariance function are also constructed and their asymptotics are derived. Moreover, we propose a L2-norm-based global test statistic for a general hypothesis testing problem about the covariate effects and derive its asymptotic random expression. The effect of the bandwidths selected by the proposed GCV rule on the accuracy of the LPK reconstructions and the mean function estimator is investigated via a simulation study. The proposed methodologies are illustrated via an application to a real functional data set collected in climatology.

Article information

Source
Ann. Statist., Volume 35, Number 3 (2007), 1052-1079.

Dates
First available in Project Euclid: 24 July 2007

Permanent link to this document
https://projecteuclid.org/euclid.aos/1185303998

Digital Object Identifier
doi:10.1214/009053606000001505

Mathematical Reviews number (MathSciNet)
MR2341698

Zentralblatt MATH identifier
1129.62029

Subjects
Primary: 62G07: Density estimation
Secondary: 62G10: Hypothesis testing 62J12: Generalized linear models

Keywords
Asymptotic Gaussian process asymptotic normal distribution functional data hypothesis test local polynomial smoothing nonparametric estimation reconstructed individual functions root-n consistent

Citation

Zhang, Jin-Ting; Chen, Jianwei. Statistical inferences for functional data. Ann. Statist. 35 (2007), no. 3, 1052--1079. doi:10.1214/009053606000001505. https://projecteuclid.org/euclid.aos/1185303998


Export citation

References

  • Besse, P. (1992). PCA stability and choice of dimensionality. \StatProb 13 405–410.
  • Besse, P., Cardot, H. and Ferraty, F. (1997). Simultaneous nonparametric regressions of unbalanced longitudinal data. \Comp 24 255–270.
  • Besse, P. and Ramsay, J. O. (1986). Principal components analysis of sampled functions. Psychometrika 51 285–311.
  • Brumback, B. and Rice, J. A. (1998). Smoothing spline models for the analysis of nested and crossed samples of curves (with discussion). \JASA 93 961–994.
  • Buckley, M. J. and Eagleson, G. K. (1988). An approximation to the distribution of quadratic forms in normal random variables. \AJStat 30A 150–159.
  • Canadian Climate Program (1982). Canadian Climate Normals 1981–1980. Environment Canada, Ottawa.
  • Eubank, R. L. (1999). Nonparametric Regression and Spline Smoothing, 2nd ed. Dekker, New York.
  • Fan, J. (1992). Design-adaptive nonparametric regression. \JASA 87 998–1004.
  • Fan, J. (1993). Local linear regression smoothers and their minimax efficiencies. \AnnStat 21 196–216.
  • Fan, J. and Gijbels, I. (1996). Local Polynomial Modelling and Its Applications. Chapman and Hall, London.
  • Fan, J. and Lin, S.-K. (1998). Test of significance when data are curves. \JASA 93 1007–1021.
  • Fan, J. and Yao, Q. (1998). Efficient estimation of conditional variance functions in stochastic regression. \Biometk 85 645–660.
  • Faraway, J. J. (1997). Regression analysis for a functional response. Technometrics 39 254–261.
  • Green, P. J. and Silverman, B. W. (1994). Nonparametric Regression and Generalized Linear Models. A Roughness Penalty Approach. Chapman and Hall, London.
  • Hall, P. and Marron, J. S. (1990). On variance estimation in nonparametric regression. \Biometk 77 415–419.
  • Hart, J. D. and Wehrly, T. E. (1986). Kernel regression estimation using repeated measurements data. \JASA 81 1080–1088.
  • Kneip, A. (1994). Nonparametric estimation of common regressors for similar curve data. \AnnStat 22 1386–1427.
  • Kneip, A. and Engel, J. (1995). Model estimation in nonlinear regression under shape invariance. \AnnStat 23 551–570.
  • Kneip, A. and Gasser, T. (1992). Statistical tools to analyze data representing a sample of curves. \AnnStat 20 1266–1305.
  • Ramsay, J. O. (1995). Some tools for the multivariate analysis of functional data. In Recent Advances in Descriptive Multivariate Analysis (W. Krzanowski, ed.) 269–282. Oxford Univ. Press, New York.
  • Ramsay, J. O. and Dalzell, C. J. (1991). Some tools for functional data analysis (with discussion). \JRSSB 53 539–572.
  • Ramsay, J. O. and Li, X. (1998). Curve registration. J. R. Stat. Soc. Ser. B Stat. Methodol. 60 351–363.
  • Ramsay, J. O. and Silverman, B. W. (1997). Functional Data Analysis. Springer, New York.
  • Ramsay, J. O. and Silverman, B. W. (2002). Applied Functional Data Analysis. Methods and Case Studies. Springer, New York.
  • Rice, J. A. and Silverman, B. W. (1991). Estimating the mean and covariance structure nonparametrically when the data are curves. \JRSSB 53 233–243.
  • Silverman, B. W. (1995). Incorporating parametric effects into principal components analysis. \JRSSB 57 673–689.
  • Wahba, G. (1990). Spline Models for Observational Data. SIAM, Philadelphia.
  • Wand, M. P. and Jones, M. C. (1995). Kernel Smoothing. Chapman and Hall, London.
  • Zhang, J.-T. (2005). Approximate and asymptotic distributions of chi-squared-type mixtures with applications. \JASA 100 273–285.