## The Annals of Statistics

### Hypothesis testing on linear structures of high-dimensional covariance matrix

#### Abstract

This paper is concerned with tests of significance on high-dimensional covariance structures and aims to develop a unified framework for testing commonly used linear covariance structures. We first construct a consistent estimator for the parameters involved in the linear covariance structure, and then develop two tests for linear covariance structures based on the entropy loss and quadratic loss used for covariance matrix estimation. To study the asymptotic properties of the proposed tests, we study related high-dimensional random matrix theory and establish several useful asymptotic results. With the aid of these results, we derive the limiting distributions of the two tests under the null and alternative hypotheses. We further show that the quadratic loss based test is asymptotically unbiased. We conduct a Monte Carlo simulation study to examine the finite sample performance of the two tests. Our simulation results show that the limiting null distributions approximate the null distributions of the test statistics quite well, and that the corresponding asymptotic critical values control the Type I error rate well. Our numerical comparison suggests that the proposed tests outperform existing ones in terms of controlling the Type I error rate and power. Our simulations also indicate that the test based on quadratic loss tends to have better power than the test based on entropy loss.
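As a rough illustration of the ingredients named in the abstract, the sketch below fits a linear covariance structure $\Sigma(\theta) = \theta_1 A_1 + \dots + \theta_K A_K$ to a sample covariance matrix $S$ and evaluates the two classical losses, entropy loss $\operatorname{tr}(S\Sigma^{-1}) - \log\det(S\Sigma^{-1}) - p$ and quadratic loss $\operatorname{tr}\{(S\Sigma^{-1} - I_p)^2\}$. It rests on several assumptions that are not taken from the abstract: the function names are hypothetical, the parameters are fitted by ordinary least squares in the Frobenius norm, and the basis is the compound symmetry pair $A_1 = I_p$, $A_2 = \mathbf{1}\mathbf{1}^\top$. The code returns raw loss values only; it does not apply the centering and scaling needed to turn them into the test statistics whose limiting distributions the paper derives.

```python
import numpy as np


def fit_linear_structure(S, basis):
    """Least-squares fit of S by sum_k theta_k * A_k (assumed estimator).

    Solves the normal equations C theta = a with
    C[j, k] = tr(A_j A_k) and a[k] = tr(S A_k).
    """
    C = np.array([[np.trace(Aj @ Ak) for Ak in basis] for Aj in basis])
    a = np.array([np.trace(S @ Ak) for Ak in basis])
    theta = np.linalg.solve(C, a)
    Sigma = sum(t * A for t, A in zip(theta, basis))
    return theta, Sigma


def entropy_loss(S, Sigma):
    """tr(S Sigma^{-1}) - log det(S Sigma^{-1}) - p; needs S nonsingular."""
    M = np.linalg.solve(Sigma, S)  # Sigma^{-1} S: same trace and determinant
    _, logdet = np.linalg.slogdet(M)
    return np.trace(M) - logdet - S.shape[0]


def quadratic_loss(S, Sigma):
    """tr{(S Sigma^{-1} - I_p)^2}."""
    M = np.linalg.solve(Sigma, S)  # similar to S Sigma^{-1}, so same trace
    D = M - np.eye(S.shape[0])
    return np.trace(D @ D)


# Illustrative data: compound symmetry, Sigma = theta_1 * I + theta_2 * 11^T.
p, n = 5, 200  # n > p keeps S nonsingular, so the log-determinant is finite
basis = [np.eye(p), np.ones((p, p))]
Sigma_true = 1.0 * basis[0] + 0.5 * basis[1]

rng = np.random.default_rng(0)
X = rng.multivariate_normal(np.zeros(p), Sigma_true, size=n)
S = np.cov(X, rowvar=False)

theta_hat, Sigma_hat = fit_linear_structure(S, basis)
print("theta_hat:", theta_hat)
print("entropy loss:", entropy_loss(S, Sigma_hat))
print("quadratic loss:", quadratic_loss(S, Sigma_hat))
```

Both losses are zero when $S$ has exactly the assumed structure and grow as $S$ departs from it, which is why they can serve as the basis for a test. The entropy loss needs a nonsingular $S$, whereas the quadratic loss in this sketch only needs the fitted $\Sigma$ to be invertible.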

#### Article information

Source
Ann. Statist., Volume 47, Number 6 (2019), 3300-3334.

Dates
Revised: August 2018
First available in Project Euclid: 31 October 2019

https://projecteuclid.org/euclid.aos/1572487394

Digital Object Identifier
doi:10.1214/18-AOS1779

Mathematical Reviews number (MathSciNet)
MR4025743

Subjects
Primary: 62H15: Hypothesis testing
Secondary: 62H10: Distribution of statistics

#### Citation

Zheng, Shurong; Chen, Zhao; Cui, Hengjian; Li, Runze. Hypothesis testing on linear structures of high-dimensional covariance matrix. Ann. Statist. 47 (2019), no. 6, 3300–3334. doi:10.1214/18-AOS1779. https://projecteuclid.org/euclid.aos/1572487394

#### Supplemental materials

• Supplement to “Hypothesis testing on linear structures of high-dimensional covariance matrix”. This supplementary material consists of the technical proofs and additional numerical results.