- Statist. Surv.
- Volume 10 (2016), 132-167.
Measuring multivariate association and beyond
Simple correlation coefficients between two variables have been generalized to measure association between two matrices in many ways. Coefficients such as the RV coefficient, the distance covariance (dCov) coefficient and kernel based coefficients are being used by different research communities. Scientists use these coefficients to test whether two random vectors are linked. Once it has been ascertained that there is such association through testing, then a next step, often ignored, is to explore and uncover the association’s underlying patterns.
This article provides a survey of various measures of dependence between random vectors and tests of independence and emphasizes the connections and differences between the various approaches. After providing definitions of the coefficients and associated tests, we present the recent improvements that enhance their statistical properties and ease of interpretation. We summarize multi-table approaches and provide scenarii where the indices can provide useful summaries of heterogeneous multi-block data. We illustrate these different strategies on several examples of real data and suggest directions for future research.
Statist. Surv., Volume 10 (2016), 132-167.
Received: December 2015
First available in Project Euclid: 17 November 2016
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Josse, Julie; Holmes, Susan. Measuring multivariate association and beyond. Statist. Surv. 10 (2016), 132--167. doi:10.1214/16-SS116. https://projecteuclid.org/euclid.ssu/1479351622