## Electronic Journal of Statistics

### Analysis of proteomics data: Phase amplitude separation using an extended Fisher-Rao metric

#### Abstract

We consider the problem of alignment and classification of proteomics data, that is described in Koch et al. [4], using the Extended Fisher-Rao (EFR) framework introduced in [6]. We demonstrate this framework by separating amplitude and phase components of functional data from patients having therapeutic treatments for Acute Myeloid Leukemia (AML). Then, using individual functional principal component analysis, for both the phase and amplitude components [8], we obtain bases for principal subspaces and model the data by imposing probability models on principal coefficients. Lastly, using the distances calculated from individual components, we demonstrate a successful discrimination between responders and non-responders to treatment for AML.

#### Article information

Source
Electron. J. Statist., Volume 8, Number 2 (2014), 1724-1733.

Dates
First available in Project Euclid: 29 October 2014

https://projecteuclid.org/euclid.ejs/1414588155

Digital Object Identifier
doi:10.1214/14-EJS900B

Mathematical Reviews number (MathSciNet)
MR3273587

Zentralblatt MATH identifier
1305.62380

#### Citation

Tucker, J. Derek; Wu, Wei; Srivastava, Anuj. Analysis of proteomics data: Phase amplitude separation using an extended Fisher-Rao metric. Electron. J. Statist. 8 (2014), no. 2, 1724--1733. doi:10.1214/14-EJS900B. https://projecteuclid.org/euclid.ejs/1414588155

#### References

• [1] Bigot, J. and Gadat, S. (2010). Smoothing under diffeomorphic constraints with homeomorphic splines., SIAM Journal on Numerical Analysis 48 777–785.
• [2] Bolle, R. M., Connell, J. H., Pankanti, S., Ratha, N. K. and Senior, A. W. (2005). The relation between the ROC curve and the CMC. In, Automatic Identification Advanced Technologies, 2005. Fourth IEEE Workshop on, 15–20.
• [3] Joshi, S. H., Klassen, E., Srivastava, A. and Jermyn, I. H. (2007). A novel representation for Riemannian analysis of elastic curves in $\mathbbR ^n$. In, Proceedings of IEEE CVPR, 1–7.
• [4] Koch, I., Hoffmann, P. and Marron, J. S. (2014). Proteomics profiles from mass spectrometry., Electronic Journal of Statistics 8 1703–1713, Special Section on Statistics of Time Warpings and Phase Variations.
• [5] Kurtek, S., Srivastava, A. and Wu, W. (2011). Signal estimation under random time-warpings and nonlinear signal alignment. In, Proceedings of Advances in Neural Information Processing Systems (NIPS), Grenada, Spain, 676–683.
• [6] Srivastava, A., Wu, W., Kurtek, S., Klassen, E. and Marron, J. S. (2011a). Registration of functional data using Fisher-Rao metric., arXiv:1103.3817v2 [math.ST].
• [7] Srivastava, A., Klassen, E., Joshi, S. H. and Jermyn, I. H. (2011b). Shape analysis of elastic curves in Euclidean spaces., Pattern Analysis and Machine Intelligence, IEEE Transactions on 33 1415–1428.
• [8] Tucker, J. D., Wu, W. and Srivastava, A. (2013). Generative models for functional data using phase and amplitude separation., Computational Statistics and Data Analysis 61 50–66.
• [9] Wu, W. and Srivastava, A. (2014). Analysis of spike train data: Alignment and comparisons using extended Fisher-Rao metric., Electronic Journal of Statistics 8 1776–1785, Special Section on Statistics of Time Warpings and Phase Variations.