Open Access
March 2017
The General Projected Normal Distribution of Arbitrary Dimension: Modeling and Bayesian Inference
Daniel Hernandez-Stumpfhauser, F. Jay Breidt, Mark J. van der Woerd
Bayesian Anal. 12(1): 113-133 (March 2017). DOI: 10.1214/15-BA989

Abstract

The general projected normal distribution is a simple and intuitive model for directional data in any dimension: a multivariate normal random vector divided by its length is the projection of that vector onto the surface of the unit hypersphere. Observed data consist of the projections, but not the lengths. Inference for this model has been restricted to the two-dimensional (circular) case, using Bayesian methods with data augmentation to generate the latent lengths and a Metropolis-within-Gibbs algorithm to sample from the posterior. We describe a new parameterization of the general projected normal distribution that makes inference in any dimension tractable, including the important three-dimensional (spherical) case, which has not previously been considered. Under this new parameterization, the full conditionals of the unknown parameters have closed forms, and we propose a new slice sampler to draw the latent lengths without the need for rejection. Gibbs sampling with this new scheme is fast and easy, leading to improved Bayesian inference; for example, it is now feasible to conduct model selection among complex mixture and regression models for large data sets. Our parameterization also allows straightforward incorporation of covariates into the covariance matrix of the multivariate normal, increasing the ability of the model to explain directional data as a function of independent regressors. Circular and spherical cases are considered in detail and illustrated with scientific applications. For the circular case, seasonal variation in time-of-day departures of anglers from recreational fishing sites is modeled using covariates in both the mean vector and covariance matrix. For the spherical case, we consider paired angles that describe the relative positions of carbon atoms along the backbone chain of a protein. 
We fit mixtures of general projected normals to these data, with the best-fitting mixture accurately describing biologically meaningful structures including helices, β-sheets, and coils and turns. Finally, we show via simulation that our methodology has satisfactory performance in some 10-dimensional and 50-dimensional problems.
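
As an illustration of the construction described in the abstract (a multivariate normal vector divided by its length), the following is a minimal sketch, not the authors' implementation. It draws a normal vector, projects it onto the unit hypersphere, and returns both the direction and the length; only the direction would be observed, while the length is the latent quantity that data augmentation has to generate. The function names `cholesky` and `sample_projected_normal`, and the values chosen for `mu` and `sigma`, are assumptions made for this example and do not come from the paper. The paper's new parameterization and its slice sampler are not reproduced here.

```python
import math
import random


def cholesky(sigma):
    """Return lower-triangular L with L L^T = sigma (sigma symmetric positive definite)."""
    n = len(sigma)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(sigma[i][i] - s)
            else:
                L[i][j] = (sigma[i][j] - s) / L[j][j]
    return L


def sample_projected_normal(mu, sigma, rng):
    """Draw x ~ N(mu, sigma); return (u, r) with u = x / ||x|| and r = ||x||."""
    n = len(mu)
    L = cholesky(sigma)
    # Standard normal draws, then the affine map x = mu + L z.
    z = [rng.gauss(0.0, 1.0) for _ in range(n)]
    x = [mu[i] + sum(L[i][k] * z[k] for k in range(i + 1)) for i in range(n)]
    r = math.sqrt(sum(v * v for v in x))  # latent length (unobserved in data)
    u = [v / r for v in x]                # observed direction on the unit sphere
    return u, r


# Hypothetical three-dimensional (spherical) example; values are illustrative only.
rng = random.Random(0)
mu = [1.0, 0.5, -0.3]
sigma = [[1.0, 0.2, 0.0],
         [0.2, 1.5, 0.3],
         [0.0, 0.3, 0.8]]
u, r = sample_projected_normal(mu, sigma, rng)
```

The same function works in any dimension by changing the length of `mu` and the size of `sigma`; the pure-Python Cholesky factorization is used only to keep the sketch free of external dependencies.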

Citation


Daniel Hernandez-Stumpfhauser, F. Jay Breidt, Mark J. van der Woerd. "The General Projected Normal Distribution of Arbitrary Dimension: Modeling and Bayesian Inference." Bayesian Anal. 12(1): 113-133, March 2017. https://doi.org/10.1214/15-BA989

Information

Published: March 2017
First available in Project Euclid: 19 January 2016

zbMATH: 1384.62176
MathSciNet: MR3597569
Digital Object Identifier: 10.1214/15-BA989

Keywords: Circular data, directional data, Gibbs sampler, Markov chain Monte Carlo, protein structure analysis, spherical data

Rights: Copyright © 2017 International Society for Bayesian Analysis

Vol. 12 • No. 1 • March 2017