Electronic Journal of Statistics

Recursive partitioning and multi-scale modeling on conditional densities

Li Ma

Full-text: Open access

Abstract

We introduce a nonparametric prior on the conditional distribution of a (univariate or multivariate) response given a set of predictors. The prior is constructed as a two-stage generative procedure: the first stage recursively partitions the predictor space, and the second stage generates the conditional distribution through a multi-scale nonparametric density model on each predictor partition block produced by the first stage. This design allows adaptive smoothing on both the predictor space and the response space, and it yields full posterior conjugacy of the model. Exact Bayesian inference can therefore be completed analytically through a forward-backward recursive algorithm without the need for MCMC, giving high computational efficiency (scaling linearly with the sample size). We show that this prior enjoys desirable theoretical properties such as full $L_{1}$ support and posterior consistency. We illustrate how to apply the model to a variety of inference problems, such as conditional density estimation, hypothesis testing, and model selection, in a manner similar to applying a parametric conjugate prior, while remaining fully nonparametric. We also compare the model with two other state-of-the-art Bayesian nonparametric models for conditional densities in terms of both model fit and computational time. A real data example from flow cytometry containing 455,472 observations illustrates the substantial computational efficiency of our method and its application to multivariate problems.
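
As a rough illustration of the two-stage generative idea described in the abstract, the following toy sketch draws one random conditional distribution. It is not the paper's construction: it assumes a single predictor on [0, 1), midpoint (dyadic) splits with a fixed stopping probability, a response discretized into equal bins on [0, 1), and a finite-depth Pólya tree with Beta(c k^2, c k^2) splits at level k. All function names and parameters (`partition`, `polya_tree_masses`, `stop_prob`, `c`) are hypothetical choices made for this example.

```python
import random


def partition(lo, hi, depth, max_depth, stop_prob, rng):
    """Stage 1 (toy): recursively split the predictor interval [lo, hi).

    Assumption: splits are always at the midpoint, and each block stops
    splitting with probability stop_prob or when max_depth is reached.
    """
    if depth >= max_depth or rng.random() < stop_prob:
        return [(lo, hi)]
    mid = (lo + hi) / 2
    left = partition(lo, mid, depth + 1, max_depth, stop_prob, rng)
    right = partition(mid, hi, depth + 1, max_depth, stop_prob, rng)
    return left + right


def polya_tree_masses(levels, rng, c=1.0):
    """Stage 2 (toy): draw probabilities for 2**levels equal response bins.

    Assumption: a finite Polya tree in which the mass of each bin is split
    between its two halves by a Beta(c*k^2, c*k^2) draw at level k.
    """
    masses = [1.0]
    for level in range(1, levels + 1):
        a = c * level ** 2
        refined = []
        for mass in masses:
            left_share = rng.betavariate(a, a)
            refined.extend([mass * left_share, mass * (1.0 - left_share)])
        masses = refined
    return masses


def draw_conditional_distribution(max_depth=3, stop_prob=0.4, levels=4, seed=0):
    """Draw predictor blocks, then one response distribution per block."""
    rng = random.Random(seed)
    blocks = partition(0.0, 1.0, 0, max_depth, stop_prob, rng)
    return [(block, polya_tree_masses(levels, rng)) for block in blocks]


if __name__ == "__main__":
    for (lo, hi), masses in draw_conditional_distribution():
        print(f"x in [{lo:.3f}, {hi:.3f}): {len(masses)} bins, "
              f"total mass {sum(masses):.6f}")
```

The sketch only simulates from a prior of this general shape. The posterior computation mentioned in the abstract (the forward-backward recursion) is not reproduced here.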

Article information

Source
Electron. J. Statist., Volume 11, Number 1 (2017), 1297-1325.

Dates
Received: November 2016
First available in Project Euclid: 14 April 2017

Permanent link to this document
https://projecteuclid.org/euclid.ejs/1492135235

Digital Object Identifier
doi:10.1214/17-EJS1254

Mathematical Reviews number (MathSciNet)
MR3635914

Zentralblatt MATH identifier
1362.62117

Subjects
Primary: 62F15: Bayesian inference; 62G99: None of the above, but in this section
Secondary: 62G07: Density estimation

Keywords
Pólya tree; multi-resolution inference; Bayesian nonparametrics; density regression; Bayesian CART

Rights
Creative Commons Attribution 4.0 International License.

Citation

Ma, Li. Recursive partitioning and multi-scale modeling on conditional densities. Electron. J. Statist. 11 (2017), no. 1, 1297--1325. doi:10.1214/17-EJS1254. https://projecteuclid.org/euclid.ejs/1492135235

