Open Access
2018 Bandwidth selection for kernel density estimators of multivariate level sets and highest density regions
Charles R. Doss, Guangwei Weng
Electron. J. Statist. 12(2): 4313-4376 (2018). DOI: 10.1214/18-EJS1501

Abstract

We consider bandwidth matrix selection for kernel density estimators of density level sets in $\mathbb{R} ^{d}$, $d\ge 2$. We also consider estimation of highest density regions, which differs from estimating level sets in that one specifies the probability content of the set rather than specifying the level directly. This complicates the problem. Bandwidth selection for KDEs is well studied, but the goal of most methods is to minimize a global loss function for the density or its derivatives. The loss we consider here is instead the measure of the symmetric difference of the true set and estimated set. We derive an asymptotic approximation to the corresponding risk. The approximation depends on unknown quantities which can be estimated, and the approximation can then be minimized to yield a choice of bandwidth, which we show in simulations performs well. We provide an R package lsbs for implementing our procedure.

Citation

Download Citation

Charles R. Doss. Guangwei Weng. "Bandwidth selection for kernel density estimators of multivariate level sets and highest density regions." Electron. J. Statist. 12 (2) 4313 - 4376, 2018. https://doi.org/10.1214/18-EJS1501

Information

Received: 1 June 2018; Published: 2018
First available in Project Euclid: 18 December 2018

zbMATH: 07003245
MathSciNet: MR3892342
Digital Object Identifier: 10.1214/18-EJS1501

Subjects:
Primary: 62G07

Keywords: Bandwidth selection , highest density region estimation , kernel density estimator , level set estimation

Vol.12 • No. 2 • 2018
Back to Top