- Volume 21, Number 4 (2015), 2289-2307.
We describe a novel algorithm for noisy global optimisation and continuum-armed bandits, with good convergence properties over any continuous reward function having finitely many polynomial maxima. Over such functions, our algorithm achieves square-root regret in bandits, and inverse-square-root error in optimisation, without prior information.
Our algorithm works by reducing these problems to tree-armed bandits, and we also provide new results in this setting. We show it is possible to adaptively combine multiple trees so as to minimise the regret, and also give near-matching lower bounds on the regret in terms of the zooming dimension.
Bernoulli, Volume 21, Number 4 (2015), 2289-2307.
Received: February 2013
Revised: February 2014
First available in Project Euclid: 5 August 2015
Permanent link to this document
Digital Object Identifier
Mathematical Reviews number (MathSciNet)
Zentralblatt MATH identifier
Bull, Adam D. Adaptive-treed bandits. Bernoulli 21 (2015), no. 4, 2289--2307. doi:10.3150/14-BEJ644. https://projecteuclid.org/euclid.bj/1438777594
- Supplement to “Adaptive-treed bandits”. We provide proofs of our results.