On ε-optimality of the pursuit learning algorithm

Ryan Martin; Omkar Tilak

doi:10.1239/jap/1346955334

Abstract

Estimator algorithms in learning automata are useful tools for adaptive, real-time optimization in computer science and engineering applications. In this paper we investigate theoretical convergence properties for a special case of estimator algorithms - the pursuit learning algorithm. We identify and fill a gap in existing proofs of probabilistic convergence for pursuit learning. It is tradition to take the pursuit learning tuning parameter to be fixed in practical applications, but our proof sheds light on the importance of a vanishing sequence of tuning parameters in a theoretical convergence analysis.

Citation

Download Citation

Ryan Martin. Omkar Tilak. "On ε-optimality of the pursuit learning algorithm." J. Appl. Probab. 49 (3) 795 - 805, September 2012. https://doi.org/10.1239/jap/1346955334

Information

Published: September 2012

First available in Project Euclid: 6 September 2012

zbMATH: 1251.68167

MathSciNet: MR3012100

Digital Object Identifier: 10.1239/jap/1346955334

Subjects:

Primary: 68Q87

Secondary: 68W27 , 68W40

Keywords: convergence , indirect estimator algorithm , learning automaton

Abstract

Citation

Information

KEYWORDS/PHRASES

PUBLICATION TITLE:

PUBLICATION YEARS