September 2012 On ε-optimality of the pursuit learning algorithm
Ryan Martin, Omkar Tilak
Author Affiliations +
J. Appl. Probab. 49(3): 795-805 (September 2012). DOI: 10.1239/jap/1346955334

Abstract

Estimator algorithms in learning automata are useful tools for adaptive, real-time optimization in computer science and engineering applications. In this paper we investigate theoretical convergence properties for a special case of estimator algorithms - the pursuit learning algorithm. We identify and fill a gap in existing proofs of probabilistic convergence for pursuit learning. It is tradition to take the pursuit learning tuning parameter to be fixed in practical applications, but our proof sheds light on the importance of a vanishing sequence of tuning parameters in a theoretical convergence analysis.

Citation

Download Citation

Ryan Martin. Omkar Tilak. "On ε-optimality of the pursuit learning algorithm." J. Appl. Probab. 49 (3) 795 - 805, September 2012. https://doi.org/10.1239/jap/1346955334

Information

Published: September 2012
First available in Project Euclid: 6 September 2012

zbMATH: 1251.68167
MathSciNet: MR3012100
Digital Object Identifier: 10.1239/jap/1346955334

Subjects:
Primary: 68Q87
Secondary: 68W27 , 68W40

Keywords: convergence , indirect estimator algorithm , learning automaton

Rights: Copyright © 2012 Applied Probability Trust

JOURNAL ARTICLE
11 PAGES

This article is only available to subscribers.
It is not available for individual sale.
+ SAVE TO MY LIBRARY

Vol.49 • No. 3 • September 2012
Back to Top