The Annals of Applied Probability
- Ann. Appl. Probab.
- Volume 6, Number 3 (1996), 1024-1034.
Finite state multi-armed bandit problems: sensitive-discount, average-reward and average-overtaking optimality
We express Gittins indices for multi-armed bandit problems as Laurent expansions around discount factor 1. The coefficients of these expansions are then used to characterize stationary optimal policies when the optimality criteria are sensitive-discount optimality (otherwise known as Blackwell optimality), average-reward optimality and average-overtaking optimality. We also obtain bounds and derive optimality conditions for policies of a type that continue playing the same bandit as long as the state of that bandit remains in prescribed sets.
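The discounted Gittins index that underlies the paper's Laurent expansions can be computed numerically for a single finite-state bandit. The sketch below does not implement the paper's expansion around discount factor 1; it computes the ordinary Gittins index of each state at a fixed discount factor beta, using the well-known restart-in-state formulation (in each state, either continue playing the bandit or restart it in the reference state), solved by value iteration. The function name and signature are illustrative, not from the paper.

```python
import numpy as np

def gittins_indices(P, r, beta=0.9, tol=1e-10, max_iter=100_000):
    """Gittins index of each state of a single finite-state bandit.

    P : (n, n) transition matrix of the bandit
    r : (n,) reward vector
    beta : discount factor in (0, 1)

    Uses the restart-in-state formulation: the index of state i is
    (1 - beta) times the value of an auxiliary MDP where each step one
    may either continue from the current state or restart in state i.
    """
    n = len(r)
    idx = np.empty(n)
    for i in range(n):
        V = np.zeros(n)
        for _ in range(max_iter):
            cont = r + beta * (P @ V)        # value of continuing from each state
            restart = cont[i]                # value of restarting the bandit in state i
            V_new = np.maximum(cont, restart)
            if np.max(np.abs(V_new - V)) < tol:
                V = V_new
                break
            V = V_new
        idx[i] = (1.0 - beta) * V[i]
    return idx
```

For example, a two-state bandit with absorbing states and rewards (1, 0) has indices (1, 0): the state paying 1 forever is always worth playing, the state paying 0 never is.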
First available in Project Euclid: 18 October 2002
Subject classifications:
Primary: 90C47: Minimax problems [See also 49K35]
90C31: Sensitivity, stability, parametric optimization
90C39: Dynamic programming [See also 49L20]
60G40: Stopping times; optimal stopping problems; gambling theory [See also 62L15, 91A60]
Katehakis, Michael N.; Rothblum, Uriel G. Finite state multi-armed bandit problems: sensitive-discount, average-reward and average-overtaking optimality. Ann. Appl. Probab. 6 (1996), no. 3, 1024--1034. doi:10.1214/aoap/1034968239. https://projecteuclid.org/euclid.aoap/1034968239