A Bernoulli Two-armed Bandit

Donald A. Berry

doi:10.1214/aoms/1177692553

Password Forgot your password?

Show

Remember Email on this computer

Remember Password

Please wait...

No Project Euclid account? Create an account
or Sign in with your institutional credentials

We can help you reset your password using the email address linked to your Project Euclid account.

Registered users receive a variety of benefits including the ability to customize email alerts, create favorite journals list, and save searches. Please note that a Project Euclid web account does not automatically grant access to full-text content. An institutional or society member subscription is required to view non-Open Access content. Contact customer_support@projecteuclid.org with any questions.
View Project Euclid Privacy Policy

All Fields are Required

* First Name

* Last/Family Name

* Email

* Password

Password Requirements: Minimum 8 characters, must include as least one uppercase, one lowercase letter, and one number or permitted symbol Valid Symbols for password:
~ Tilde
! Exclamation Mark
@ At sign
$ Dollar sign
^ Caret
( Opening Parenthesis
) Closing Parenthesis
_ Underscore
. Period

* Confirm Password

Please wait...

Web Account created successfully

Browse
Resources
About

Advanced Search

Home > Journals > Ann. Math. Statist. > Volume 43 > Issue 3 > Article

June, 1972 A Bernoulli Two-armed Bandit

Donald A. Berry

Ann. Math. Statist. 43(3): 871-897 (June, 1972). DOI: 10.1214/aoms/1177692553

ABOUT
FIRST PAGE
CITED BY
DOWNLOAD PAPER SAVE TO MY LIBRARY

PERSONAL SIGN IN
Full access may be available with your subscription

Password Forgot your password?

Show

Remember Email on this computer

Remember Password

No Project Euclid account? Create an account
or Sign in with your institutional credentials

PURCHASE SINGLE ARTICLE

This article is only available to subscribers. It is not available for individual sale.

This will count as one of your downloads.

You will have access to both the presentation and article (if available).

DOWNLOAD NOW

This content is available for download via your institution's subscription. To access this item, please sign in to your personal account.

Password Forgot your password?

Show

Remember Email on this computer

Remember Password

No Project Euclid account? Create an account

My Library

You currently do not have any folders to save your paper to! Create a new folder below.

Abstract

One of two independent Bernoulli processes (arms) with unknown expectations $\rho$ and $\lambda$ is selected and observed at each of $n$ stages. The selection problem is sequential in that the process which is selected at a particular stage is a function of the results of previous selections as well as of prior information about $\rho$ and $\lambda$. The variables $\rho$ and $\lambda$ are assumed to be independent under the (prior) probability distribution. The objective is to maximize the expected number of successes from the $n$ selections. Sufficient conditions for the optimality of selecting one or the other of the arms are given and illustrated for example distributions. The stay-on-a-winner rule is proved.