Journal of Applied Probability

Multi-actor Markov decision processes

Hyun-Soo Ahn and Rhonda Righter
Source: J. Appl. Probab. Volume 42, Number 1 (2005), 15-26.

Abstract

We give a very general reformulation of multiactor Markov decision processes and show that there is a tendency for the actors to take the same action whenever possible. This considerably reduces the complexity of the problem, either facilitating numerical computation of the optimal policy or providing a basis for a heuristic.

First Page: Show Hide
Primary Subjects: 90C40
Secondary Subjects: 90B22
Full-text: Access denied (no subscription detected)
We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.jap/1110381367
Digital Object Identifier: doi:10.1239/jap/1110381367
Mathematical Reviews number (MathSciNet): MR2144089
Zentralblatt MATH identifier: 02199089

References

Ahn, H.-S., Duenyas, I. and Lewis, M. E. (2002). The optimal control of a two-stage tandem queueing system with flexible servers. Prob. Eng. Inf. Sci. 16, 453--469.
Mathematical Reviews (MathSciNet): MR1926931
Digital Object Identifier: doi:10.1017/S0269964802164047
Andradöttir, S., Ayhan, H. and Down, D. G. (2001). Server assignment policies for maximizing the steady-state throughput. Manag. Sci. 47, 1421--1439.
Gittins, J. C. (1979). Bandit processes and dynamic allocation indices. J. R. Statist. Soc. B 14, 148--177.
Mathematical Reviews (MathSciNet): MR547241
Harrison, J. M. (1975). Dynamic scheduling of a multiclass queue: discount optimality. Operat. Res. 23, 270--282.
Mathematical Reviews (MathSciNet): MR436376
Kaufman, D., Ahn, H.-S. and Lewis, M. E. (2004). On the introduction of agile, temporary workers into a tandem queueing system. Work in progress.
Klimov, G. P. (1974). Time-sharing service systems. I. J. Theory Prob. Appl. 19, 532--551.
Mathematical Reviews (MathSciNet): MR365761
Klimov, G. P. (1978). Time-sharing service systems. II. J. Theory Prob. Appl. 23, 314--321.
Mathematical Reviews (MathSciNet): MR499062
Koole, K. and Righter, R. (2004). Resource allocation in grid computing. Work in progress.
Mandelbaum, A. and Reiman, M. I. (1998). On pooling in queueing networks. Manag. Sci. 44, 971--981.
Sennott, L. I. (1999). Stochastic Dynamic Programming and the Control of Queueing Systems. John Wiley, New York.
Mathematical Reviews (MathSciNet): MR1645435
Zentralblatt MATH: 0997.93503
Van Oyen, M. P., Gel, E. G. S. and Hopp, W. J. (2001). Performance opportunity for workforce agility in collaborative and noncollaborative work systems. IIE Trans. 33, 761--777.
Vairaktarakis, G. L. (2003). The value of resource flexibility in the resource-constrained job assignment problem. Manag. Sci. 49, 718--732.
Weiss, G. and Pinedo, M. (1980). Scheduling tasks with exponential service times on non-identical processors to minimize various cost functions. J. Appl. Prob. 17, 187--202.
Mathematical Reviews (MathSciNet): MR557447

2012 © Applied Probability Trust

Journal of Applied Probability

Journal of Applied Probability