Open Access
April 2019
Join-the-shortest queue diffusion limit in Halfin–Whitt regime: Tail asymptotics and scaling of extrema
Sayan Banerjee, Debankur Mukherjee
Ann. Appl. Probab. 29(2): 1262-1309 (April 2019). DOI: 10.1214/18-AAP1436

Abstract

Consider a system of $N$ parallel single-server queues with unit-exponential service time distribution and a single dispatcher where tasks arrive as a Poisson process of rate $\lambda(N)$. When a task arrives, the dispatcher assigns it to one of the servers according to the Join-the-Shortest Queue (JSQ) policy. Eschenfeldt and Gamarnik (Math. Oper. Res. 43 (2018) 867–886) established that in the Halfin–Whitt regime where $(N-\lambda(N))/\sqrt{N}\to\beta>0$ as $N\to\infty$, the appropriately scaled occupancy measure of the system under the JSQ policy converges weakly on any finite time interval to a certain diffusion process as $N\to\infty$. Recently, it was further established by Braverman (2018) that the convergence result extends to the steady state as well, that is, the stationary occupancy measure of the system converges weakly to the steady state of the diffusion process as $N\to\infty$, proving the interchange of limits result.
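
The pre-limit system described above can be illustrated with a small event-driven simulation. The sketch below is not taken from the paper: the function name `simulate_jsq` and its parameters are hypothetical, the arrival rate is set to exactly $\lambda(N) = N - \beta\sqrt{N}$ as one convenient choice satisfying the Halfin–Whitt condition, and ties between shortest queues are treated as interchangeable. It tracks only the occupancy counts (how many servers hold exactly $k$ tasks) and simulates the finite-$N$ Markov chain, not the limiting diffusion.

```python
import math
import random


def simulate_jsq(n_servers, beta, horizon, seed=0):
    """Simulate n_servers unit-rate exponential servers under JSQ.

    Assumes lambda(N) = N - beta * sqrt(N) (illustrative choice) and an
    initially empty system. Returns (idle, two_plus): the number of idle
    servers and the number of servers with queue length >= 2 at `horizon`.
    """
    rng = random.Random(seed)
    arrival_rate = n_servers - beta * math.sqrt(n_servers)
    if arrival_rate <= 0:
        raise ValueError("need sqrt(n_servers) > beta")
    # counts[k] = number of servers whose queue length is exactly k
    counts = [n_servers]
    t = 0.0
    while True:
        busy = n_servers - counts[0]
        total_rate = arrival_rate + busy  # each busy server completes at rate 1
        t += rng.expovariate(total_rate)
        if t > horizon:
            break
        if rng.random() * total_rate < arrival_rate:
            # Arrival: JSQ sends the task to a server with minimal queue length.
            k = next(i for i, c in enumerate(counts) if c > 0)
            counts[k] -= 1
            if k + 1 == len(counts):
                counts.append(0)
            counts[k + 1] += 1
        else:
            # Departure: pick one of the busy servers uniformly at random.
            r = rng.randrange(busy)
            k = 1
            while r >= counts[k]:
                r -= counts[k]
                k += 1
            counts[k] -= 1
            counts[k - 1] += 1
    idle = counts[0]
    exactly_one = counts[1] if len(counts) > 1 else 0
    return idle, n_servers - idle - exactly_one


if __name__ == "__main__":
    n = 400
    idle, two_plus = simulate_jsq(n, beta=1.0, horizon=50.0, seed=1)
    # Quantities on the sqrt(N) scale used in the diffusion limit.
    print(idle / math.sqrt(n), two_plus / math.sqrt(n))
```

Dividing the two returned counts by $\sqrt{N}$, as in the usage lines, gives the scale on which the diffusion limit is stated; a single run at moderate $N$ is only a rough illustration of that scaling, not evidence for the tail results.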

In this paper, we perform a detailed analysis of the steady state of the above diffusion process. Specifically, we establish precise tail asymptotics of the stationary distribution and the scaling of extrema of the process over large time intervals. Our results imply that the asymptotic steady-state scaled number of servers with queue length two or larger exhibits an exponential tail, whereas that for the number of idle servers turns out to be Gaussian. From the methodological point of view, the diffusion process under consideration goes beyond the state-of-the-art techniques in the study of the steady state of diffusion processes. The lack of any closed-form expression for the steady state and the intricate interdependency of the process dynamics on its local times make the analysis significantly challenging. We develop a technique involving the theory of regenerative processes that provides a tractable form for the stationary measure and, in conjunction with several sharp hitting time estimates, acts as a key vehicle in establishing the results. The technique and the intermediate results might be of independent interest, and can possibly be used in understanding the bulk behavior of the process.

Citation

Sayan Banerjee, Debankur Mukherjee. "Join-the-shortest queue diffusion limit in Halfin–Whitt regime: Tail asymptotics and scaling of extrema." Ann. Appl. Probab. 29 (2) 1262–1309, April 2019. https://doi.org/10.1214/18-AAP1436

Information

Received: 1 March 2018; Revised: 1 July 2018; Published: April 2019
First available in Project Euclid: 24 January 2019

zbMATH: 07047449
MathSciNet: MR3910028
Digital Object Identifier: 10.1214/18-AAP1436

Subjects:
Primary: 60J60, 60K25
Secondary: 60H20, 60K05

Keywords: diffusion limit, Halfin–Whitt regime, Join the shortest queue, Local time, nonelliptic diffusion, regenerative processes, steady state analysis

Rights: Copyright © 2019 Institute of Mathematical Statistics
