In this paper we study efficient simulation algorithms for estimating P(X›x), where
X is the total time of a job with ideal time $T$ that needs to be restarted after a
failure. The main tool is importance sampling, where a good importance distribution is
identified via an asymptotic description of the conditional distribution of T given
X›x. If T≡t is constant, the problem reduces to the efficient simulation of
geometric sums, and a standard algorithm involving a Cramér-type root, γ(t),
is available. However, we also discuss an algorithm that avoids finding
the root. If T is random, particular attention is given to T having either a
gamma-like tail or a regularly varying tail, and to failures at Poisson times.
Different types of conditional limit occur, in particular exponentially tilted
Gumbel distributions and Pareto distributions. The algorithms based upon importance
distributions for T using these asymptotic descriptions have bounded relative error
as x→∞ when combined with the ideas used for a fixed t. Nevertheless, we
give examples of algorithms carefully designed to enjoy bounded relative error that
may provide little or no asymptotic improvement over crude Monte Carlo simulation
when the computational effort is taken into account. To resolve this problem, an
alternative algorithm using two-sided Lundberg bounds is suggested.
Full-text: Access denied (no subscription detected)
We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription.
Read more about accessing full-text
References
Abramowitz, M. and Stegun, I. A. (eds) (1972). Handbook of Mathematical Functions. Dover, New York.
Andersen, L. N. and Asmussen, S. (2008). Parallel computing, failure recovery and extreme values. J. Statist. Theory Pract. 2, 279--292.
Asmussen, S. (2003). Applied Probability and Queues, 2nd edn. Springer, New York.
Asmussen, S. and Glynn, P. W. (2007). Stochastic Simulation: Algorithms and Analysis. Springer, New York.
Asmussen, S. and Lipsky, L. (2008). Failure recovery in computing and data transmission: limit theorems for checkpointing. Working paper. Aarhus University.
Asmussen, S. et al. (2008). Asymptotic behavior of total times for jobs that must start over if a failure occurs. Math. Operat. Res. 33, 932--944.
Billingsley, P. (1968). Convergence of Probability Measures. John Wiley, New York.
Mathematical Reviews (MathSciNet):
MR233396
Bingham, N. H., Goldie, C. M. and Teugels, J. L. (1987). Regular Variation. Cambridge University Press.
Mathematical Reviews (MathSciNet):
MR898871
Blanchet, J. and Glynn, P. W. (2008). Efficient rare-event simulation for the maximum of heavy-tailed random walks. Ann. Appl. Prob. 18, 1351--1378.
Blanchet, J. H. and Li, C. (2006). Efficient rare-event simulation for geometric sums. In Proc. RESIM, Bamberg, Germany.
David, H. A. (1970). Order Statistics. John Wiley, New York.
Mathematical Reviews (MathSciNet):
MR286226
Fisher, R. A. (1929). Tests of significance in harmonic analysis. Proc. R. Soc. London A 125, 54--59.
Glynn, P. W. and Whitt, W. (1992). The asymptotic efficiency of simulation estimators. Operat. Res. 40, 505--520.
Hammersley, J. M. and Handscomb, D. C. (1964). Monte Carlo Methods. Methuen, London.
Mathematical Reviews (MathSciNet):
MR223065
Jelenković, P. and Tan, J. (2007). Can retransmissions of superexponential documents cause subexponential delays? In Proc. IEEE INFOCOM (Anchorage, May 2007), pp. 892--900.
Jelenković, P. and Tan, J. (2007). Characterizing heavy-tailed distributions induced by retransmissions. Tech. Rep. EE2007-09-07, Columbia University.
Sheahan, R., Lipsky, L., Fiorini, P. and Asmussen, S. (2006). On the distribution of task completion times for tasks that must restart from the beginning if failure occurs. In ACM SIGMETRICS Performance Evaluation Review Association for Computing Machinery, New York, pp. 24--26.
Van Leeuwaarden, J. S. H., Löpker, A. H. and Janssen, A. J. E. M. (2008). Connecting renewal age processes and M/D/1 processor sharing queues through stickbreaking. EURANDOM Report 2008-17.
Willmot, G. E. and Lin, S. X. (2001). Lundberg Approximations for Compound Distributions with Insurance Applications (Lecture Notes Statist. 156), Springer, New York.