Journal of Applied Probability

Importance sampling for failure probabilities in computing and data transmission

Asmussen Søren

Source: J. Appl. Probab. Volume 46, Number 3 (2009), 768-790.

Abstract

In this paper we study efficient simulation algorithms for estimating P(Xx), where X is the total time of a job with ideal time $T$ that needs to be restarted after a failure. The main tool is importance sampling, where a good importance distribution is identified via an asymptotic description of the conditional distribution of T given Xx. If Tt is constant, the problem reduces to the efficient simulation of geometric sums, and a standard algorithm involving a Cramér-type root, γ(t), is available. However, we also discuss an algorithm that avoids finding the root. If T is random, particular attention is given to T having either a gamma-like tail or a regularly varying tail, and to failures at Poisson times. Different types of conditional limit occur, in particular exponentially tilted Gumbel distributions and Pareto distributions. The algorithms based upon importance distributions for T using these asymptotic descriptions have bounded relative error as x→∞ when combined with the ideas used for a fixed t. Nevertheless, we give examples of algorithms carefully designed to enjoy bounded relative error that may provide little or no asymptotic improvement over crude Monte Carlo simulation when the computational effort is taken into account. To resolve this problem, an alternative algorithm using two-sided Lundberg bounds is suggested.

Primary Subjects: 65C05, 68M15
Secondary Subjects: 60F05, 68O20
Keywords: Communications engineering; compound sum; computer reliability; conditioned limit theorem; Cramér root; exponential tilting; geometric sum; Gumbel distribution; integral asymptotics; Lundberg's inequality; rare event simulation; regular variation; RESTART

Full-text: Access denied (no subscription detected)

We're sorry, but we are unable to provide you with the full text of this article because we are not able to identify you as a subscriber.
If you have a personal subscription to this journal, then please login. If you are already logged in, then you may need to update your profile to register your subscription. Read more about accessing full-text
Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.jap/1253279851
Digital Object Identifier: doi:10.1239/jap/1253279851

References

Abramowitz, M. and Stegun, I. A. (eds) (1972). Handbook of Mathematical Functions. Dover, New York.
Andersen, L. N. and Asmussen, S. (2008). Parallel computing, failure recovery and extreme values. J. Statist. Theory Pract. 2, 279--292.
Asmussen, S. (2003). Applied Probability and Queues, 2nd edn. Springer, New York.
Mathematical Reviews (MathSciNet): MR1978607
Asmussen, S. and Glynn, P. W. (2007). Stochastic Simulation: Algorithms and Analysis. Springer, New York.
Mathematical Reviews (MathSciNet): MR2331321
Zentralblatt MATH: 1126.65001
Asmussen, S. and Lipsky, L. (2008). Failure recovery in computing and data transmission: limit theorems for checkpointing. Working paper. Aarhus University.
Asmussen, S. et al. (2008). Asymptotic behavior of total times for jobs that must start over if a failure occurs. Math. Operat. Res. 33, 932--944.
Mathematical Reviews (MathSciNet): MR2464652
Digital Object Identifier: doi:10.1287/moor.1080.0329
Billingsley, P. (1968). Convergence of Probability Measures. John Wiley, New York.
Mathematical Reviews (MathSciNet): MR233396
Bingham, N. H., Goldie, C. M. and Teugels, J. L. (1987). Regular Variation. Cambridge University Press.
Mathematical Reviews (MathSciNet): MR898871
Blanchet, J. and Glynn, P. W. (2008). Efficient rare-event simulation for the maximum of heavy-tailed random walks. Ann. Appl. Prob. 18, 1351--1378.
Mathematical Reviews (MathSciNet): MR2434174
Zentralblatt MATH: 1147.60315
Digital Object Identifier: doi:10.1214/07-AAP485
Project Euclid: euclid.aoap/1216677125
Blanchet, J. H. and Li, C. (2006). Efficient rare-event simulation for geometric sums. In Proc. RESIM, Bamberg, Germany.
David, H. A. (1970). Order Statistics. John Wiley, New York.
Mathematical Reviews (MathSciNet): MR286226
Fisher, R. A. (1929). Tests of significance in harmonic analysis. Proc. R. Soc. London A 125, 54--59.
Glynn, P. W. and Whitt, W. (1992). The asymptotic efficiency of simulation estimators. Operat. Res. 40, 505--520.
Mathematical Reviews (MathSciNet): MR1180030
Zentralblatt MATH: 0760.62009
Digital Object Identifier: doi:10.1287/opre.40.3.505
Hammersley, J. M. and Handscomb, D. C. (1964). Monte Carlo Methods. Methuen, London.
Mathematical Reviews (MathSciNet): MR223065
Jelenković, P. and Tan, J. (2007). Can retransmissions of superexponential documents cause subexponential delays? In Proc. IEEE INFOCOM (Anchorage, May 2007), pp. 892--900.
Jelenković, P. and Tan, J. (2007). Characterizing heavy-tailed distributions induced by retransmissions. Tech. Rep. EE2007-09-07, Columbia University.
Sheahan, R., Lipsky, L., Fiorini, P. and Asmussen, S. (2006). On the distribution of task completion times for tasks that must restart from the beginning if failure occurs. In ACM SIGMETRICS Performance Evaluation Review Association for Computing Machinery, New York, pp. 24--26.
Van Leeuwaarden, J. S. H., Löpker, A. H. and Janssen, A. J. E. M. (2008). Connecting renewal age processes and M/D/1 processor sharing queues through stickbreaking. EURANDOM Report 2008-17.
Willmot, G. E. and Lin, S. X. (2001). Lundberg Approximations for Compound Distributions with Insurance Applications (Lecture Notes Statist. 156), Springer, New York.
Mathematical Reviews (MathSciNet): MR1794566
Zentralblatt MATH: 0962.62099

2009 © Applied Probability Trust