The Annals of Applied Probability

The oscillatory distribution of distances in random tries

Costas A. Christophi and Hosam M. Mahmoud

Source: Ann. Appl. Probab. Volume 15, Number 2 (2005), 1536-1564.

Abstract

We investigate Δn, the distance between randomly selected pairs of nodes among n keys in a random trie, which is a kind of digital tree. Analytical techniques, such as the Mellin transform and an excursion between poissonization and depoissonization, capture small fluctuations in the mean and variance of these random distances. The mean increases logarithmically in the number of keys, but curiously enough the variance remains O(1), as n→∞. It is demonstrated that the centered random variable Δn*n−⌊2log2n⌋ does not have a limit distribution, but rather oscillates between two distributions.

Primary Subjects: 05C05, 60C05
Secondary Subjects: 60F05, 68P05, 68P10, 68P20
Keywords: Random trees; recurrence; Mellin transform; poissonization

Full-text: Open access

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.aoap/1115137985
Digital Object Identifier: doi:10.1214/105051605000000106
Mathematical Reviews number (MathSciNet): MR2134114
Zentralblatt MATH identifier: 02187040

References

Coffman, E. and Eve, J. (1970). File structures using hashing functions. Comm. ACM 13 427--432, 436.
De La Briandais, R. (1959). File searching using variable length keys. In Proceedings of the Western Joint Computer Conference 295--298. AFIPS, San Francisco, CA.
Fill, J., Mahmoud, H. and Szpankowski, W. (1996). On the distribution for the duration of a randomized leader election algorithm. Ann. Appl. Probab. 6 1260--1283.
Mathematical Reviews (MathSciNet): MR1422986
Digital Object Identifier: doi:10.1214/aoap/1035463332
Project Euclid: euclid.aoap/1035463332
Fredkin, E. (1960). Trie memory. Communications of the ACM 3 490--499.
Gutman, I. and Polansky, O. (1986). Mathematical Concepts in Organic Chemistry. Springer, Berlin.
Mathematical Reviews (MathSciNet): MR861119
Zentralblatt MATH: 0657.92024
Jacquet, P. and Régnier, M. (1986). Trie Partitioning Process: Limiting Distributions. Lecture Notes in Comput. Sci. 214 196--210. Springer, New York.
Mathematical Reviews (MathSciNet): MR848795
Zentralblatt MATH: 0605.68057
Digital Object Identifier: doi:10.1007/BFb0022669
Jacquet, P. and Szpankowski, W. (1998). Analytical depoissonization and its applications. Theoret. Comput. Sci. 201 1--62.
Mathematical Reviews (MathSciNet): MR1625392
Digital Object Identifier: doi:10.1016/S0304-3975(97)00167-9
Kirschenhofer, P. and Prodinger, H. (1986). Some further results on digital search trees. Lecture Notes in Comput. Sci. 226 177--185. Springer, New York.
Mathematical Reviews (MathSciNet): MR864680
Zentralblatt MATH: 0596.68053
Knuth, D. (1998). The Art of Computer Programming 3: Sorting and Searching, 2nd ed. Addison--Wesley, Reading, MA.
Mathematical Reviews (MathSciNet): MR378456
Kuipers, L. and Niederreiter, H. (1974). Uniform Distribution of Sequences. Wiley, New York.
Mathematical Reviews (MathSciNet): MR419394
Zentralblatt MATH: 0281.10001
Mahmoud, H. (1992). Evolution of Random Search Trees. Wiley, New York.
Mathematical Reviews (MathSciNet): MR1140708
Zentralblatt MATH: 0762.68033
Mahmoud, H. (2000). Sorting: A Distribution Theory. Wiley, New York.
Mathematical Reviews (MathSciNet): MR1784633
Mahmoud, H. and Neininger, R. (2003). Distribution of distances in random binary search trees. Ann. Appl. Probab. 13 253--276.
Mathematical Reviews (MathSciNet): MR1951999
Digital Object Identifier: doi:10.1214/aoap/1042765668
Project Euclid: euclid.aoap/1042765668
Neininger, R. (2002). The Wiener index of random trees. Combin. Probab. Comput. 11 587--597.
Mathematical Reviews (MathSciNet): MR1940122
Digital Object Identifier: doi:10.1017/S0963548302005321
Panholzer, A. and Prodinger, H. (2004). Spanning tree size in random binary search trees. Ann. Appl. Probab. 14 718--733.
Mathematical Reviews (MathSciNet): MR2052899
Digital Object Identifier: doi:10.1214/105051604000000071
Project Euclid: euclid.aoap/1082737108
Prodinger, H. (2004). Oscillations in the analysis of algorithms and their cancellations. Journal of the Iranian Statistical Society 3 251--270.
Pittel, B. (1986). Paths in a random digital tree: Limiting distribution. Adv. in Appl. Probab. 18 139--155.
Mathematical Reviews (MathSciNet): MR827333
Szpankowski, W. (2001). Average Case Analysis of Algorithms on Sequences. Wiley, New York.
Mathematical Reviews (MathSciNet): MR1816272
Zentralblatt MATH: 0968.68205
Trinajstić, N. (1992). Chemical Graph Theory. CRC Press, Boca Raton, FL.
Mathematical Reviews (MathSciNet): MR1169298

2009 © Institute of Mathematical Statistics