Internet Mathematics

A Stochastic Model for the Link Analysis of the Web

Paola Favati, Grazia Lotti, Ornella Menchi, and Francesco Romani

Full-text: Open access

Abstract

The behavior of inlink and outlink distributions appears to be one of the most studied properties of the web structure. The literature agrees that the inlink distribution follows a power law, but no such agreement exists for the outlink distribution. Accurate observations show that in the low-degree region the link distribution fails to fit a power law with a discrepancy larger for outlinks than for inlinks. Moreover, a power law, as well as any continuous function, does not fit the scattered behavior shared by both the link distributions for large-degree values. The linking model we consider here is a mixed one, based on both the preferential attachment strategy and the uniform attachment strategy. A new approximation technique is devised to detect the parameters of the steady state solution that describe a real data set. A stochastic technique is suggested to describe the scattering of the data. With these techniques the model appears to be well suited for describing both inlink and outlink distributions. The experimentation on subsets of the World Wide Web and of Wikipedia shows that our approach produces an approximation more adequate than the power law. This approximation suggests that the two attachment strategies play a different role in the inlink and the outlink cases.

Article information

Source
Internet Math., Volume 3, Number 4 (2006), 509-531.

Dates
First available in Project Euclid: 18 November 2008

Permanent link to this document
https://projecteuclid.org/euclid.im/1227025011

Mathematical Reviews number (MathSciNet)
MR2412875

Zentralblatt MATH identifier
1147.68345

Citation

Favati, Paola; Lotti, Grazia; Menchi, Ornella; Romani, Francesco. A Stochastic Model for the Link Analysis of the Web. Internet Math. 3 (2006), no. 4, 509--531. https://projecteuclid.org/euclid.im/1227025011


Export citation