Open Access
March 2010
Analysis of dependence among size, rate and duration in internet flows
Cheolwoo Park, Felix Hernández-Campos, J. S. Marron, Kevin Jeffay, F. Donelson Smith
Ann. Appl. Stat. 4(1): 26-52 (March 2010). DOI: 10.1214/09-AOAS268


In this paper we rigorously examine the evidence for dependence among data size, transfer rate and duration in Internet flows. We emphasize two statistical approaches for studying dependence: Pearson’s correlation coefficient and the extremal dependence analysis method. We apply these methods to large data sets of packet traces from three networks. Our major results show that Pearson’s correlation coefficients between size and duration are much smaller than one might expect. We also find that correlation coefficients between size and rate are generally small and can be strongly affected by applying thresholds to size or duration. Based on Transmission Control Protocol (TCP) connection startup mechanisms, we argue that thresholds on size should be more useful than thresholds on duration in the analysis of correlations. Extremal dependence analysis leads to a similar conclusion: extremal values of size and rate are remarkably independent.
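
The thresholded correlation analysis described in the abstract can be illustrated with a minimal sketch. This is not the authors' code or data. The flows are synthetic (size and duration drawn independently from lognormal distributions), the log10 transform is an assumption made here because flow sizes are heavy-tailed, rate is assumed to be size divided by duration, and the names `pearson`, `flow_correlations` and `min_size` are hypothetical.

```python
import math
import random


def pearson(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def flow_correlations(flows, min_size=0.0):
    """Pairwise correlations among log size, log duration and log rate.

    flows: iterable of (size_bytes, duration_seconds) pairs.
    min_size: hypothetical size threshold; smaller flows are dropped.
    """
    kept = [(s, d) for s, d in flows if s >= min_size and s > 0 and d > 0]
    # Assumption: work on a log10 scale, not necessarily the paper's choice.
    log_size = [math.log10(s) for s, _ in kept]
    log_dur = [math.log10(d) for _, d in kept]
    # Assumption: rate = size / duration, so log rate = log size - log duration.
    log_rate = [ls - ld for ls, ld in zip(log_size, log_dur)]
    return {
        "n": len(kept),
        "size_duration": pearson(log_size, log_dur),
        "size_rate": pearson(log_size, log_rate),
        "duration_rate": pearson(log_dur, log_rate),
    }


# Synthetic flows for illustration only; these are not packet-trace data.
rng = random.Random(0)
flows = [(rng.lognormvariate(9, 2), rng.lognormvariate(0, 1.5))
         for _ in range(5000)]

all_flows = flow_correlations(flows)
large_flows = flow_correlations(flows, min_size=1e5)
```

Comparing `all_flows` with `large_flows` shows how the coefficients change once a size threshold is applied. A threshold on duration could be sketched the same way by filtering on the second element of each pair.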



Cheolwoo Park, Felix Hernández-Campos, J. S. Marron, Kevin Jeffay, F. Donelson Smith. "Analysis of dependence among size, rate and duration in internet flows." Ann. Appl. Stat. 4 (1) 26-52, March 2010.


Published: March 2010
First available in Project Euclid: 11 May 2010

zbMATH: 1189.62101
MathSciNet: MR2758083
Digital Object Identifier: 10.1214/09-AOAS268

Keywords: correlation analysis, extremal dependence analysis, internet flows, network performance, thresholding

Rights: Copyright © 2010 Institute of Mathematical Statistics


Vol. 4 • No. 1 • March 2010