Open Access
March 2010 Analysis of dependence among size, rate and duration in internet flows
Cheolwoo Park, Felix Hernández-Campos, J. S. Marron, Kevin Jeffay, F. Donelson Smith
Ann. Appl. Stat. 4(1): 26-52 (March 2010). DOI: 10.1214/09-AOAS268

Abstract

In this paper we examine rigorously the evidence for dependence among data size, transfer rate and duration in Internet flows. We emphasize two statistical approaches for studying dependence, including Pearson’s correlation coefficient and the extremal dependence analysis method. We apply these methods to large data sets of packet traces from three networks. Our major results show that Pearson’s correlation coefficients between size and duration are much smaller than one might expect. We also find that correlation coefficients between size and rate are generally small and can be strongly affected by applying thresholds to size or duration. Based on Transmission Control Protocol connection startup mechanisms, we argue that thresholds on size should be more useful than thresholds on duration in the analysis of correlations. Using extremal dependence analysis, we draw a similar conclusion, finding remarkable independence for extremal values of size and rate.

Citation

Download Citation

Cheolwoo Park. Felix Hernández-Campos. J. S. Marron. Kevin Jeffay. F. Donelson Smith. "Analysis of dependence among size, rate and duration in internet flows." Ann. Appl. Stat. 4 (1) 26 - 52, March 2010. https://doi.org/10.1214/09-AOAS268

Information

Published: March 2010
First available in Project Euclid: 11 May 2010

zbMATH: 1189.62101
MathSciNet: MR2758083
Digital Object Identifier: 10.1214/09-AOAS268

Keywords: correlation analysis , extremal dependence analysis , internet flows , network performance , thresholding

Rights: Copyright © 2010 Institute of Mathematical Statistics

Vol.4 • No. 1 • March 2010
Back to Top