Abstract
In 1926, G. Udny Yule (J. R. Stat. Soc. 89 (1926) 1–63) considered the following problem: given a sequence of pairs of random variables $\{X_k, Y_k\}$ ($k = 1, 2, \ldots, n$), and letting $X_i = S_i$ and $Y_i = S'_i$, where $S_i$ and $S'_i$ are the partial sums of two independent random walks, what is the distribution of the empirical correlation coefficient
$$\rho_n = \frac{\sum_{i=1}^{n} S_i S'_i - \frac{1}{n}\left(\sum_{i=1}^{n} S_i\right)\left(\sum_{i=1}^{n} S'_i\right)}{\sqrt{\sum_{i=1}^{n} S_i^2 - \frac{1}{n}\left(\sum_{i=1}^{n} S_i\right)^2}\,\sqrt{\sum_{i=1}^{n} (S'_i)^2 - \frac{1}{n}\left(\sum_{i=1}^{n} S'_i\right)^2}}\,?$$
Yule empirically observed the distribution of this statistic to be heavily dispersed and frequently large in absolute value, leading him to call it “nonsense correlation.” This unexpected finding led to his formulation of two concrete questions, each of which would remain open for more than ninety years: (i) find (analytically) the variance of $\rho_n$ as $n \to \infty$, and (ii) find (analytically) the higher order moments and the density of $\rho_n$ as $n \to \infty$. In 2017, Ernst, Shepp and Wyner (Ann. Statist. 45 (2017) 1789–1809) considered the empirical correlation coefficient
$$\rho := \frac{\int_0^1 W_1(t) W_2(t)\,dt - \int_0^1 W_1(t)\,dt \int_0^1 W_2(t)\,dt}{\sqrt{\int_0^1 W_1^2(t)\,dt - \left(\int_0^1 W_1(t)\,dt\right)^2}\,\sqrt{\int_0^1 W_2^2(t)\,dt - \left(\int_0^1 W_2(t)\,dt\right)^2}}$$
of two independent Wiener processes $W_1, W_2$, the limit to which $\rho_n$ converges weakly, as was first shown by P.C.B. Phillips (J. Econometrics 33 (1986) 311–340). Using tools from integral equation theory, Ernst, Shepp and Wyner (Ann. Statist. 45 (2017) 1789–1809) closed question (i) by explicitly calculating the second moment of $\rho$ to be .240522. This paper adopts a completely different approach to the same question, rooted in an earlier literature on the laws of quadratic functionals of Gaussian diffusions (in particular, (Adv. in Appl. Probab. 25 (1993) 570–584; Stoch. Stoch. Rep. 41 (1992) 201–218)). This allows us to develop an Itô-formula approach from which we calculate expressions for the Laplace transform of $\rho$, leading to expressions for the moments which we evaluate up to order 16, thereby closing question (ii). This leads, for the first time, to an approximation to the density of Yule’s nonsense correlation. The broad applicability of this approach is demonstrated by answering the corresponding questions when the pair of independent Brownian motions is replaced by a pair of correlated Brownian motions, or by two independent Ornstein-Uhlenbeck processes, or by two independent Brownian bridges. We conclude by extending the definition of $\rho$ to the time interval $[0, T]$ for any $T > 0$ and prove a Central Limit Theorem for the case of two independent Ornstein-Uhlenbeck processes.
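As an informal illustration of the statistic discussed above, the following Monte Carlo sketch simulates pairs of independent random walks and averages the squared empirical correlation coefficient. It is not the analytic method of the paper. The function names, the choice of standard normal steps, and the default sample sizes are assumptions made for this example only. For a large number of steps, the estimate should land near the second moment .240522 reported by Ernst, Shepp and Wyner, up to sampling error.

```python
import math
import random


def random_walk(n, rng):
    """Partial sums S_1..S_n of n i.i.d. standard normal steps (an assumed step law)."""
    total, path = 0.0, []
    for _ in range(n):
        total += rng.gauss(0.0, 1.0)
        path.append(total)
    return path


def empirical_correlation(xs, ys):
    """Empirical (Pearson) correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def simulate_second_moment(n_steps=500, n_trials=2000, seed=0):
    """Monte Carlo estimate of E[rho_n^2] for two independent random walks."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        walk_a = random_walk(n_steps, rng)
        walk_b = random_walk(n_steps, rng)
        rho = empirical_correlation(walk_a, walk_b)
        total += rho * rho
    return total / n_trials


if __name__ == "__main__":
    # The walks are independent, yet the squared correlation is far from zero on average.
    print(simulate_second_moment())
```

Because the mean of $\rho_n$ is zero by symmetry, the averaged squared correlation also estimates the variance asked for in question (i). A nonzero value here reflects Yule's observation that independent random walks frequently show large correlations.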
Funding Statement
The first named author acknowledges, with gratitude, the support of The Office of Naval Research’s Mathematical Data Science program (grants N00014-18-1-2192 and N00014-21-1-2672).
Acknowledgments
We thank Professor V. de la Peña, Professor Frederi Viens, and Professor Ivan Corwin for helpful conversations about this work. We are also grateful to the Editor-in-Chief and to two anonymous referees, whose comments improved the quality of this article.
Citation
Philip A. Ernst, L.C.G. Rogers, Quan Zhou. "Yule’s “nonsense correlation”: Moments and density." Bernoulli 31 (1) 412–431, February 2025. https://doi.org/10.3150/24-BEJ1733
Information