Open Access
2023 Asymptotic normality of Gini correlation in high dimension with applications to the K-sample problem
Yongli Sang, Xin Dang
Author Affiliations +
Electron. J. Statist. 17(2): 2539-2574 (2023). DOI: 10.1214/23-EJS2165

Abstract

The categorical Gini correlation proposed by Dang et al. [7] is a dependence measure to characterize independence between categorical and numerical variables. The asymptotic distributions of the sample correlation under dependence and independence have been established when the dimension of the numerical variable is fixed. However, its asymptotic behavior for high dimensional data has not been explored. In this paper, we develop the central limit theorem for the Gini correlation in the more realistic setting where the dimensionality of the numerical variable is diverging. We then construct a powerful and consistent test for the K-sample problem based on the asymptotic normality. The proposed test not only avoids computation burden but also gains power over the permutation procedure. Simulation studies and real data illustrations show that the proposed test is more competitive to existing methods across a broad range of realistic situations, especially in unbalanced cases.

Acknowledgments

Thanks Jun Li for sharing R codes on two-sample tests with us.

Citation

Download Citation

Yongli Sang. Xin Dang. "Asymptotic normality of Gini correlation in high dimension with applications to the K-sample problem." Electron. J. Statist. 17 (2) 2539 - 2574, 2023. https://doi.org/10.1214/23-EJS2165

Information

Received: 1 February 2022; Published: 2023
First available in Project Euclid: 6 November 2023

Digital Object Identifier: 10.1214/23-EJS2165

Subjects:
Primary: 60H20
Secondary: 60H15

Keywords: asymptotic normality , categorical Gini correlation , Distance correlation , high dimensional K-sample test

Vol.17 • No. 2 • 2023
Back to Top