Abstract
This paper is concerned with high-dimensional two-sample mean problems, which receive considerable attention in recent literature. To utilize the correlation information among variables for enhancing the power of two-sample mean tests, we consider the setting in which the precision matrix of high-dimensional data possesses a linear structure. Thus, we first propose a new precision matrix estimation procedure with considering its linear structure, and further develop regularization methods to select the true basis matrices and remove irrelevant basis matrices. With the aid of an estimated precision matrix, we propose a new test statistic for the two-sample mean problems by replacing the inverse of sample covariance matrix in the Hotelling test by the estimated precision matrix. The proposed test is applicable for both the low-dimensional setting and high-dimensional setting even if the dimension of the data exceeds the sample size. The limiting null distributions of the proposed test statistic under both null and alternative hypotheses are derived. We further derive the asymptotical power function of the proposed test and compare its asymptotic power with some existing tests. We find the estimation error of the precision matrix does not have impact on the asymptotical power function. Moreover, asymptotic relative efficiency of the proposed test to the classical Hotelling test tends to infinity when the ratio of the dimension of data to the sample size tends to 1. We conduct a Monte Carlo simulation study to assess the finite sample performance of the proposed precision matrix estimation procedure and the proposed high-dimensional two-sample mean test. Our numerical results imply that the proposed regularization method is able to effectively remove irrelevant basis matrices. The proposed test performs well compared with the existing methods especially when the elements of the vector have unequal variances. We also illustrate the proposed methodology by an empirical analysis of a real-world data set.
Funding Statement
Yang’s research was supported by the National Key R&D Program of China 2023YFA1008700 and the National Natural Foundation of China (NSFC) 12301389.
Zheng’s research was supported by NSFC 12231011, 12326606 and 12071066.
Li’s research was supported by a National Institute of Health grant R01AI170249.
Acknowledgments
The authors thank the Editor, the Associated Editor and the referees for their constructive comments and suggestions that led to a significant improvement of this article. All correspondence should be addressed to Shurong Zheng (the corresponding author) at zhengsr@nenu.edu.cn.
Citation
Songshan Yang. Shurong Zheng. Runze Li. "A new test for high-dimensional two-sample mean problems with consideration of correlation structure." Ann. Statist. 52 (5) 2217 - 2240, October 2024. https://doi.org/10.1214/24-AOS2433
Information