Multivariate data analysis: The French way
Susan Holmes
Abstract
This paper presents exploratory techniques for multivariate data, many of them well known to French statisticians and ecologists, but few well understood in North American culture. We present the general framework of duality diagrams which encompasses discriminant analysis, correspondence analysis and principal components, and we show how this framework can be generalized to the regression of graphs on covariates.
First Page:
Show
Hide
Primary Subjects: 62H25, 62H20
Keywords: bootstrap; correspondence analysis; duality diagram; RV-coefficient; STATIS
Full-text: Open access
Links and Identifiers
Permanent link to this document: http://projecteuclid.org/euclid.imsc/1207580085
Digital Object Identifier: doi:10.1214/193940307000000455
References
[1] Cailliez, F. and Pagès, J. (1976). Introduction à l’analyse des données. SMASH, Paris.
[2] Charnomordic, B. and Holmes, S. (2001). Correspondence analysis for microarrays. Statist. Graph. Comput. Newlsetter 12 19–25.
[3] Chessel, D., Dufour, A. B. and Thioulouse, J. (2004). The ade4 package – I: One-table methods. R News 4 5–10.
[4] Cox, D. R. and Brandwood, L. (1959). On a discriminatory problem connected with the works of Plato. J. Roy. Statist. Soc. Ser. B 21 195–200.
Mathematical Reviews (MathSciNet):
MR109102
JSTOR: links.jstor.org
[5] Diaconis, P. and Freedman, D. (1984). Asymptotics of graphical projection pursuit. Ann. Statist. 12 793–815.
Mathematical Reviews (MathSciNet):
MR751274
Zentralblatt MATH:
0559.62002
Digital Object Identifier: doi:10.1214/aos/1176346703
Project Euclid: euclid.aos/1176346703
[6] Diaconis, P. and Salzmann, J. (2007). Projection pursuit for discrete data. In Probability and Statistics: Essays in Honor of David A. Freedman (D. Nolan and T. Speed, eds.) 265–288. IMS, Hayward, CA.
Mathematical Reviews (MathSciNet):
MR2459955
Zentralblatt MATH:
1166.62048
Digital Object Identifier: doi:10.1214/193940307000000482
[7] Escoffier, B. and Pagès, J. (1998). Analyse factorielles simples et multiples: Objectifs, méthodes et interprétation. Dunod, Paris.
[8] Escoufier, Y. (1977). Operators related to a data matrix. In Recent Developments in Statistics (J. Barra, F. Brodeau, G. Romier and B. van Cutsem, eds.) 125–131. Amsterdam. North Holland.
Mathematical Reviews (MathSciNet):
MR461791
Zentralblatt MATH:
0367.62078
[9] Escoufier, Y. (1979). Cours d’analyse des données. Cours Polycopié 7901, IUT, Montpellier.
[10] Escoufier, Y. (1987). The duality diagram: A means of better practical applications. In Developments in Numerical Ecology (P. Legendre and L. Legendre, eds.) 139–156. Springer, Berlin.
Mathematical Reviews (MathSciNet):
MR913539
[11] Fouss, F., Renders, J.-M. and Saerens, M. (2004). Some relationships between Kleinberg’s hubs and authorities, correspondence analysis, and the SALSA algorithm. In JADT 2004, International Conference on the Statistical Analysis of Textual Data 445–455. Louvain-la-Neuve.
[12] Freedman, D. and Peters, S. (1984a). Bootstrapping a regression equation: Some empirical results. J. Amer. Statist. Assoc. 79 97–106.
Mathematical Reviews (MathSciNet):
MR742858
Digital Object Identifier: doi:10.2307/2288341
JSTOR: links.jstor.org
[13] Freedman, D. and Peters, S. (1984b). Bootstrapping an econometric model: Some empirical results. J. Business Econom. Statist. 2 150–158.
[14] Geary, R. (1954). The contiguity ratio and statistical mapping. The Incorporated Statistician 5 115–145.
[15] Glaçon, F. (1981). Analyse conjointe de plusieurs matrices de données. Comparaison de différentes méthodes. Ph.D. thesis, Scientific and Medicale Univ. Grenoble, Grenoble.
[16] Hill, M. and Gauch, H. (1980). Detrended correspondence analysis, an improved ordination technique. Vegetatio 42 47–58.
[17] Holmes, S. (1985). Outils Informatiques pour l’Evaluation de la Pertinence d’un Resultat en Analyse des Données. Ph.D. thesis, USTL, Montpellier.
[18] Hotelling, H. (1936). Relations between two sets of variables. Biometrika 28 321–377.
[19] Kleinberg, J. M. (1999). Hubs, authorities, and communities. ACM Comput. Surv. 31 Article 5.
[20] Lebart, L. (1979). Traitement des Données Statistiques. Dunod, Paris.
[21] Lebart, L., Morineau, A. and Warwick, K. M. (1984). Multivariate Descriptive Statistical Analysis. Wiley, New York.
Mathematical Reviews (MathSciNet):
MR744990
Zentralblatt MATH:
0658.62069
[22] Lebart, L., Piron, M. and Morineau, A. (2000). Statistique exploratoire multidimensionnelle. Dunod, Paris.
[23] Mom, A. (1988). Méthodologie statistique de la classification des réseaux de transports. Ph.D. thesis, USTL, Montpellier.
[24] Purdom, E. (2006). Comparative multivariate methods. Ph.D. thesis, Stanford Univ., Stanford, CA.
[25] Rao, C. R. (1964). The use and interpretation of principal component analysis in applied research. Sankhyā Ser. A 26 329–359.
Mathematical Reviews (MathSciNet):
MR184375
Institute of Mathematical Statistics Collections