Institute of Mathematical Statistics Collections

Multivariate data analysis: The French way

Susan Holmes

Full-text: Open access


This paper presents exploratory techniques for multivariate data, many of them well known to French statisticians and ecologists, but few well understood in North American culture. We present the general framework of duality diagrams which encompasses discriminant analysis, correspondence analysis and principal components, and we show how this framework can be generalized to the regression of graphs on covariates.

Chapter information

Deborah Nolan and Terry Speed, eds., Probability and Statistics: Essays in Honor of David A. Freedman (Beachwood, Ohio, USA: Institute of Mathematical Statistics, 2008), 219-233

First available in Project Euclid: 7 April 2008

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Zentralblatt MATH identifier

Primary: 62H25: Factor analysis and principal components; correspondence analysis 62H20: Measures of association (correlation, canonical correlation, etc.)

bootstrap correspondence analysis duality diagram RV-coefficient STATIS

Copyright © 2008, Institute of Mathematical Statistics


Holmes, Susan. Multivariate data analysis: The French way. Probability and Statistics: Essays in Honor of David A. Freedman, 219--233, Institute of Mathematical Statistics, Beachwood, Ohio, USA, 2008. doi:10.1214/193940307000000455.

Export citation


  • [1] Cailliez, F. and Pagès, J. (1976). Introduction à l’analyse des données. SMASH, Paris.
  • [2] Charnomordic, B. and Holmes, S. (2001). Correspondence analysis for microarrays. Statist. Graph. Comput. Newlsetter 12 19–25.
  • [3] Chessel, D., Dufour, A. B. and Thioulouse, J. (2004). The ade4 package – I: One-table methods. R News 4 5–10.
  • [4] Cox, D. R. and Brandwood, L. (1959). On a discriminatory problem connected with the works of Plato. J. Roy. Statist. Soc. Ser. B 21 195–200.
  • [5] Diaconis, P. and Freedman, D. (1984). Asymptotics of graphical projection pursuit. Ann. Statist. 12 793–815.
  • [6] Diaconis, P. and Salzmann, J. (2007). Projection pursuit for discrete data. In Probability and Statistics: Essays in Honor of David A. Freedman (D. Nolan and T. Speed, eds.) 265–288. IMS, Hayward, CA.
  • [7] Escoffier, B. and Pagès, J. (1998). Analyse factorielles simples et multiples: Objectifs, méthodes et interprétation. Dunod, Paris.
  • [8] Escoufier, Y. (1977). Operators related to a data matrix. In Recent Developments in Statistics (J. Barra, F. Brodeau, G. Romier and B. van Cutsem, eds.) 125–131. Amsterdam. North Holland.
  • [9] Escoufier, Y. (1979). Cours d’analyse des données. Cours Polycopié 7901, IUT, Montpellier.
  • [10] Escoufier, Y. (1987). The duality diagram: A means of better practical applications. In Developments in Numerical Ecology (P. Legendre and L. Legendre, eds.) 139–156. Springer, Berlin.
  • [11] Fouss, F., Renders, J.-M. and Saerens, M. (2004). Some relationships between Kleinberg’s hubs and authorities, correspondence analysis, and the SALSA algorithm. In JADT 2004, International Conference on the Statistical Analysis of Textual Data 445–455. Louvain-la-Neuve.
  • [12] Freedman, D. and Peters, S. (1984a). Bootstrapping a regression equation: Some empirical results. J. Amer. Statist. Assoc. 79 97–106.
  • [13] Freedman, D. and Peters, S. (1984b). Bootstrapping an econometric model: Some empirical results. J. Business Econom. Statist. 2 150–158.
  • [14] Geary, R. (1954). The contiguity ratio and statistical mapping. The Incorporated Statistician 5 115–145.
  • [15] Glaçon, F. (1981). Analyse conjointe de plusieurs matrices de données. Comparaison de différentes méthodes. Ph.D. thesis, Scientific and Medicale Univ. Grenoble, Grenoble.
  • [16] Hill, M. and Gauch, H. (1980). Detrended correspondence analysis, an improved ordination technique. Vegetatio 42 47–58.
  • [17] Holmes, S. (1985). Outils Informatiques pour l’Evaluation de la Pertinence d’un Resultat en Analyse des Données. Ph.D. thesis, USTL, Montpellier.
  • [18] Hotelling, H. (1936). Relations between two sets of variables. Biometrika 28 321–377.
  • [19] Kleinberg, J. M. (1999). Hubs, authorities, and communities. ACM Comput. Surv. 31 Article 5.
  • [20] Lebart, L. (1979). Traitement des Données Statistiques. Dunod, Paris.
  • [21] Lebart, L., Morineau, A. and Warwick, K. M. (1984). Multivariate Descriptive Statistical Analysis. Wiley, New York.
  • [22] Lebart, L., Piron, M. and Morineau, A. (2000). Statistique exploratoire multidimensionnelle. Dunod, Paris.
  • [23] Mom, A. (1988). Méthodologie statistique de la classification des réseaux de transports. Ph.D. thesis, USTL, Montpellier.
  • [24] Purdom, E. (2006). Comparative multivariate methods. Ph.D. thesis, Stanford Univ., Stanford, CA.
  • [25] Rao, C. R. (1964). The use and interpretation of principal component analysis in applied research. Sankhyā Ser. A 26 329–359.