Compositional Data Analysis in Population Studies

Christopher D. Lloyd, Vera Pawlowsky-Glahn, Juan José Egozcue

Research output: Contribution to journalArticle

19 Citations (Scopus)

Abstract

Many analyses of population data are based on percentages or proportions. Such data are referred to as compositional and complete compositions typically sum to 100 (if expressed as percentages) or one (if expressed as proportions). In geography, the vast majority of analyses of such data make use of standard statistical approaches. There are many warnings, however, both in the geographical literature and elsewhere, that such standard approaches should not be employed directly in the analysis of such data. Aitchison (1986) proposed a methodology for the analysis of compositional data based on taking log-ratios and then applying adapted standard approaches to the transformed data. This article highlights some problems associated with using standard approaches in the analysis of compositional data in population studies and outlines some key log-ratio-based approaches for transformation and the proper analysis of compositional data. A case study is used to demonstrate the application.

Original languageEnglish
Pages (from-to)1251-1266
Number of pages16
JournalAnnals of the Association of American Geographers
Volume102
Issue number6
DOIs
Publication statusPublished - 01 Nov 2012

Keywords

  • census data
  • log-ratios
  • percentages
  • proportions

ASJC Scopus subject areas

  • Geography, Planning and Development
  • Earth-Surface Processes

Fingerprint Dive into the research topics of 'Compositional Data Analysis in Population Studies'. Together they form a unique fingerprint.

  • Cite this