## Bayesian Analysis

- Bayesian Anal.
- Volume 1, Number 1 (2006), 41-92.

### Explaining species distribution patterns through hierarchical modeling

Alan E. Gelfand, Mark Holder, Andrew Latimer, Paul O. Lewis, Anthony G. Rebelo, John A. Silander, and Shanshan Wu

#### Abstract

Understanding spatial patterns of species diversity and the distributions of individual species is a consuming problem in biogeography and conservation. The Cape Floristic Region (CFR) of South Africa is a global hotspot of diversity and endemism, and the Protea Atlas Project, with some 60,000 site records across the region, provides an extraordinarily rich data set for analyzing biodiversity patterns. Analysis for the region is developed at the spatial scale of one-minute grid cells (approximately 37,000 cells in total for the region). We report on results for 40 species of the flowering plant family Proteaceae (of about 330 in the CFR) for a defined subregion.

Using a Bayesian framework, we develop a two-stage, spatially explicit, hierarchical logistic regression. Stage one models the *suitability*, or potential presence, for each species at each cell, given species attributes along with grid cell (site-level) climate, precipitation, topography and geology data, using species-level coefficients and a spatial random effect. The second level of the hierarchy models, for each species, observed presence/absence at a sampling site through a conditional specification of the probability of presence at an arbitrary location in the grid cell given that the location is suitable. Because the atlas data are not evenly distributed across the landscape, grid cells contain variable numbers of sampling localities. Indeed, some grid cells are entirely unsampled; others have been transformed by human intervention (agriculture, urbanization) such that none of the species are present, though some may have the potential to be present in the absence of disturbance. Thus the modeling takes the sampling intensity at each site into account by assuming that the total number of times that a particular species was observed within a site follows a binomial distribution.
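The two-stage structure described above can be illustrated with a small simulation. This is only a minimal sketch under simplifying assumptions, not the specification used in the article: the function name `simulate_cell`, the parameter names, and the single conditional presence probability `presence_given_suitable` are all hypothetical, and the spatial random effect is passed in as a fixed number rather than drawn from a spatial prior.

```python
import math
import random


def inv_logit(x):
    """Map a value on the logit scale to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def simulate_cell(covariates, beta, spatial_effect,
                  presence_given_suitable, n_sites, rng):
    """Simulate one species in one grid cell (illustrative only).

    Stage one: suitability of the cell from a logistic regression on
    cell-level covariates plus a spatial random effect.
    Stage two: given suitability, the number of sampled sites at which
    the species is recorded is binomial with n_sites trials.
    """
    # Stage one: linear predictor and probability that the cell is suitable.
    eta = sum(b * x for b, x in zip(beta, covariates)) + spatial_effect
    p_suitable = inv_logit(eta)
    suitable = rng.random() < p_suitable

    # Stage two: an unsuitable cell yields no presences; a suitable cell
    # yields a Binomial(n_sites, presence_given_suitable) count.
    success_prob = presence_given_suitable if suitable else 0.0
    count = sum(rng.random() < success_prob for _ in range(n_sites))
    return p_suitable, suitable, count


rng = random.Random(0)
p, suitable, count = simulate_cell(
    covariates=[1.0, 0.5],        # hypothetical intercept and one covariate
    beta=[0.2, -0.4],             # hypothetical species-level coefficients
    spatial_effect=0.0,
    presence_given_suitable=0.6,
    n_sites=5,
    rng=rng,
)
```

In this sketch an unsampled cell corresponds to `n_sites=0`, which always gives a count of zero while still producing a suitability probability from the first stage.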

In fact, a range of models can be examined incorporating different first and second stage specifications. This necessitates model comparison in a misaligned multilevel setting. All models are fitted using MCMC methods. A "best" model is selected. Parameter summaries offer considerable insight. In addition, results are mapped as the model-estimated potential presence for each species across the domain. This probability surface provides an alternative to customary empirical "range of occupancy" displays. Summing yields the predicted species richness over the region. Summaries of the posterior for each environmental coefficient show which variables are most important in explaining species presence. Other biodiversity measures emerge as model unknowns. A considerable range of inference is available. We illustrate with only a portion of the analyses we have conducted, noting that these initial results describe biogeographical patterns over the modeled region remarkably well.
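The step from per-species probability surfaces to predicted richness can be sketched as follows. The function `predicted_richness` and the example probabilities are hypothetical and not taken from the article; the sketch relies only on the fact that the expected number of species present in a cell equals the sum of the per-species presence probabilities for that cell.

```python
def predicted_richness(presence_prob):
    """Sum per-species presence probabilities cell by cell.

    presence_prob[k][i] is the (assumed) estimated probability that
    species k is potentially present in grid cell i. The result is the
    expected number of species in each cell.
    """
    n_cells = len(presence_prob[0])
    return [sum(species[i] for species in presence_prob)
            for i in range(n_cells)]


# Made-up probabilities for three species over two cells.
probs = [
    [0.9, 0.1],
    [0.5, 0.5],
    [0.2, 0.0],
]
richness = predicted_richness(probs)
```

In a full Bayesian analysis the same sum could be computed for each posterior draw, which would give a posterior distribution for richness in every cell rather than a single number.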

#### Article information

**Source**

Bayesian Anal. Volume 1, Number 1 (2006), 41-92.

**Dates**

First available in Project Euclid: 22 June 2012

**Permanent link to this document**

http://projecteuclid.org/euclid.ba/1340371072

**Digital Object Identifier**

doi:10.1214/06-BA102

**Mathematical Reviews number (MathSciNet)**

MR2227362

#### Citation

Gelfand, Alan E.; Silander, John A.; Wu, Shanshan; Latimer, Andrew; Lewis, Paul O.; Rebelo, Anthony G.; Holder, Mark. Explaining species distribution patterns through hierarchical modeling. Bayesian Analysis 1 (2006), no. 1, 41--92. doi:10.1214/06-BA102. http://projecteuclid.org/euclid.ba/1340371072.

#### See also

- Related item: Jennifer A. Hoeting. Some perspectives on modeling species distributions (comment on article by Gelfand et al.). Bayesian Anal., Vol. 1, Iss. 1 (2006), 93-97. Project Euclid: euclid.ba/1340371073
- Related item: Jay M. Ver Hoef. Comment on article by Gelfand et al. Bayesian Anal., Vol. 1, Iss. 1 (2006), 99-101. Project Euclid: euclid.ba/1340371074
- Related item: Alan E. Gelfand, John A. Silander, Shanshan Wu, Andrew Latimer, Paul O. Lewis, Anthony G. Rebelo, Mark Holder. Rejoinder. Bayesian Anal., Vol. 1, Iss. 1 (2006), 103-104. Project Euclid: euclid.ba/1340371075