Abstract
Introduction
Recent years have seen an explosive proliferation of DNA sequence data for molecular phylogenetic analyses and a commensurate increase in the use of these data to draw inferences about macroevolutionary processes. A particularly active area of research involves the use of molecular phylogenies to study variation in rates of species origination and extinction, both among lineages (Slowinski and Guyer, 1989; Mooers and Heard, 1997) and over time (Nee et al. 1994; Paradis, 1997).
Likelihood Analysis of Speciation and Extinction Rates (LASER) is a package for the R programming environment that facilitates model-based analyses of diversification rates. LASER is the first software package to implement tests for temporal variation in diversification rates using likelihood methods based on the birth-death process (Nee et al. 1994). LASER is licensed under the GNU General Public License and complements the existing R libraries ‘ape’ (Paradis et al. 2004) and ‘apTreeshape’ (Bortolussi et al. 2006), which provide functions for phylogenetic tree manipulation and the analysis of among-lineage heterogeneity in diversification rates.
LASER was written to address several limitations of existing software for analyzing the tempo of diversification. Approaches such as the gamma statistic (Pybus and Harvey, 2000) and survival analysis (Paradis, 1997), which are implemented in the R library ‘ape’ (Paradis et al. 2004), test for departures from the pure-birth model of cladogenesis, and can only be used to infer temporal decreases in diversification rates (Nee, 2001; Rabosky, 2006). These methods are thus unable to address many questions of interest to evolutionary biologists, such as whether temperate faunas experienced elevated speciation rates during the Pleistocene (Weir and Schluter, 2004). Furthermore, existing methods suffer reduced power to detect temporal decreases in diversification rates when clades have diversified under high background extinction rates (Rabosky 2006).
LASER fits a candidate set of rate-variable diversification models to phylogenetic data and contrasts the likelihood of the data under these models to alternatives where speciation and extinction rates have remained constant over time. The null hypothesis that diversification rates have not changed over time is tested using the statistical approach described in Rabosky (2006). The test statistic for constancy of diversification rates is computed as
where AICRC is the Akaike Information Criterion (AIC) score for the best-fit rate-constant model of diversification, and AICRV is the AIC score for the best-fit rate-variable model under consideration. Thus, a positive ΔAICRC value suggests that the data are best approximated by a rate-variable model of diversification. Although several previous studies have used the AIC to distinguish among rate-constant and rate-variable models of diversification (Barraclough and Vogler, 2002; Turgeon et al. 2005), Rabosky (2006) found that this approach results in high Type I error rates unless critical values of the ΔAICRC distribution are explicitly addressed through simulation.
The LASER package provides a comprehensive toolkit for computing ΔAICRC for test phylogenies and for comparing the observed ΔAICRC statistic to its distribution under the null hypothesis. This is the first available approach that can detect temporal increases in diversification rates, and extensive simulation has shown that the method has greater power than other methods to detect temporal declines in diversification rates when clades have diversified under elevated background extinction rates (Rabosky, 2006).
Additional strengths of the model-fitting approach implemented in the LASER include the ability to test hypotheses of rate variation while estimating relevant diversification parameters. Furthermore, the package can be used to test
Usage
LASER operates on sets of branching times derived from ultrametric phylogenetic trees, and provides functions for obtaining branching times from several input formats, including the widely used ‘Newick’ (parenthetic) tree format. Likelihoods and parameter estimates can be obtained for a range of rate-variable diversification models, including logistic and exponential density-depe-dent models and multi-rate birth-death models. Additional functions permit batch processing of multiple phylogenies to obtain the null distribution of ΔAICRC or posterior distributions of diversification parameters.
The function fitdAICrc computes the ΔAICRC test statistic for a test phylogeny using arguments that specify the candidate set of rate-variable models to be considered. The null distribution of the test statistic is obtained by either simulating branching times with the function yuleSim or by importing simulated trees using the function getBtimes.batch. The latter function is particularly useful for the analysis of phylogenies with incomplete taxon sampling, because incomplete sampling can result in a spurious decline in diversification rates over time (Pybus and Harvey, 2000). To address this problem in LASER, one can simply generate rate-constant phylogenies with incomplete sampling using PhyloGen (Rambaut, 2002) or other software and import the trees into LASER to tabulate the null distribution of the ΔAICRC test statistic.
A call to the function fitdAICrc.batch will then generate the null distribution of the test statistic and return the probability of the observed ΔAICRC index under the null hypothesis. Functions are available to call any rate-variable and rate-constant diversification models individually, and additional functions permit exploration of diversification patterns for any user-defined temporal interval.
Diversification Models
Rate-constant diversification models implemented in LASER include the pure birth model, with a constant speciation rate λ > 0, and the birth-death model, with λ > 0 and extinction rate μ ≥ 0. Seven rate-variable diversification models are provided, including density-dependent and multi-rate variants of the pure birth and birth-death models. The package includes both logistic and exponential density-dependent speciation models. Under the logistic density-dependent model of cladogenesis, the speciation rate γ at time
where γ0 is the initial speciation rate,
where
Multi-rate variants of the pure birth and birth-death model assume the existence of one or more breakpoints in time, such that a clade has diversified under one set of diversification parameters before the breakpoint and another set of parameters after the breakpoint. For example, LASER includes a two-rate pure birth model with three parameters: the initial speciation rate, the final speciation rate, and the time of the rate shift.
Example: Holarctic Damselfly Radiation
Turgeon et al. (2005), tested whether Holarctic damselflies in the genus
The first line creates a vector data.bt of the branching times for
The first line simulates 5000 phylogenies with the same number of tips as the Distribution of the ΔAICRC test statistic for 5000 rate-constant phylogenies of the same size as 
Conclusion
LASER fits multiple rate-variable and rate-constant models of diversification to reconstructed phylogenies using maximum likelihood. Its main strength includes the use of Monte Carlo simulation to control for elevated Type I error rates associated with likelihood-based analyses of diversification. LASER is the first available package that can detect temporal increases in diversification rates, and has considerable power to detect temporal declines in diversification rates when clades have diversified under high background extinction rates. As a freely available package for the R programming environment, it is flexible and platform-independent, and can easily be tailored to a variety of user-specific applications.
