Skip to content

Latest commit

 

History

History
152 lines (108 loc) · 15.9 KB

Paleontology.md

File metadata and controls

152 lines (108 loc) · 15.9 KB
name topic maintainer email version source
Paleontology
Paleontology
William Gearty, Lewis A. Jones, Erin Dillon, Pedro Godoy, Harriet Drage, Christopher Dean, Bruna Farina
willgearty@gmail.com
2024-10-04

Overview

Computational Paleontology is an emerging field. Paleontologists are increasingly turning to a wide array of complex computational analyses to address various research questions and test hypotheses. Until recently, paleontologists have mostly leveraged resources designed for evolutionary biologists, ecologists, geographers, and data scientists to accomplish such analyses. However, R resources are now being developed that cater to paleontological tasks and datasets.

We have assembled this task view to bring together R packages that are specifically geared towards acquiring, cleaning, visualizing, and/or analyzing various kinds of paleontological and paleontology-adjacent data. We use this venue to showcase the wide variety of R packages available across the paleosciences and to provide a brief overview of each package for a broad audience of R users.

If you have any questions, feel free to reach out to the task view maintainers or the maintainers of specific packages. Questions may also be directed to the palaeoverse google group or the paleonet mailing list.

Scope

Packages within the task view fall within one or more of the following broad categories:

  1. Wrangling paleontological data: packages dedicated to the acquisition, cleaning, manipulation, and/or visualization of paleontological data
  2. Paleoecology and morphological evolution: packages that are useful for performing paleoecological and morphological analyses
  3. Paleobiogeography and biodiversity: packages that are useful for performing paleobiogeographical and/or paleobiodiversity analyses
  4. Phylogenetics: packages that are useful for performing phylogenetic analyses that include paleontological data
  5. Time series analysis: packages that are useful for performing time series analyses of paleontological data
  6. Stratigraphy and sedimentology: packages that are useful for acquiring, analyzing, and visualizing stratigraphic or sedimentological data
  7. Paleoclimate: packages that are useful for acquiring and analyzing paleoclimatic data

Wrangling paleontological data

Acquiring paleontological data

  • r pkg("paleobioDB") has functions to query, download, process, and visualize occurence and taxonomic data from the Paleobiology Database (PBDB).
  • r pkg("neotoma2") can query, download, and manipulate data from the Neotoma Paleoecology Database, which specializes in fossil data holdings at timescales covering the last several decades to the last several million years.
  • r pkg("rgbif") can query and download biological and paleontological occurrence data from the Global Biodiversity Information Facility (GBIF).
  • r pkg("ridigbio") can query and download biological and paleontological specimen record data from iDigBio.
  • r pkg("sepkoski") contains data on the stratigraphic ranges of fossil marine animal genera from Sepkoski's (2002) published compendium.
  • r pkg("folio") contains datasets for teaching quantitative approaches and modeling in archaeology and paleontology.
  • r pkg("chronosphere") can download time-stamped versions of various paleontological, paleoenvironmental, and paleoecological databases, including BioDeepTime (Smith et al. 2023), Triton (Fenton et al. 2021), the Paleobiology Database, and the Ancient Reef Traits Database.

Cleaning and/or manipulating paleontological data

  • r pkg("palaeoverse") has functionality to support data preparation and exploration for paleobiological analyses with a focus on improving code flow, reproducibility, and accessibility.
  • r pkg("CoordinateCleaner") can perform automated flagging of common spatial and temporal errors in biological and paleontological collection data.
  • r pkg("fossilbrush") can perform automated detection and resolution of taxonomic and stratigraphic errors in fossil occurrence datasets.
  • r pkg("rgplates") can query the GPlates desktop application and web service API to reconstruct past positions of geographic entities (e.g., plates, coastlines, and coordinates) based on user-selected rotation models. The palaeorotate function in r pkg("palaeoverse") can be used to query the GPlates API for fossil occurrences across multiple time intervals.

Visualizing paleontological data

  • r pkg("deeptime") extends the functionality of the ggplot2 package to help facilitate the plotting of data over long time intervals. Several functions are available to add highly customizable timescales to a variety of types of visualizations.
  • r pkg("palaeoverse") has functions for visualizing occurrence data through time and across space in base R. The axis_geo function can be used to add a timescale to a base R plot.
  • r pkg("GEOmap") includes a set of routines for making map projections, topographic maps, perspective plots, and geological maps.
  • r pkg("rphylopic") can query and fetch silhouettes of extant and extinct organisms from the PhyloPic database.

Paleoecology and morphological evolution

Paleoenvironmental reconstruction

  • r pkg("rioja") implements a number of numerical methods for inferring the value of an environmental variable from a set of species abundances.
  • r pkg("analogue") has functions for the prediction of environmental data from species data by fitting Modern Analogue Technique and Weighted Averaging transfer function models.

Users may also find packages in the r view("Environmetrics") task view useful for analyzing ecological and environmental data.

Quantifying ecological and morphological evolution

  • r pkg("ecospace") implements Monte Carlo simulations of ecological diversification models, using a user-specified ecospace (trait space) framework.
  • r pkg("fossil") has functions for estimating shared species/beta diversity, species area curves, and geographic distances and areas.
  • r github("HOPE-UIB-BIO/R-Fossilpol-package") has functions for processing and standardizing global paleoecological pollen data.
  • r pkg("Morphoscape") implements adaptive landscape methods (first described by Polly et al. 2016) for the integration, analysis and visualization of biological trait data on a phenotypic morphospace.
  • r pkg("RRphylo") can be used to estimate variation and shift in the rate of phenotypic evolution with fossil data using phylogenetic ridge regression.
  • r pkg("paleoTS"), r pkg("evoTS"), and r github("klvoje/adePEM") have functions for fitting evolutionary models to morphological time series (see Time series analysis for more information).

Also see the r view("Phylogenetics") task view for details about studying discrete and continuous morphological evolution in a phylogenetic context.

Paleobiogeography and biodiversity

  • r github("josteist/Compadre") can be used to estimate rates of speciation/origination, extinction, and sampling using Bayesian capture-mark-recapture techniques.
  • r pkg("divDyn") has functions to describe the sampling and diversity dynamics of fossil occurrence datasets.
  • r pkg("fossil") has functions for estimating species richness (Chao 1 and 2, ACE, ICE, Jacknife) and shared species/beta diversity.
  • r pkg("divvy") has functions to conduct spatial subsampling for biogeography and biodiversity studies and calculate common biodiversity and range-size metrics.
  • r github("Liudas-Dau/hespdiv") has functions to conduct hierarchical spatial sampling and perform analysis and visualization of these samples.
  • r pkg("ppgm") can be used to conduct paleophylogeographic modeling of climate niches and species distributions.
  • r pkg("CoordinateCleaner") can be used to generate inputs for PyRate, a program in Python that can estimate speciation and extinction rates from incomplete fossil data (Silvestro et al. 2014).

Users may also find packages in the r view("Spatial") task view useful for analyzing paleobiogeography.

Phylogenetics

  • r pkg("paleotree") provides tools for transforming, a posteriori time-scaling, and modifying phylogenies containing extinct lineages.
  • r pkg("FossilSim") can be used to simulate fossil occurrence data on phylogenetic trees under mechanistic models of speciation, fossil preservation, and fossil recovery. Quick simulations can be conducted in a graphical user interface with r pkg("FossilSimShiny").
  • r pkg("paleobuddy") can be used to simulate phylogenetic trees and fossil records with custom speciation, extinction, and fossil sampling rates.
  • r github("rachelwarnock/fbdR") has functions for estimating speciation and extinction rates from phylogenetic trees and fossil occurrence data.
  • r github("evolucionario/cladedate") has functions to use a MonteCarlo approach to generate empirical calibration information from the fossil record.
  • r pkg("RRphylo") can be used to estimate variation and shift in the rate of phenotypic evolution with fossil data using phylogenetic ridge regression.
  • r pkg("strap") has functions for the stratigraphic analysis of phylogenetic trees.

Also see the r view("Phylogenetics") task view for broader details about conducting various analyses in a phylogenetic context.

Time series analysis

  • r pkg("paleoTS") facilitates the analysis of paleontological temporal sequences of trait values by fitting evolutionary models using maximum likelihood.
  • r pkg("evoTS") facilitates univariate and multivariate analysis of evolutionary sequences of phenotypic change over time. The package extends the modeling framework available in r pkg("paleoTS").
  • r github("klvoje/adePEM") has functions for assessing the adequacy of models of phenotypic change within lineages, like those fit by r pkg("paleoTS") and r pkg("evoTS").
  • r pkg("StratPal") can be used to simulate biological processes in the time domain (e.g., trait evolution, fossil abundance), and examine how their expression in the rock record (stratigraphic domain) is influenced based on age-depth models, ecological niche models, and taphonomic effects.
  • r pkg("astrochron") can conduct routines for astrochronologic testing, astronomical time scale construction, and time series analysis. Also included are a range of statistical analysis and modeling routines that are relevant to time scale development and paleoclimate analysis.
  • r github("HOPE-UIB-BIO/R-Ratepol-package") has functions for estimating rate of change (RoC) from time series of community data.

Also see the r view("TimeSeries") task view for broader details about conducting time series analyses.

Stratigraphy and sedimentology

Acquiring stratigraphic and sedimentological data

  • r pkg("rmacrostrat") can fetch geological data from Macrostrat relevant to the spatial and temporal distribution of sedimentary, igneous, and metamorphic rocks as well as data extracted from them.

Analyzing stratigraphic and sedimentological data

  • r pkg("admtools") can be used to estimate age-depth models from stratigraphic and sedimentological data.
  • r pkg("clam") can be used to perform 'classical' age-depth modelling of dated sediment deposits.
  • r pkg("rbacon") can be used to perform age-depth modelling using Bayesian statistics to reconstruct accumulation histories for deposits, through combining radiocarbon and other dates with prior information on accumulation rates and their variability.
  • r pkg("Bchron") has functions for quick calibration of radiocarbon dates under various calibration curves and can be used to perform Bayesian age-depth modelling.
  • r github("nickmckay/GeoChronR") has functions to model, analyze, and visualize age-uncertain data and offers access to commonly-used age-modelling tools such as r pkg("rbacon") and r pkg("BChron").
  • r pkg("isogeochem") can be used to quickly calculate isotope fractionation factors and apply paleothermometry equations.
  • r pkg("DAIME") can be used to model the effects of changing deposition rates on geological data and rates.

Visualizing stratigraphic and sedimentological data

  • r pkg("StratigrapheR") can be used to generate highly customizable lithologs for stratigraphic and sedimentological data from outcrop sections and borehole logs with R base graphics. r pkg("SDAR") can be used to make similar, albeit less customizable, graphic logs with grid graphics.
  • r pkg("tidypaleo") provides a set of functions with a common framework for age-depth model management, stratigraphic visualization, and common statistical transformations.
  • r pkg("provenance") provides a set of functions for the visual interpretation of large datasets in sedimentary geology, including adaptive kernel density estimation, principal component analysis, correspondence analysis, multidimensional scaling, generalised procrustes analysis and individual differences scaling

Paleoclimate

Acquiring and manipulating paleoclimate reconstruction data

  • r pkg("rpaleoclim") can fetch paleoclimate data from PaleoClim, high resolution paleoclimate surfaces covering the whole globe. This includes data on surface temperature, precipitation and the standard bioclimatic variables commonly used in ecological modelling, derived from the HadCM3 general circulation model and downscaled to a spatial resolution of up to 2.5 minutes.
  • r pkg("pastclim") has methods to easily access, manipulate, and use paleoclimate reconstructions.

Reconstructing and modelling paleoclimate

  • r github("mchevalier2/crestr") can be used to create probabilistic reconstructions of past climate change from fossil assemblage data.
  • r github("rsh249/cRacle") can be used to perform the Climate Reconstruction Analysis using Coexistence Likelihood Estimation (CRACLE) method to estimate climate and paleoclimate from vegetation using large repositories of biodiversity data.
  • r pkg("sedproxy") can be used to conduct forward modelling of sediment archived climate proxy records based on hypothesized "true" past climate (e.g., climate model output), sedimentation, and sampling.

References

  • Fenton, I.S., Woodhouse, A., Aze, T., Lazarus, D., Renaudie, J., Dunhill, A.M., Young, J.R. and Saupe, E.E., 2021. Triton, a new species-level database of Cenozoic planktonic foraminiferal occurrences. Scientific Data, 8(1), 160. r doi("10.1038/s41597-021-00942-7").
  • Polly, P. D., Stayton, C. T., Dumont, E. R., Pierce, S. E., Rayfield, E. J., & Angielczyk, K. D. 2016. Combining geometric morphometrics and finite element analysis with evolutionary modeling: towards a synthesis. Journal of Vertebrate Paleontology, 36(4). r doi("10.1080/02724634.2016.1111225").
  • Sepkoski, J. J. 2002. A compendium of fossil marine animal genera. Bulletins of American Paleontology, 363, 1–560. https://www.biodiversitylibrary.org/item/40634
  • Silvestro, D., Salamin, N. and Schnitzler, J., 2014. PyRate: a new program to estimate speciation and extinction rates from incomplete fossil data. Methods in Ecology and Evolution, 5(10), pp.1126-1131. r doi("10.1111/2041-210X.12263").
  • Smith, J., Rillo, M.C., Kocsis, Á.T., Dornelas, M., Fastovich, D., Huang, H.H.M., Jonkers, L., Kiessling, W., Li, Q., Liow, L.H. and Margulis-Ohnuma, M., 2023. BioDeepTime: A database of biodiversity time series for modern and fossil assemblages. Global Ecology and Biogeography, 32(10), 1680-1689. r doi("10.1111/geb.13735").