Bridging data silos to holistically model plant macrophenology

This is a Preprint and has not been peer reviewed. This is version 2 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Lizbeth G Amador , Tadeo Ramirez-Parada, Isaac W Park, Susan J. Mazer, Aaron M Ellison, Margaret O'Brien, Eric R Sokol, Colin A Smith, Charles Davis , Sydne Record

Abstract

● Phenological shifts to global climate change impact ecosystem functions. There are various data sources from which spatiotemporal, and taxonomic phenological data may be obtained: mobilized herbaria, community-science initiatives, observatory networks, and remote-sensing. However, analyses conducted to date have generally relied on single sources of data, thus treating alternative data sources as isolated silos. ● Siloed treatment of data in analyses may be due to the lack of harmonization across different data sources, that offer partially non-overlapping information and are often complementary. Such treatment precludes a deeper understanding of phenological responses at macroecological scales. Here, we describe data harmonization as the direct integration of disparate sources of phenological data using a common schema. ● We highlight existing methods for data harmonization that can be applied to phenological data: data-design patterns, metadata standards, and ontologies. We describe how harmonized data from multiple sources can be integrated into analyses using existing methods and discuss the use of automated extraction techniques. ● Data harmonization is not a new concept in ecology but the harmonization of phenological data is long overdue. We aim to highlight the need for better data harmonization providing a roadmap for how harmonized phenological data may fill data gaps while simultaneously integrated into analyses

DOI

https://doi.org/10.32942/X2TS68

Subjects

Ecology and Evolutionary Biology, Life Sciences, Plant Sciences

Keywords

Data harmonization, data management, Ontologies, Scales, SDMs

Dates

Published: 2025-01-29 19:30

Last Updated: 2025-01-30 12:34

Older Versions
License

No Creative Commons license

Additional Metadata

Language:
English

Conflict of interest statement:
L. G. A. and S. R. are in a working group with Daijiang Li, Kai Zhu, and Tong Qui who may appear as 419 potential reviewers. The authors have no other conflicts of interest to disclose.

Data and Code Availability Statement:
The data used to create graphs from Box 1 are openly available in Environmental Data Initiative (EDI) at http://doi.org/[doi in progress], reference number [reference number in progress]. Additionally, the data derived in this article are available from USA-National Phenology Network at http://doi.org/10.5066/F78S4N1V, National Ecological Observatory Network at https://www.neonscience.org/data, Dryad at https://datadryad.org/stash, and EDI at https://edirepository.org/. These data were derived from the following resources available in the public domain: Switzer J, Chamberlain S, Marsh L, Wong K (2024). _rnpn: Interface to the National 'Phenology' Network 'API'_. R package version 1.2.8.0, . NEON (National Ecological Observatory Network). 2024. Plant phenology observations, DP1.10055.001 (RELEASE-2023). 2013-2021 for Region: Contiguous United States. Dataset accessed May 2023 via the USA National Phenology Network at http://doi.org/10.5066/F78S4N1V. Park, Isaac et al. (2023). Herbarium-Derived Phenological Data in North America [Dataset]. Dryad. https://doi.org/10.25349/D9WP6S. Accessed Feb 2023 Park, D., A. Williams, E. Law, A. Ellison, and C. Davis. 2023. Assessing Plant Phenological Patterns in the Eastern United States Over the Last 120 Years ver 5. Environmental Data Initiative. https://doi.org/10.6073/pasta/bfb70a1701ef23f686fcc73840e6ae17 (Accessed 2023- 08).