Skip to main content
Data aggregation blurs inferred temporal trends in bird sampling

Data aggregation blurs inferred temporal trends in bird sampling

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Supplementary Files

Authors

Martin Bulla , Peter Mikula

Abstract

Ellis-Soto et al.1 reported that disparity in bird-sampling density between U.S. neighbourhoods rated as risky versus safe for real estate investment (a practice known as “redlining”) increased by 35.6% between 2000 and 2020. We show that this reported trend arises from data aggregation and linear model misspecification. Using the original neighbourhood-level yearly data and mixed-effects models that account for spatial and temporal non-independence, we show that temporal disparities are strongly non-linear and exceed 200% across the study period, while absolute differences remain small (~zero for most of the time and 25 observations per km² per year at maximum). These non-linearities temporally coincide with major shifts in citizen-science participation, including smartphone adoption and COVID-19-related increases in urban greenspace use.

DOI

https://doi.org/10.32942/X2NS9V

Subjects

Social and Behavioral Sciences

Keywords

reproducibility, I4Replication, GBIF, iNaturalist, eBird, redligning, citizen science, bird sampling, aggregation bias, replication, redlining

Dates

Published: 2026-01-09 20:37

Last Updated: 2026-01-09 20:37

License

CC BY Attribution 4.0 International

Additional Metadata

Language:
English

Data and Code Availability Statement:
Supporting material, including the code and data generating the outputs is freely available at: https://martinbulla.github.io//MA_NHB/