Adaptive sampling for ecological monitoring using biased data: A stratum-based approach

This is a Preprint and has not been peer reviewed. This is version 3 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Oliver L. Pescott , Gary D. Powney, Rob James Boyd

Abstract

Indicators of biodiversity change across large extents of geographic, temporal and taxonomic space are frequent products of various types of ecological monitoring and other data collection efforts. Unfortunately, many such indicators are based on data that are highly unlikely to be representative of the intended statistical populations. Where there is full control over sampling processes, individual spatial units within a geographical population have known inclusion probabilities, but these are unknown in the absence of any statistical design. This could be due to the voluntary nature of surveys and/or because of dataset aggregation. In these cases some degree of sampling bias is inevitable and, depending on error tolerance relative to some real-world goal, we may need to ameliorate it. One option is poststratification to adjust for uneven surveying of strata assumed to be important for unbiased estimation. We propose that a similar strategy can be used for the prioritisation of future data collection: that is, an adaptive sampling process focused on increasing representativeness defined in terms of inclusion probabilities. This is easily achieved by monitoring the proportional allocation of sampled units in strata relative to that expected under simple random sampling. The allocation of new units is thus that which reduces the departure from randomness (or, equivalently, that equalising unit inclusion probabilities), allowing an estimator to approach that level of error expected under random sampling. We describe the theory supporting this, and demonstrate its application using sample locations from the UK National Plant Monitoring Scheme, a citizen science monitoring programme with uneven uptake, and data on the true distribution of the plant Calluna vulgaris. This in silico example demonstrates how the successful application of the method depends on the extent to which proposed strata capture correlations between inclusion probabilities and the response of interest.

DOI

https://doi.org/10.32942/X2MG82

Subjects

Applied Statistics, Ecology and Evolutionary Biology, Life Sciences, Other Ecology and Evolutionary Biology

Keywords

survey error, survey quality, poststratification, weighting, response propensity, R-indicators, time-trends

Dates

Published: 2024-09-10 03:29

Last Updated: 2025-01-13 12:09

Older Versions
License

CC BY Attribution 4.0 International

Additional Metadata

Language:
English

Conflict of interest statement:
None

Data and Code Availability Statement:
https://doi.org/10.5281/zenodo.13736327