Skip to main content
The weak driver conundrum: data archiving and biological phenomena impact macrogenetic findings

The weak driver conundrum: data archiving and biological phenomena impact macrogenetic findings

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Ivo Colmonero-Costeira, Deborah M Leigh

Abstract

Macrogenetics seeks to identify the global drivers and patterns in intraspecific genetic diversity, yet many reported patterns are weak or inconsistent. To achieve multispecies global inference, many macrogenetic studies leverage open sequencing data that can suffer from archiving biases. It remains unclear if macrogenetic inconsistencies are innate genetic phenomena, or are the product of open data limitations. Using three widely available genetic markers from the mitochondrion (cytb, co1) and nuclear (TLR4) genomes archived as haplotypes, here we demonstrate archiving biases are powerful enough to distort nucleotide diversity estimates and patterns. Distortion is worsened in analysis using geographic gridded cells, where archiving efforts both outweigh and interact with ecological predictors. Nevertheless, previously described incongruences in drivers of nuclear and mitochondrial diversity appear to be biologically meaningful, indicating some inconsistencies are innate to genetic data.

DOI

https://doi.org/10.32942/X2Q36F

Subjects

Biodiversity, Evolution, Genetics, Genomics, Other Ecology and Evolutionary Biology

Keywords

Open-data bias; Haplotype; Genetic Variation; Nucleotide diversity.

Dates

Published: 2025-12-10 18:18

Last Updated: 2025-12-10 18:18

License

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata

Language:
English

Data and Code Availability Statement:
https://github.com/deborahmleigh/Archiving-in-macrogenetics