Best practices for genetic and genomic data archiving

This is a Preprint and has not been peer reviewed. This is version 3 of this Preprint.

Add a Comment

You must log in to post a comment.


There are no comments or no comments have been made public for this article.


Download Preprint


Deborah M Leigh, Amy Vandergast, Maggie Hunter, Eric Crandall, W. Chris Funk, Colin J Garroway, Sean Hoban, Sara J Oyler-McCance, Christian Rellstab, Gernot Segelbacher, Chloe Schmidt, Ella Vázquez-Domínguez, Ivan Paz-Vinas


Genetic and genomic data are collected for a vast array of scientific and applied purposes. Despite mandates for public archiving, data are typically used only by the generating authors. The reuse of genetic and genomic datasets remains uncommon because it is difficult, if not impossible, due to non-standard archiving practices and lack of contextual metadata. But as the new field of macrogenetics is demonstrating, if genetic data and their metadata were more accessible and FAIR compliant, they could be reused for many additional purposes. We discuss the main challenges with existing genetic and genomic data archives, and suggest best practices for archiving genetic and genomic data. Recognising that this is a longstanding issue due to little formal data management training within the fields of ecology and evolution, we highlight steps that research institutions and publishers could take to improve data archiving.



Bioinformatics, Biology, Ecology and Evolutionary Biology, Genetics and Genomics, Life Sciences


genetics, genomics, data archiving, Best practices, standardisation, ecology, evolution, open data


Published: 2023-09-25 11:29

Last Updated: 2024-04-03 22:54

Older Versions

CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Additional Metadata


Conflict of interest statement:

Data and Code Availability Statement:
Open data/code are not available before publication