This is a Preprint and has not been peer reviewed. This is version 3 of this Preprint.
Downloads
Authors
Abstract
Genetic and genomic data are collected for a vast array of scientific and applied purposes. Despite mandates for public archiving, data are typically used only by the generating authors. The reuse of genetic and genomic datasets remains uncommon because it is difficult, if not impossible, due to non-standard archiving practices and lack of contextual metadata. But as the new field of macrogenetics is demonstrating, if genetic data and their metadata were more accessible and FAIR compliant, they could be reused for many additional purposes. We discuss the main challenges with existing genetic and genomic data archives, and suggest best practices for archiving genetic and genomic data. Recognising that this is a longstanding issue due to little formal data management training within the fields of ecology and evolution, we highlight steps that research institutions and publishers could take to improve data archiving.
DOI
https://doi.org/10.32942/X29025
Subjects
Bioinformatics, Biology, Ecology and Evolutionary Biology, Genetics and Genomics, Life Sciences
Keywords
genetics, genomics, data archiving, Best practices, standardisation, ecology, evolution, open data
Dates
Published: 2023-09-25 11:29
Last Updated: 2024-04-03 22:54
Older Versions
License
CC-By Attribution-NonCommercial-NoDerivatives 4.0 International
Additional Metadata
Language:
English
Conflict of interest statement:
None
Data and Code Availability Statement:
Open data/code are not available before publication
There are no comments or no comments have been made public for this article.