This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

De Novo Gene Emergence: Summary, Classification, and Challenges of Current Methods
Downloads
Authors
Abstract
A novel mechanism of de novo gene origination from non-genic sequences was first proposed in the early 2000s. Subsequent studies have since provided evidence of de novo gene emergence across all domains of life, revealing its occurrence to be more frequent than initially anticipated.
While studies mainly agree on the general concept of de novo emergence from non-genic DNA, the exact methods and definitions for detecting de novo genes differ significantly.
Here, we provide a comprehensive step-by-step description of the most commonly used methods for de novo gene detection. In addition, we address the limitations of nomenclature and detection methods and clarify some complex concepts that are sometimes misused.
This review is accompanied by the publication of a de novo gene annotation format to standardise the reporting of methodology, enable reproducibility and improve the comparability of datasets.
DOI
https://doi.org/10.32942/X2DP88
Subjects
Bioinformatics, Genomics
Keywords
de novo genes, proto-genes, annotation format, standardisation and comparability
Dates
Published: 2025-04-08 09:49
Last Updated: 2025-04-08 09:49
License
CC BY Attribution 4.0 International
Additional Metadata
Language:
English
Data and Code Availability Statement:
https://github.com/EDohmen/denofo
There are no comments or no comments have been made public for this article.