Skip to main content
De Novo Gene Emergence: Summary, Classification, and Challenges of Current Methods

De Novo Gene Emergence: Summary, Classification, and Challenges of Current Methods

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Anna Grandchamp, Margaux Aubel, Lars A Eicholt, Paul Roginski, Victor Luria, Amir Karger, Elias Dohmen 

Abstract

A novel mechanism of de novo gene origination from non-genic sequences was first proposed in the early 2000s. Subsequent studies have since provided evidence of de novo gene emergence across all domains of life, revealing its occurrence to be more frequent than initially anticipated.
While studies mainly agree on the general concept of de novo emergence from non-genic DNA, the exact methods and definitions for detecting de novo genes differ significantly.
Here, we provide a comprehensive step-by-step description of the most commonly used methods for de novo gene detection. In addition, we address the limitations of nomenclature and detection methods and clarify some complex concepts that are sometimes misused.
This review is accompanied by the publication of a de novo gene annotation format to standardise the reporting of methodology, enable reproducibility and improve the comparability of datasets.

DOI

https://doi.org/10.32942/X2DP88

Subjects

Bioinformatics, Genomics

Keywords

de novo genes, proto-genes, annotation format, standardisation and comparability

Dates

Published: 2025-04-08 09:49

Last Updated: 2025-04-08 09:49

License

CC BY Attribution 4.0 International

Additional Metadata

Language:
English

Data and Code Availability Statement:
https://github.com/EDohmen/denofo