Skip to main content
Substitution Models in Phylogenetic Reconstruction from Molecular Data: Theoretical Principles, Implementation and  Selection Methodologies, Limits of Results Interpretation, and Recent Advances

Substitution Models in Phylogenetic Reconstruction from Molecular Data: Theoretical Principles, Implementation and Selection Methodologies, Limits of Results Interpretation, and Recent Advances

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Richard Murdoch Montgomery

Abstract

Substitution models constitute the mathematical foundation of modern phylogenetic inference, providing the probabilistic framework necessary for reconstructing evolutionary relationships from molecular sequence data. These models describe the stochastic processes governing nucleotide or amino acid changes over evolutionary time through continuous-time Markov chains, enabling maximum likelihood and Bayesian approaches to phylogenetic reconstruction. This comprehensive review examines the theoretical principles underlying substitution models, from the foundational Jukes-Cantor model to sophisticated general time-reversible frameworks, whilst addressing critical aspects of model selection, parameter estimation, and computational implementation. We analyse the mathematical formulations of key models including JC69, K80, F81, HKY85, and GTR, presenting their rate matrices, transition probabilities, and equilibrium conditions in formal notation suitable for publication. Through computational illustrations and graphical analyses, we demonstrate the behaviour of these models under varying parameter conditions and evolutionary scenarios. The article critically evaluates model selection

DOI

https://doi.org/10.32942/X2FD0S

Subjects

Life Sciences

Keywords

substitution models, phylogenetic reconstruction, molecular evolution, maximum likelihood, Bayesian inference, model selection, DNA evolution, protein evolution, Markov chains, evolutionary genomics

Dates

Published: 2025-07-17 23:55

Last Updated: 2025-07-17 23:55

License

CC BY Attribution 4.0 International

Additional Metadata

Language:
English

Conflict of interest statement:
The Author declares there are no conflicts of Interest.