Skip to main content
IQ-NET: A Deep Learning Approach for Fast and Accurate Phylogenetic Inference from Real Alignments

IQ-NET: A Deep Learning Approach for Fast and Accurate Phylogenetic Inference from Real Alignments

This is a Preprint and has not been peer reviewed. This is version 3 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Chen Yang, Bui Quang Minh, Nhan Ly-Trong, Zixin Zhuang, Piyumal Demotte, Le Sy Vinh, Cuong Cao Dang

Abstract

Phylogenetic inference is fundamental to modern biology, with applications spanning evolutionary biology, epidemiology, and comparative genomics. While maximum likelihood and Bayesian methods remain the gold standard due to their statistical rigor, they rely on simplifying evolutionary assumptions and are computationally intensive. Existing machine learning approaches offer speed advantages, but face several limitations: exclusive reliance on simulated training data, inadequate handling of gaps, focus primarily on topology rather than complete tree reconstruction, and sensitivity to input sequence order. Here, we introduce IQ-NET (Intelligent Quartet NETwork), a machine learning framework that addresses these limitations through training exclusively on real datasets, simultaneous inference of topology and branch lengths from gapped alignments without substitution model assumptions, and robustness to the order of input sequences. IQ-NET outperforms existing machine learning methods and achieves both higher accuracy and a 24-fold speedup over the IQ-TREE software. We also demonstrate IQ-NET's utility in species tree reconstruction by integrating it with ASTRAL.

DOI

https://doi.org/10.32942/X2ND3S

Subjects

Artificial Intelligence and Robotics

Keywords

Phylogenetic inference, machine learning, Quartet analysis, Empirical data training

Dates

Published: 2025-11-25 15:50

Last Updated: 2025-11-26 05:17

Older Versions

License

CC BY Attribution 4.0 International

Additional Metadata

Language:
English