This is a Preprint and has not been peer reviewed. This is version 3 of this Preprint.
IQ-NET: Fast and Accurate Quartet Phylogenetic Inference Using Deep Learning Trained on Empirical DNA Alignments
Downloads
Authors
Abstract
Phylogenetic inference is fundamental to modern biology, with many applications including evolutionary biology, epidemiology, and comparative genomics. While maximum likelihood and Bayesian methods remain the gold standard for phylogenetic analysis, they rely on simplifying assumptions and are computationally intensive. Recent machine learning approaches for phylogenetics offer speed advantages, but have several limitations: exclusive reliance on simulated data for training, inadequate handling of gaps, and sensitivity to input sequence order. Here, we introduce IQ-NET (Intelligent Quartet NETwork), a deep learning framework that solves these limitations to infer four-taxon trees. IQ-NET estimates both tree topology and branch lengths directly from gapped alignments. IQ-NET outperforms existing machine learning methods in terms of accuracy, and obtained a 24-fold speedup compared with the widely used maximum likelihood software, IQ-TREE. We finally introduce a pipeline using IQ-NET and the ASTRAL software to reconstruct a larger species tree, i.e., with more than four taxa.
DOI
https://doi.org/10.32942/X2ND3S
Subjects
Artificial Intelligence and Robotics
Keywords
Phylogenetic inference, machine learning, Quartet analysis, Empirical data training
Dates
Published: 2025-11-25 16:50
Last Updated: 2026-05-05 06:42
License
CC BY Attribution 4.0 International
Additional Metadata
Language:
English
There are no comments or no comments have been made public for this article.