Premium accounts now available! Sign up and create a premium account. Read more Close

Advertisement

Image

Alignment-free phylogenetic inference via hyperbolic protein language models

Preprint Created on 30 May 2026 bioRxiv

Conventional phylogenetic methods rely on multiple sequence alignments which are computationally intensive and often fail for highly divergent lineages. Here, we introduce LucaPhylo, an alignment-free framework that infers evolutionary relationships directly from unaligned sequences. Through a cascaded learning strategy LucaPhylo integrates protein language models with hyperbolic geometry, a representation space naturally suited to hierarchical branching, to capture deep evolutionary constraints without explicit homology matching. Using highly divergent RNA virosphere as a test case, LucaPhylo places unaligned sequences into phylogenetic trees with an accuracy comparable to leading alignment-based tree construction tools, while retaining divergent sequences that conventional pipelines frequently discard. It further enables the integration of divergent viral lineages into phylogenetic trees, thereby expanding the evolutionary landscape of RNA viruses. Together, LucaPhylo establishes an AI-driven, alignment-free paradigm for phylogenetic inference and provides a robust computational foundation for resolving deep evolutionary relationships among RNA viruses and other biological systems.

Shan, Y., Fang, P., Liu, Y., Pan, Y., Liu, K., He, Y., Liu, X., Wu, W., Xue, G., He, J., Guo, D., He, J., Holmes, E. C., Shi, M., Li, Z.

Advertisement

Stats

  • Recommendations n/a n/a positive of 0 vote(s)
  • Views 9
  • Comments 0

Recommended by

  • No recommendations yet.

Post a comment

You need to be signed in to post comments. You can sign in here.

Comments

There are no comments yet.

Advertisement