Kuru, Nurdan and Adebali, Ogün (2025) PHACE: phylogeny-aware detection of molecular coevolution. Molecular Biology and Evolution, 42 (7). ISSN 0737-4038 (Print) 1537-1719 (Online)
PHACE.pdf
Available under License Creative Commons Attribution Non-commercial.
Download (1MB)
Official URL: https://dx.doi.org/10.1093/molbev/msaf150
Abstract
The coevolution trends of amino acids within or between genes offer key insights into protein structure and function. Existing tools for uncovering coevolutionary signals primarily rely on multiple sequence alignments, often overlooking phylogenetic relatedness and shared evolutionary history. Here, we introduce PHACE, a phylogeny-aware coevolution algorithm that maps amino acid substitutions onto a phylogenetic tree to detect molecular coevolution. PHACE categorizes amino acids at each position into "tolerable"and "intolerable"groups, based on their independent recurrence across the tree, reflecting a position's tolerance to specific substitutions. Gaps are treated as a third character type, with only phylogenetically independent gap changes considered. The method computes substitution scores per branch by traversing the tree and quantifying probability differences across adjacent nodes for each group. To avoid artifacts from alignment errors, we apply a multiple sequence alignment-masking procedure. Compared to phylogeny-based methods (CAPS, CoMap) and state-of-the-art multiple sequence alignment-based approaches (DCA, GaussDCA, PSICOV, mutual information), PHACE shows significantly superior accuracy in identifying coevolving residue pairs, as measured by statistical metrics including Matthews correlation coefficient, area under the ROC curve, and F1 score. This performance stems from PHACE's explicit modeling of phylogenetic dependencies, often ignored in coevolution analyses.
| Item Type: | Article |
|---|---|
| Additional Information: | This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. |
| Uncontrolled Keywords: | amino acid substitution; coevolution; phylogenetics; protein structure |
| Divisions: | Faculty of Engineering and Natural Sciences |
| Depositing User: | Nurdan Kuru |
| Date Deposited: | 03 Sep 2025 10:41 |
| Last Modified: | 22 Jan 2026 14:15 |
| URI: | https://research.sabanciuniv.edu/id/eprint/52111 |

