Rock, paper, scissors: harnessing complementarity in ortholog detection methods improves comparative genomic inference.
Bottom Line: This task involves accurately identifying genes across species that descend from a common ancestral sequence.In head-to-head comparisons, we find that these algorithms significantly outperform one another for 38-45% of the genes analyzed.Further, this improvement in alignment quality yields more confidently aligned sites and higher levels of overall conservation, while simultaneously detecting of up to 180% more positively selected sites.
Affiliation: Department of Epidemiology and Biostatistics, University of California, San Francisco, University of California, San Francisco, San Francisco, California.Show MeSH
Related in: MedlinePlus
Mentions: In Figure 6B, we see that MOSAIC leads to greater gene-level conservation (lower dN/dS) compared with every method except Blat, for which the difference was not statistically significant. Full distributions of dN/dS for each method are presented in Figure S7. Despite greater levels of conservation, MOSAIC was able to detect ~30–180% more positively selected sites than any of its component methods. This was not due to an increase in the inferred rate of positive selection. Rather, most of this increase in power was attributable to the fact that more sites were aligned to high confidence and therefore included in the analysis. This step of filtering for alignment quality is important because site-wise estimates of positive selection are highly sensitive to short poorly aligned regions (Jordan and Goldman 2012).
Affiliation: Department of Epidemiology and Biostatistics, University of California, San Francisco, University of California, San Francisco, San Francisco, California.