Rock, paper, scissors: harnessing complementarity in ortholog detection methods improves comparative genomic inference.
Bottom Line: This task involves accurately identifying genes across species that descend from a common ancestral sequence.In head-to-head comparisons, we find that these algorithms significantly outperform one another for 38-45% of the genes analyzed.Further, this improvement in alignment quality yields more confidently aligned sites and higher levels of overall conservation, while simultaneously detecting of up to 180% more positively selected sites.
Affiliation: Department of Epidemiology and Biostatistics, University of California, San Francisco, University of California, San Francisco, San Francisco, California.Show MeSH
Related in: MedlinePlus
Mentions: With HMMER3, we queried protein sequences against all PfamA protein family profiles, annotating each protein according to its top protein family hit. This allowed for an ascertainment of functional concordance that is more comprehensive than relying on gene-by-gene annotation across species, while retaining many of the advantages of manual curation where it exists. This assessment reveals that, for the set of orthologous sequences proposed by all methods, MOSAIC provides levels of functional concordance that are slightly better than the best performing component method (Figure 5). Gains are particularly large for gorilla, bushbaby, and cat orthologs (Figure S4).
Affiliation: Department of Epidemiology and Biostatistics, University of California, San Francisco, University of California, San Francisco, San Francisco, California.