HAMAP in 2015: updates to the protein family classification and annotation system.
Bottom Line: Here we report on the growth of HAMAP and updates to the HAMAP system since our last report in the NAR Database Issue of 2013.We demonstrate how the complex logic of HAMAP rules allows for precise annotation of individual functional variants within large homologous protein families.We also describe improvements to our web-based tool HAMAP-Scan which simplify the classification and annotation of sequences, and the incorporation of an improved sequence-profile search algorithm.
Affiliation: Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 rue Michel-Servet, CH-1211 Geneva 4, Switzerland.Show MeSH
Related in: MedlinePlus
Mentions: HAMAP defines 8 PFK families in line with the currently accepted classification of PFKs (32,35) (Table 1). Several of the eight HAMAP families include both PPi-dependent and ATP-dependent members, suggesting that phosphoryl-donor specificity may have changed at multiple times during the evolution of the PFK superfamily. Figure 2 illustrates how this functional variation within families is treated by HAMAP using annotation rule MF_01976, which describes members of the mixed substrate PFK group III subfamily. The precise annotation that is applied to members of this family depends on the nature of the active site residue (D104 in the experimentally characterized template of Amycolatopsis methanolica—UniProtKB/Swiss-Prot record Q59126). Case statements within the rule specify the correct protein name, catalytic activity (including EC number), function, keywords, GO terms and other annotations for family members bearing either D or G at their active site. Sequences having neither of these residues are annotated as generic 6-phosphofructokinases of unknown substrate-specificity. The example of PFK illustrates how a single residue may determine substrate specificity and enzyme function, but HAMAP rule syntax also allows conditional annotation based on the combination of multiple residues or sequence motifs. The methylthioadenosine (MTA) phosphorylases are one example, where conserved amino acid substitutions in the substrate binding pocket convert the substrate specificity of this enzyme from 6-aminopurine (EC 220.127.116.11) to 6-oxopurine nucleosides (EC 18.104.22.168 and EC 22.214.171.124) (described in MF_01963).
Affiliation: Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CMU, 1 rue Michel-Servet, CH-1211 Geneva 4, Switzerland.