Limits...
Pattern similarity study of functional sites in protein sequences: lysozymes and cystatins.

Nakai S, Li-Chan EC, Dou J - BMC Biochem. (2005)

Bottom Line: Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes.Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences.This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available.

View Article: PubMed Central - HTML - PubMed

Affiliation: Food, Nutrition and Health, The University of British Columbia, 6650 Marine Drive, Vancouver, B.C., Canada. shuryo.nakai@ubc.ca

ABSTRACT

Background: Although it is generally agreed that topography is more conserved than sequences, proteins sharing the same fold can have different functions, while there are protein families with low sequence similarity. An alternative method for profile analysis of characteristic conserved positions of the motifs within the 3D structures may be needed for functional annotation of protein sequences. Using the approach of quantitative structure-activity relationships (QSAR), we have proposed a new algorithm for postulating functional mechanisms on the basis of pattern similarity and average of property values of side-chains in segments within sequences. This approach was used to search for functional sites of proteins belonging to the lysozyme and cystatin families.

Results: Hydrophobicity and beta-turn propensity of reference segments with 3-7 residues were used for the homology similarity search (HSS) for active sites. Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes. The profiles of similarity constants and average values of these parameters as functions of their positions in the sequences could identify both active and substrate binding sites of the lysozyme of Streptomyces coelicolor, which has been reported as a new fold enzyme (Cellosyl). The same approach was successfully applied to cystatins, especially for postulating the mechanisms of amyloidosis of human cystatin C as well as human lysozyme.

Conclusion: Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences. This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available.

Show MeSH

Related in: MedlinePlus

PCS scattergram of cystatins when hydrophobicity was used as property index. Human cystatin C (HuC) was used as the reference. Human cystatins M, E, S, SA, SN, D, F, A and B are labelled as 1, 2, 7, 8, 9, 10, 15, 16 and 17. Labels 3, 4, 6, and 11–14 are for cystatins from mouse C, rat C, bovine, hen, rainbow trout, chum salmon and carp, respectively.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
getmorefigures.php?uid=PMC1173080&req=5

Figure 5: PCS scattergram of cystatins when hydrophobicity was used as property index. Human cystatin C (HuC) was used as the reference. Human cystatins M, E, S, SA, SN, D, F, A and B are labelled as 1, 2, 7, 8, 9, 10, 15, 16 and 17. Labels 3, 4, 6, and 11–14 are for cystatins from mouse C, rat C, bovine, hen, rainbow trout, chum salmon and carp, respectively.

Mentions: The PCS scattergram of the 17 cystatins using human cystatin C (HCC) as a reference and hydrophobicity as side-chain property index is shown in Figure 5; alteration of the index to α-helix and β-strand propensities did not appreciably change the grouping results. The three groups include human cystatins C, D, S, SA, SN and hen cystatin (Group I), human cystatins E, F and M (Group II), and human cystatins A and B (Group III) which are the stefin group cystatins that are smaller in molecular size and have slightly lower papain inhibitory activity than HCC [22].


Pattern similarity study of functional sites in protein sequences: lysozymes and cystatins.

Nakai S, Li-Chan EC, Dou J - BMC Biochem. (2005)

PCS scattergram of cystatins when hydrophobicity was used as property index. Human cystatin C (HuC) was used as the reference. Human cystatins M, E, S, SA, SN, D, F, A and B are labelled as 1, 2, 7, 8, 9, 10, 15, 16 and 17. Labels 3, 4, 6, and 11–14 are for cystatins from mouse C, rat C, bovine, hen, rainbow trout, chum salmon and carp, respectively.
© Copyright Policy - open-access
Related In: Results  -  Collection

License
Show All Figures
getmorefigures.php?uid=PMC1173080&req=5

Figure 5: PCS scattergram of cystatins when hydrophobicity was used as property index. Human cystatin C (HuC) was used as the reference. Human cystatins M, E, S, SA, SN, D, F, A and B are labelled as 1, 2, 7, 8, 9, 10, 15, 16 and 17. Labels 3, 4, 6, and 11–14 are for cystatins from mouse C, rat C, bovine, hen, rainbow trout, chum salmon and carp, respectively.
Mentions: The PCS scattergram of the 17 cystatins using human cystatin C (HCC) as a reference and hydrophobicity as side-chain property index is shown in Figure 5; alteration of the index to α-helix and β-strand propensities did not appreciably change the grouping results. The three groups include human cystatins C, D, S, SA, SN and hen cystatin (Group I), human cystatins E, F and M (Group II), and human cystatins A and B (Group III) which are the stefin group cystatins that are smaller in molecular size and have slightly lower papain inhibitory activity than HCC [22].

Bottom Line: Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes.Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences.This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available.

View Article: PubMed Central - HTML - PubMed

Affiliation: Food, Nutrition and Health, The University of British Columbia, 6650 Marine Drive, Vancouver, B.C., Canada. shuryo.nakai@ubc.ca

ABSTRACT

Background: Although it is generally agreed that topography is more conserved than sequences, proteins sharing the same fold can have different functions, while there are protein families with low sequence similarity. An alternative method for profile analysis of characteristic conserved positions of the motifs within the 3D structures may be needed for functional annotation of protein sequences. Using the approach of quantitative structure-activity relationships (QSAR), we have proposed a new algorithm for postulating functional mechanisms on the basis of pattern similarity and average of property values of side-chains in segments within sequences. This approach was used to search for functional sites of proteins belonging to the lysozyme and cystatin families.

Results: Hydrophobicity and beta-turn propensity of reference segments with 3-7 residues were used for the homology similarity search (HSS) for active sites. Hydrogen bonding was used as the side-chain property for searching the binding sites of lysozymes. The profiles of similarity constants and average values of these parameters as functions of their positions in the sequences could identify both active and substrate binding sites of the lysozyme of Streptomyces coelicolor, which has been reported as a new fold enzyme (Cellosyl). The same approach was successfully applied to cystatins, especially for postulating the mechanisms of amyloidosis of human cystatin C as well as human lysozyme.

Conclusion: Pattern similarity and average index values of structure-related properties of side chains in short segments of three residues or longer were, for the first time, successfully applied for predicting functional sites in sequences. This new approach may be applicable to studying functional sites in un-annotated proteins, for which complete 3D structures are not yet available.

Show MeSH
Related in: MedlinePlus