CATH: comprehensive structural and functional annotations for genome sequences.
Bottom Line: The latest version of the CATH-Gene3D protein structure classification database (4.0, http://www.cathdb.info) provides annotations for over 235,000 protein domain structures and includes 25 million domain predictions.This article provides an update on the major developments in the 2 years since the last publication in this journal including: significant improvements to the predictive power of our functional families (FunFams); the release of our 'current' putative domain assignments (CATH-B); a new, strictly non-redundant data set of CATH domains suitable for homology benchmarking experiments (CATH-40) and a number of improvements to the web pages.
Affiliation: Institute of Structural and Molecular Biology, UCL, 636 Darwin Building, Gower Street, WC1E 6BT, UK email@example.com.Show MeSH
Mentions: Since the previous paper, we have added superfamily superpositions to the CATH superfamily web pages. These are superpositions of all representative domains within a superfamily (selected from sequence clusters at 35% sequence identity over 80% overlap). Figure 3 shows a superposition of the 15 representative domains in the Homing endonucleases superfamily in CATH v4.0 (126.96.36.199) coloured by secondary structure and by rainbow.
Affiliation: Institute of Structural and Molecular Biology, UCL, 636 Darwin Building, Gower Street, WC1E 6BT, UK firstname.lastname@example.org.