CATH--a hierarchic classification of protein domain structures
- PMID: 9309224
- DOI: 10.1016/s0969-2126(97)00260-8
CATH--a hierarchic classification of protein domain structures
Abstract
Background: Protein evolution gives rise to families of structurally related proteins, within which sequence identities can be extremely low. As a result, structure-based classifications can be effective at identifying unanticipated relationships in known structures and in optimal cases function can also be assigned. The ever increasing number of known protein structures is too large to classify all proteins manually, therefore, automatic methods are needed for fast evaluation of protein structures.
Results: We present a semi-automatic procedure for deriving a novel hierarchical classification of protein domain structures (CATH). The four main levels of our classification are protein class (C), architecture (A), topology (T) and homologous superfamily (H). Class is the simplest level, and it essentially describes the secondary structure composition of each domain. In contrast, architecture summarises the shape revealed by the orientations of the secondary structure units, such as barrels and sandwiches. At the topology level, sequential connectivity is considered, such that members of the same architecture might have quite different topologies. When structures belonging to the same T-level have suitably high similarities combined with similar functions, the proteins are assumed to be evolutionarily related and put into the same homologous superfamily.
Conclusions: Analysis of the structural families generated by CATH reveals the prominent features of protein structure space. We find that nearly a third of the homologous superfamilies (H-levels) belong to ten major T-levels, which we call superfolds, and furthermore that nearly two-thirds of these H-levels cluster into nine simple architectures. A database of well-characterised protein structure families, such as CATH, will facilitate the assignment of structure-function/evolution relationships to both known and newly determined protein structures.
Similar articles
-
The CATH classification revisited--architectures reviewed and new ways to characterize structural divergence in superfamilies.Nucleic Acids Res. 2009 Jan;37(Database issue):D310-4. doi: 10.1093/nar/gkn877. Epub 2008 Nov 7. Nucleic Acids Res. 2009. PMID: 18996897 Free PMC article.
-
Structural diversity of domain superfamilies in the CATH database.J Mol Biol. 2006 Jul 14;360(3):725-41. doi: 10.1016/j.jmb.2006.05.035. Epub 2006 Jun 2. J Mol Biol. 2006. PMID: 16780872
-
The CATH Database provides insights into protein structure/function relationships.Nucleic Acids Res. 1999 Jan 1;27(1):275-9. doi: 10.1093/nar/27.1.275. Nucleic Acids Res. 1999. PMID: 9847200 Free PMC article.
-
Protein folds, functions and evolution.J Mol Biol. 1999 Oct 22;293(2):333-42. doi: 10.1006/jmbi.1999.3054. J Mol Biol. 1999. PMID: 10529349 Review.
-
The history of the CATH structural classification of protein domains.Biochimie. 2015 Dec;119:209-17. doi: 10.1016/j.biochi.2015.08.004. Epub 2015 Aug 4. Biochimie. 2015. PMID: 26253692 Free PMC article. Review.
Cited by
-
US-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes.Nat Methods. 2022 Sep;19(9):1109-1115. doi: 10.1038/s41592-022-01585-1. Epub 2022 Aug 29. Nat Methods. 2022. PMID: 36038728
-
Mapping small molecule binding data to structural domains.BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S11. doi: 10.1186/1471-2105-13-S17-S11. Epub 2012 Dec 13. BMC Bioinformatics. 2012. PMID: 23282026 Free PMC article.
-
Rifampin phosphotransferase is an unusual antibiotic resistance kinase.Nat Commun. 2016 Apr 22;7:11343. doi: 10.1038/ncomms11343. Nat Commun. 2016. PMID: 27103605 Free PMC article.
-
An efficient algorithm for protein structure comparison using elastic shape analysis.Algorithms Mol Biol. 2016 Sep 29;11:27. doi: 10.1186/s13015-016-0089-1. eCollection 2016. Algorithms Mol Biol. 2016. PMID: 27708689 Free PMC article.
-
Structural and thermodynamic consequences of b heme binding for monomeric apoglobins and other apoproteins.Gene. 2007 Aug 15;398(1-2):12-28. doi: 10.1016/j.gene.2007.02.046. Epub 2007 May 1. Gene. 2007. PMID: 17550789 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous