Protein tertiary structure
This article needs additional citations for verification. (December 2013) (Learn how and when to remove this template message)
Protein tertiary structure is the three dimensional shape of a protein. The tertiary structure will have a single polypeptide chain "backbone" with one or more protein secondary structures, the protein domains. Amino acid side chains may interact and bond in a number of ways. The interactions and bonds of side chains within a particular protein determine its tertiary structure. The protein tertiary structure is defined by its atomic coordinates. These coordinates may refer either to a protein domain or to the entire tertiary structure. A number of tertiary structures may fold into a quaternary structure.
- 1 History
- 2 Determinants
- 3 Determination
- 4 Projects
- 5 See also
- 6 References
- 7 External links
The science of the tertiary structure of proteins has progressed from one of hypothesis to one of detailed definition. Although Emil Fischer had suggested proteins were made of polypeptide chains and amino acid side chains, it was Dorothy Maud Wrinch who incorporated geometry into the prediction of protein structures. Wrinch demonstrated this with the Cyclol model, the first prediction of the structure of a globular protein. Contemporary methods are able to determine, without prediction, tertiary structures to within 5 Å (0.5 nm) for small proteins (<120 residues) and, under favorable conditions, confident secondary structure predictions.
Stability of native states
A protein folded into its native state or native conformation typically has a lower Gibbs free energy (a combination of enthalpy and entropy) than the unfolded conformation. A protein will tend towards low-energy conformations, which will determine the protein's fold in the cellular environment. Because many similar conformations will have similar energies, protein structures are dynamic, fluctuating between a large these similar structures.
Globular proteins have a core of hydrophobic amino acid residues and a surface region of water-exposed, charged, hydrophilic residues. This arrangement may stabilise interactions within the tertiary structure. For example, in secreted proteins, which are not bathed in cytoplasm, disulfide bonds between cysteine residues help to maintain the tertiary structure. There is a commonality of stable tertiary structures seen in proteins of diverse function and diverse evolution. For example, the TIM barrel, named for the enzyme triosephosphateisomerase, is a common tertiary structure as is the highly stable, dimeric, coiled coil structure. Hence, proteins may be classified by the structures they hold. Databases of proteins which use such a classification include SCOP and CATH.
Folding kinetics may trap a protein in a high-energy conformation, i.e. a high-energy intermediate conformation blocks access to the lowest-energy conformation. The high-energy conformation may contribute to the function of the protein. For example, the influenza hemagglutinin protein is a single polypeptide chain which when activated, is proteolytically cleaved to form two polypeptide chains. The two chains are held in a high-energy conformation. When the local pH drops, the protein undergoes an energetically favorable conformational rearrangement that enables it to penetrate the host cell membrane.
Some tertiary protein structures may exist in long-lived states that are not the expected most stable state. For example, many serpins (serine protease inhibitors) show this metastability. They undergo a conformational change when a loop of the protein is cut by a protease.
It is commonly assumed that the native state of a protein is also the most thermodynamically stable and that a protein will reach its native state, given its chemical kinetics, before it is translated. Protein chaperones within the cytoplasm of a cell assist a newly synthesised polypeptide to attain its native state. Some chaperone proteins are highly specific in their function, for example, protein disulfide isomerase; others are general in their function and may assist most globular proteins, for example, the prokaryotic GroEL/GroES system of proteins and the homologous eukaryotic heat shock proteins (the Hsp60/Hsp10 system).
Prediction of protein tertiary structure relies on knowing the protein's primary structure and comparing the possible predicted tertiary structure with known tertiary structures in protein data banks. This only takes into account the cytoplasmic environment present at the time of protein synthesis to the extent that a similar cytoplasmic environment may also have influenced the structure of the proteins recorded in the protein data bank.
The structure of a protein, for example an enzyme, may change upon binding of its natural ligands, for example a cofactor. In this case, the structure of the protein bound to the ligand is known as holo structure, of the unbound protein as apo structure.
X-ray crystallography is the most common tool used to determine protein structure. It provides high resolution of the structure but it does not give information about protein's conformational flexibility.
Protein NMR gives comparatively lower resolution of protein structure. It is limited to smaller proteins. However, it can provide information about conformational changes of a protein in solution.
Cryogenic electron microscopy
Cryogenic electron microscopy (cryo-EM) can give information about both a protein's tertiary and quaternary structure. It is particularly well-suited to large proteins and symetrical complexes of protein subunits.
Dual polarisation interferometry
Dual polarisation interferometry provides complementary information about surface captured proteins. It assists in determining structure and conformation changes over time.
The [email protected] project at Stanford University is a distributed computing research effort which uses approximately 5 petaFLOPS (≈10 x86 petaFLOPS) of available computing. It aims to find an algorithm which will consistently predict protein tertiary and quaternary structures given the protein's amino acid sequence and its cellular conditions.
A list of software for protein tertiary structure prediction can be found at List of protein structure prediction software.
Protein aggregation diseases
Protein aggregation diseases such as Alzheimer's disease and Huntington's disease and prion diseases such as bovine spongiform encephalopathy can be better understood by constructing (and reconstructing) disease models. This is done by causing the disease in laboratory animals, for example, by administering a toxin, such as MPTP to cause Parkinson's disease, or through genetic manipulation. Protein structure prediction is a new way to create disease models, which may avoid the use of animals.
Protein Tertiary Structure Retrieval Project (CoMOGrad)
Matching patterns in tertiary structure of a given protein to huge number of known protein tertiary structures and retrieve most similar ones in ranked order is in the heart of many research areas like function prediction of novel proteins, study of evolution, disease diagnosis, drug discovery, antibody design etc. The CoMOGrad project at BUET is a research effort to device an extremely fast and much precise method for protein tertiary structure retrieval and develop online tool based on research outcome. The developed online tool is available at http://research.buet.ac.bd:8080/Comograd/. The pattern matching method is available at http://www.nature.com/articles/srep13275.
- IUPAC, Compendium of Chemical Terminology, 2nd ed. (the "Gold Book") (1997). Online corrected version: (2006–) "tertiary structure". doi:10.1351/goldbook.T06282
- Branden C. and Tooze J. "Introduction to Protein Structure" Garland Publishing, New York. 1990 and 1991.
- Kyte, J. "Structure in Protein Chemistry." Garland Publishing, New York. 1995. ISBN 0-8153-1701-8
- Senechal M. "I died for beauty: Dorothy Wrinch and the cultures of science." Oxford University Press, 2012. Chapter 14. ISBN 0-19-991083-9, 9780199910830. Accessed at Google Books 8 December 2013.
- Whisstock J (2006). "Molecular gymnastics: serpiginous structure, folding and scaffolding". Current Opinion in Structural Biology. 16 (6): 761–68. doi:10.1016/j.sbi.2006.10.005. PMID 17079131.
- Gettins PG (2002). "Serpin structure, mechanism, and function". Chem Rev. 102 (12): 4751–804. doi:10.1021/cr010170. PMID 12475206.
- Whisstock JC, Skinner R, Carrell RW, Lesk AM (2000). "Conformational changes in serpins: I. The native and cleaved conformations of alpha(1)-anti-trypsin". J Mol Biol. 296 (2): 685–99. doi:10.1006/jmbi.1999.3520. PMID 10669617.
- Seeliger, D; De Groot, B. L. (2010). "Conformational transitions upon ligand binding: Holo-structure prediction from apo conformations". PLoS Computational Biology. 6 (1): e1000634. doi:10.1371/journal.pcbi.1000634. PMC 2796265. PMID 20066034.
- "[email protected]" Stanford University. Accessed 18 December 2013.
- "[email protected] – FAQ" Stanford University. Accessed 18 December 2013.
- "[email protected] – Science." Stanford University.
- Schober A (October 2004). "Classic toxin-induced animal models of Parkinson's disease: 6-OHDA and MPTP". Cell Tissue Res. 318 (1): 215–24. doi:10.1007/s00441-004-0938-y. PMID 15503155.
- "Tp53 Knockout Rat". Cancer. Retrieved 2010-12-18.
- "Feature – What is Folding and Why Does it Matter?". Retrieved December 18, 2010.
- Protein Data Bank
- Display, analyse and superimpose protein 3D structures
- Alphabet of protein structures.
- Display, analyse and superimpose protein 3D structures
- WWW-based course teaching elementary protein bioinformatics
- Critical Assessment of Structure Prediction (CASP)
- Structural Classification of Proteins (SCOP)
- CATH Protein Structure Classification
- DALI/FSSP software and database of superposed protein structures
- TOPOFIT-DB Invariant Structural Cores between proteins
- PDBWiki — PDBWiki Home Page – a website for community annotation of PDB structures.