Collagen - Molecular Structure

Molecular Structure

The tropocollagen or collagen molecule is a subunit of larger collagen aggregates such as fibrils. At approximately 300 nm long and 1.5 nm in diameter, it is made up of three polypeptide strands (called alpha peptides, see step 2), each possessing the conformation of a left-handed helix (its name is not to be confused with the commonly occurring alpha helix, a right-handed structure). These three left-handed helices are twisted together into a right-handed coiled coil, a triple helix or "super helix", a cooperative quaternary structure stabilized by numerous hydrogen bonds. With type I collagen and possibly all fibrillar collagens if not all collagens, each triple-helix associates into a right-handed super-super-coil referred to as the collagen microfibril. Each microfibril is interdigitated with its neighboring microfibrils to a degree that might suggest they are individually unstable, although within collagen fibrils, they are so well ordered as to be crystalline.

A distinctive feature of collagen is the regular arrangement of amino acids in each of the three chains of these collagen subunits. The sequence often follows the pattern Gly-Pro-X or Gly-X-Hyp, where X may be any of various other amino acid residues. Proline or hydroxyproline constitute about 1/6 of the total sequence. With glycine accounting for the 1/3 of the sequence, this means approximately half of the collagen sequence is not glycine, proline or hydroxyproline, a fact often missed due to the distraction of the unusual GX1X2 character of collagen alpha-peptides. The high glycine content of collagen is important with respect to stabilization of the collagen helix as this allows the very close association of the collagen fibers within the molecule, facilitating hydrogen bonding and the formation of intermolecular cross-links. This kind of regular repetition and high glycine content is found in only a few other fibrous proteins, such as silk fibroin. About 75-80% of silk is (approximately) -Gly-Ala-Gly-Ala- with 10% serine, and elastin is rich in glycine, proline, and alanine (Ala), whose side group is a small methyl group. Such high glycine and regular repetitions are never found in globular proteins save for very short sections of their sequence. Chemically-reactive side groups are not needed in structural proteins, as they are in enzymes and transport proteins; however, collagen is not quite just a structural protein. Due to its key role in the determination of cell phenotype, cell adhesion, tissue regulation and infrastructure, many sections of its nonproline-rich regions have cell or matrix association / regulation roles. The relatively high content of proline and hydroxyproline rings, with their geometrically constrained carboxyl and (secondary) amino groups, along with the rich abundance of glycine, accounts for the tendency of the individual polypeptide strands to form left-handed helices spontaneously, without any intrachain hydrogen bonding.

Because glycine is the smallest amino acid with no side chain, it plays a unique role in fibrous structural proteins. In collagen, Gly is required at every third position because the assembly of the triple helix puts this residue at the interior (axis) of the helix, where there is no space for a larger side group than glycine’s single hydrogen atom. For the same reason, the rings of the Pro and Hyp must point outward. These two amino acids help stabilize the triple helix—Hyp even more so than Pro; a lower concentration of them is required in animals such as fish, whose body temperatures are lower than most warm-blooded animals. Lower proline and hydroxyproline contents are characteristic of cold-water, but not warm-water fish; the latter tend to have similar proline and hydroxyproline contents to mammals. The lower proline and hydroxproline contents of cold-water fish and other poikilotherm animals leads to their collagen having a lower thermal stability than mammalian collagen. This lower thermal stability means that gelatin derived from fish collagen is not suitable for many food and industrial applications.

The tropocollagen subunits spontaneously self-assemble, with regularly staggered ends, into even larger arrays in the extracellular spaces of tissues. In the fibrillar collagens, the molecules are staggered from each other by about 67 nm (a unit that is referred to as ‘D’ and changes depending upon the hydration state of the aggregate). Each D-period contains four plus a fraction collagen molecules, because 300 nm divided by 67 nm does not give an integer (the length of the collagen molecule divided by the stagger distance D). Therefore, in each D-period repeat of the microfibril, there is a part containing five molecules in cross-section, called the “overlap”, and a part containing only four molecules, called the "gap". The triple-helices are also arranged in a hexagonal or quasihexagonal array in cross-section, in both the gap and overlap regions.

There is some covalent crosslinking within the triple helices, and a variable amount of covalent crosslinking between tropocollagen helices forming well organized aggregates (such as fibrils). Larger fibrillar bundles are formed with the aid of several different classes of proteins (including different collagen types), glycoproteins and proteoglycans to form the different types of mature tissues from alternate combinations of the same key players. Collagen's insolubility was a barrier to the study of monomeric collagen until it was found that tropocollagen from young animals can be extracted because it is not yet fully crosslinked. However, advances in microscopy techniques (i.e. electron microscopy (EM) and atomic force microscopy (AFM)) and X-ray diffraction have enabled researchers to obtain increasingly detailed images of collagen structure in situ. These later advances are particularly important to better understanding the way in which collagen structure affects cell-cell and cell-matrix communication, and how tissues are constructed in growth and repair, and changed in development and disease. For example using AFM –based nanoindentation it has been shown that a single collagen fibril is a heterogeneous material along its axial direction with significantly different mechanical properties in its gap and overlap regions, correlating with its different molecular organizations in these two regions.

Collagen fibrils are semicrystalline aggregates of collagen molecules. Collagen fibers are bundles of fibrils.

Collagen fibrils/aggregates are arranged in different combinations and concentrations in various tissues to provide varying tissue properties. In bone, entire collagen triple helices lie in a parallel, staggered array. 40 nm gaps between the ends of the tropocollagen subunits (approximately equal to the gap region) probably serve as nucleation sites for the deposition of long, hard, fine crystals of the mineral component, which is (approximately) Ca10(OH)2(PO4)6. Type I collagen gives bone its tensile strength.

Read more about this topic:  Collagen

Famous quotes containing the word structure:

    It is difficult even to choose the adjective
    For this blank cold, this sadness without cause.
    The great structure has become a minor house.
    No turban walks across the lessened floors.
    The greenhouse never so badly needed paint.
    Wallace Stevens (1879–1955)