Kozak Consensus Sequence

The Kozak consensus sequence, Kozak consensus or Kozak sequence, is a sequence which occurs on eukaryotic mRNA and has the consensus (gcc)gccRccAUGG. The Kozak consensus sequence plays a major role in the initiation of the translation process. The sequence was named after the person who brought it to prominance, Marilyn Kozak.

The sequence is identified by the notation (gcc)gccRccAUGG, which summarizes data analysed by Kozak from a wide variety of sources (about 699 in all) as follows:

(1) a lower case letter denotes the most common base at a position where the base can nevertheless vary,
(2) upper case letters indicate highly-conserved bases, i.e. the 'AUGG' sequence is constant or rarely, if ever, changes, with the exception being the IUPAC ambiguity code 'R' which indicates that a purine (adenine or guanine) is always observed at this position (with adenine being claimed by Kozak to be more frequent),
(3) the sequence in brackets ((gcc)) is of uncertain significance.

Kozak's paper was limited to a subset of vertebrates (i.e. human, cow, cat, dog, chicken, guinea pig, hampster, mouse, pig, rabbit, sheep and xenopus).

Read more about Kozak Consensus Sequence:  Introduction, Mutations, Variations in The Consensus Sequence

Famous quotes containing the words consensus and/or sequence:

    No consensus of men can make an error erroneous. We can only find or commit an error, not create it. When we commit an error, we say what was an error already.
    Josiah Royce (1855–1916)

    We have defined a story as a narrative of events arranged in their time-sequence. A plot is also a narrative of events, the emphasis falling on causality. “The king died and then the queen died” is a story. “The king died, and then the queen died of grief” is a plot. The time sequence is preserved, but the sense of causality overshadows it.
    —E.M. (Edward Morgan)