DNA is the genetic material that carries genetic information in a cell and from generation to generation. At this stage, an attempt will be made to determine in what manner the genetic information exists in DNA molecule? Are they written in coded language on a DNA molecule? If they occur in the language of codes what is the nature of genetic code? The translation of proteins follows the triplet rule; a sequence of three mRNA base (a codon) designates one of the 20 different kinds of amino acids used in protein synthesis.
Genetic code is the sequence relationship between nucleotide in genes (or mRNA) and the amino acids in the proteins they encode. There are 64 possible triplets, and 61 of them are used to represent amino acids. The remaining three triplet codons are termination signals for polypeptide chains. Since there are only 20 amino acids involved in protein synthesis, most of them are encoded by more than one triplet. Two things make this multiple (degenerate) coding possible. First, there is more than one tRNA for most amino acids. Each tRNA has a different anticodon. Second, this pairing is highly specific for the first two portions on the codon, permitting Watson and Crick base pairs (A – U and G - C) to be formed. But at the third position there is a great deal of flexibility as to which base pairs are acceptable. Most part of the genetic code is universal, being the same in prokaryotes and eukaryotes.
The order of base pairs along DNA molecule controls the kind and order of amino acids found in the proteins of an organism. This specific order of base pairs is called genetic code, the blue print establishing the kinds of proteins to be synthesized which makes an organism unique.
Marshall Nirenberg, Severo Ochoa (enzyme polynucleotide phosphorylase called Ochoa’s enzyme), Hargobind Khorana, Francis Crick and many others have contributed significantly to decipher the genetic code. The order in which bases are arranged in mRNA decides the order in which amino acids are arranged in proteins. Finally a checker board for genetic code was prepared (table 5.1).
The salient features of genetic code are as follows:
• The genetic codon is a triplet code and 61 codons code for amino acids and 3 codons do not code for any amino acid and function as stop codon (Termination).
• The genetic code is universal. It means that all known living systems use nucleic acids and the same three base codons (triplet codon) direct the synthesis of protein from amino acids. For example, the mRNA (UUU) codon codes for phenylalanine in all cells of all organisms. Some exceptions are reported in prokaryotic, mitochondrial and
chloroplast genomes. However similarities are more common than differences.
• A non-overlapping codon means that the same letter is not used for two different codons. For instance, the nucleotide sequence GUU GUC represents only two codons.
• It is comma less, which means that the message would be read directly from one end to the other i.e., no punctuation are needed between two codes.
• A degenerate code means that more than one triplet codon could code for a specific amino acid. For example, codons GUU, GUC, GUA and GUG code for valine.
• Non-ambiguous code means that one codon will code for one amino acid.
• The code is always read in a fixed direction i.e. from 5’→3’ direction called polarity.
• AUG has dual functions. It acts as a initiator codon and also codes for the amino acid methionine.
Table 5.1 Genetic code dictionary
UUU Phe F PhenylalanineUUC Phe F PhenylalanineUUA Leu L LeucineUUG Leu L Leucine
UCU Ser S SerineUCC Ser S SerineUCA Ser S SerineUCG Ser S Serine
UAU Tyr Y TyrosineUAC Tyr Y TyrosineUAA TerminationUAG Termination
UGU Cys C CysteineUGC Cys C CysteineUGA TerminationUGG Trp W Tryptophan
CUU Leu L LeucineCUC Leu L LeucineCUA Leu L LeucineCUG Leu L Leucine
CCU Pro P ProlineCCC Pro P ProlineCCA Pro P ProlineCCG Pro P Proline
CAU His H HistidineCAC His H HistidineCAA Gln Q GlutamineCAG Gln Q Glutamine
CGU Arg R ArginineCGC Arg R ArginineCGA Arg R ArginineCGG Arg R Arginine
AUU Ile I IsoleucineAUC Ile I IsoleucineAUA Ile I IsoleucineAUG Met M Methionine
ACU Thr T ThreonineACC Thr T ThreonineACA Thr T ThreonineACG Thr T Threonine
AAU Asn N AsparagineAAC Asn N AsparagineAAA Lys K LysineAAG Lys K Lysine
AGU Ser S SerineAGC Ser S SerineAGA Arg R ArginineAGG Arg R Arginine
• UAA, UAG and UGA) codons are designated as termination (stop) codons and also are known as “non-sense” codons.
Wobble Hypothesis
It is a hypothesis proposed by Crick
(1966) which states that tRNA anticodon has the ability to wobble at its 5’ end by pairing with even non-complementary base of mRNA codon. According to this hypothesis, in codon-anticodon pairing the third base may not be complementary. The third base of the codon is called wobble base and this position is called wobble position. The actual base pairing occurs at first two positions only. The importance of Wobbling hypothesis is that it reduces the number of tRNAs required for polypeptide synthesis and it overcomes the effect of code degeneracy.
Alt text
In the above example though the codon and the anti codon do not match perfectly, yet the required amino acid is brought perfectly. This enables the economy of tRNA, GUU, GUC, GUA and GUG code for the amino acid - Valine.
example of point mutation that results in the change of amino acid residue glutamic acid to valine (Fig. 5.10). The mutant haemoglobin undergoes polymerisation under oxygen tension causing the change in the shape of the RBC from biconcave to a sickle shaped structure.
The effect of point mutation can be understood by the following example.
ABC DEF GHI JKL
If we insert a letter O between DEF and GHI the arrangement would be
ABC DEF OGH IJK L
If we insert OQ at the same place the arrangement would be
ABC DEF OQG HIJ KL
The above information shows that insertion or deletion of one or two bases, changes the reading frame from the point of insertions or deletions. Such mutations are referred to as frame shift insertion or deletion mutations. This forms the genetic basis of proof that codon is a triplet and is read in a continuous manner.