Gene expression is the process by which the information encoded in a gene is used to produce a functional product — most commonly a protein. It is the fundamental mechanism by which genotype gives rise to phenotype.
Proposed by Francis Crick (1958), the Central Dogma describes the directional flow of genetic information:
| Step | Process | Enzyme | Location |
|---|---|---|---|
| Transcription | RNA Polymerase | Nucleus (eukaryotes) | |
| Translation | Ribosomes | Cytoplasm |
Note: Reverse transcriptase (found in retroviruses) can also copy RNA back into DNA, which is an exception to the classical Central Dogma.
Transcription is the synthesis of mRNA from a DNA template.
The promoter is a specific DNA sequence upstream (5') of a gene where RNA polymerase binds to initiate transcription. In eukaryotes, the TATA box (a conserved AT-rich sequence ~25–30 bp upstream of the start site) is a key promoter element recognized by transcription factors.
In eukaryotes, the initial transcript is called pre-mRNA and must be processed before leaving the nucleus:
A modified guanosine nucleotide (7-methylguanosine cap) is added to the 5' end. It protects mRNA from degradation and assists ribosome binding.
A string of ~200 adenine nucleotides is added to the 3' end, protecting mRNA from degradation and aiding export from the nucleus.
| Feature | Exons | Introns |
|---|---|---|
| Meaning | Expressed sequences | Intervening sequences |
| Fate | Retained in mature mRNA | Removed (spliced out) |
| Translation | Yes — translated into protein | No |
Alternative splicing allows different combinations of exons to be joined, producing multiple different proteins from a single gene — greatly expanding the proteome.
Translation is the synthesis of a polypeptide chain using the mRNA sequence as a template, occurring on ribosomes in the cytoplasm.
| RNA Type | Role in Translation |
|---|---|
| mRNA (messenger RNA) | Carries the genetic code from DNA to ribosome |
| tRNA (transfer RNA) | Carries specific amino acids; anticodon pairs with mRNA codon |
| rRNA (ribosomal RNA) | Structural and catalytic component of ribosomes |
All somatic cells in a multicellular organism contain the same genome (identical DNA). However, different cell types (e.g., liver cells, neurons, muscle cells) look and function very differently because they express different subsets of genes.
This selective activation and silencing of genes is called differential gene expression and is the basis of:
Example: A liver cell expresses genes for albumin and clotting factors; a neuron expresses genes for neurotransmitter synthesis. Both cells have both sets of genes, but only the relevant ones are expressed.
Gene expression can be regulated at multiple levels: