Alternate meanings: transcription (linguistics) - conversion of spoken words into written language. See: transcript.

Transcription is the process of copying DNA to RNA by an enzyme called RNA polymerase (RNAP). Transcription to yield an mRNA is the first step of protein biosynthesis.

Table of contents
1 Bacterial transcription
2 Eukaryotic transcription

Bacterial transcription

A (simple) model for a bacterial gene to be transcribed looks like this :
  upstream        ~17 bp       The gene to transcribe     downstream
where the region between -35 and -10 base pairs is called promoter, and |T| stands for terminator. The DNA between promoter and terminator is copied to mRNA, which is then translated into protein.

Promoters can differ in strength, that is, how attractive they are for RNAP. The more similar they are to a consensus sequence, the stronger they are. The "ideal" promoter in E. coli looks like this:

5'----TTGACA---|17 bp|----TATAAT---|7bp|---|purines|----3'


The RNA polymerase
holoenzyme consists of a core, made of four subunits (2α, β, β'), ω factor and the σ factor. The followings steps occur upon initiation:
  1. The RNAP recognizes the promoter region of the gene and binds to the DNA at that specific location. At this stage, the DNA is still double-stranded and called closed complex.
  2. The DNA is unwound and becomes single-stranded at the initiation site (the -10 promoter region). This is called open complex.
  3. The DNA is melted (the strands are locally separated), the σ-factor leaves the holoenzyme, and the transcription process begins. This is the elongation phase.


The RNAP runs along the DNA, synthesizing mRNA in the process. In bacteria, the nascending mRNA is processed right away by ribosomes.


The elongation stops if:
  • The terminator is reached. The terminator is usually a palindromic DNA sequence that forms a hairpin.
  • A ρ factor (a protein) binds and runs along the mRNA towards the RNAP. When ρ-factor reaches the RNAP, it causes RNAP to dissociate from the DNA, terminating transcription.
  • The RNAP comes across a region with repetitious base pairs (for example, TTTTTT). This will terminate transcription.

Eukaryotic transcription

Gene expression in eukaryotes is largely controlled by transcription via transcription factors. As eukaryotes are much more complex than prokaryotes, and have their genetic material stored in the nucleus, the transcription mechanisms are more complicated here. For example, eukaryotes have three RNA polymerases, in contrast to prokaryotes, which only have one.
  • RNA Polymerase I is located in the nucleolus and transcribes only rRNAs.
  • RNA Polymerase II is the "standard" RNAP.
  • RNA Polymerase III transcribes tRNAs and other small RNAs.
Also, eukrayotic RNAPs need specific accessory proteins to become active. The C-terminus of all RNAPs is highly conserved and contains the actual transctiptional mechanism.


The core promoter of eukaryotic genes stretches from position -45 to 0. Additionally, there can be an upstream control element present at the -180 to -107 region, which can amplify the RNAP binding by a factor of up to 100. This UCE usually contains a TATA box, a highly conserved DNA sequence that reads
A similar sequence, thus not that highly conserved, is found in the INR element (initiator element, part of the complex core promoter).



A major difference between prokaryotic and eukaryotic transcription is that the latter have splicing of the primary transcript, modifying the mRNA created during transcription.

See also:

copyright 2004