In any compiled language, there's a step between source code and execution: compilation produces an intermediate representation — bytecode, IR, assembly. You don't run Java source files; you run .class files. The intermediate form is more convenient for execution, shorter-lived than the source, and can be optimized, cached, or discarded.
plays exactly this role in biology. is the persistent, protected source of truth. are the executing programs. is the transient intermediate that bridges them — produced on demand, modified, transported, , and degraded. But "intermediate" undersells it: turns out to have rich functionality of its own, including catalytic activity, regulatory roles, and structural functions.
Transcription: Synthesizing RNA from DNA
is the process by which polymerase a template and synthesizes a complementary strand. In eukaryotes, this happens in the nucleus.
The steps:
-
Initiation: bind the , recruit polymerase II (for -coding ), and open the double helix at the start site.
-
Elongation: pol II moves 3'→5' along the template strand, synthesizing 5'→3'. Unlike polymerase, it doesn't need a primer. Speed: ~20–50 /second.
-
Termination: pol II encounters a termination signal downstream of the and releases the transcript.
The product is the pre- (primary transcript) — a complete copy of the genomic region between start and termination sites, including all .
Eukaryotes have three distinct polymerases:
- Pol I — transcribes ribosomal (rRNA)
- Pol II — transcribes -coding → , plus most non-coding RNAs
- Pol III — transcribes tRNA, 5S rRNA, and other small structural RNAs
Bacteria have just one polymerase that does everything. This is why antibiotics that target bacterial polymerase (like rifampicin) are selective — the bacterial is structurally distinct from all three eukaryotic ones.
mRNA Processing: From Pre-mRNA to Mature mRNA
The pre- is extensively processed before it leaves the nucleus:
5' Capping
Shortly after begins, the 5' end of the pre- receives a 7-methylguanosine cap. This cap:
- Protects the from degradation by 5'→3' exonucleases
- Is recognized by the ribosome to initiate
- Is recognized by export machinery to transport out of the nucleus
The cap is not a standard — it's added in a 5'→5' linkage (reversed compared to the rest of the strand) by specific capping .
Polyadenylation
The 3' end of the pre- is cleaved ~10–30 downstream of a consensus signal (AATAAA in the , AAUAAA in the ). Then poly(A) polymerase adds a tail of ~150–250 adenine — the poly-A tail.
The poly-A tail:
- Protects from 3'→5' exonuclease degradation
- Promotes export from the nucleus
- Is recognized by poly-A binding (PABP), which promotes efficient
- Can be shortened over time — shorter tails correlate with faster degradation (this is a major post-transcriptional regulation mechanism)
Almost all -seq () protocols capture mRNAs using oligo-dT beads — short stretches of deoxyadenosine that hybridize to poly-A tails. This means standard specifically selects for polyadenylated transcripts, which includes most but not all mRNAs, and excludes most non-coding RNAs. If you want to capture rRNA or non-polyadenylated RNAs, you need ribo-depletion rather than poly-A selection.
Splicing
As discussed in the chapter, are removed and joined by the spliceosome. of the same pre- can produce different isoforms encoding different . We'll cover this in depth in Chapter 3.3.
The RNA Classes: More Than Just Messengers
When biologists say "," most people think . But is a small fraction of total cellular by mass. Here's the full landscape:
mRNA (Messenger RNA)
The working copies of -coding . Constitutes only ~2–5% of total by mass despite being the most diverse class. Lifetime: minutes to hours.
rRNA (Ribosomal RNA)
The structural and catalytic core of ribosomes. Makes up ~80% of total cellular by mass. Extremely stable. The 16S rRNA (prokaryotes) and 18S rRNA (eukaryotes) are frequently used as phylogenetic markers for species identification — their sequences evolve slowly enough to be conserved, but fast enough to distinguish species. 16S rRNA is the standard method for characterizing gut microbiome composition.
tRNA (Transfer RNA)
Small RNAs (~70–90 ) that carry to the ribosome. Each tRNA has an anticodon loop that -pairs with an codon, and a 3' CCA end that gets aminoacylated (loaded with the correct ) by aminoacyl-tRNA synthetases. There's one synthetase per , and they are among the most accurate molecular machines in the — error rate ~1 in 10⁴.
miRNA (MicroRNA)
Small (~22 ) non-coding RNAs that regulate post-transcriptionally. They -pair (often imperfectly) with sequences in the 3' UTR of target mRNAs, recruiting the RISC complex (-Induced Silencing Complex) to degrade or stall of the target. Each miRNA can regulate hundreds of targets; each can be regulated by dozens of miRNAs.
A miRNA functions like a feature flag system that acts on the level. The flag (miRNA) checks whether a specific pattern is present in the transcript (3' UTR sequence), and if so, reduces expression — not by editing the , but by suppressing the working copy. Different types express different miRNA sets, tuning the same to different expression profiles.
lncRNA (Long Non-Coding RNA)
RNAs >200 that don't encode . Over 50,000 human lncRNA have been identified — more than -coding . Functions include: chromatin remodeling, transcriptional regulation, regulation, and acting as decoys for miRNAs (competing endogenous RNAs). Many lncRNAs are tissue-specific and dysregulated in disease. The XIST lncRNA is a canonical example — it coats the inactive X and silences it in female .
snRNA and snoRNA
Small nuclear RNAs (snRNA) are core components of the spliceosome (U1, U2, U4, U5, U6). Small nucleolar RNAs (snoRNA) guide chemical modifications of rRNA and tRNA. Both are required for normal processing and are constitutively expressed.
siRNA (Small Interfering RNA)
Double-stranded RNAs that trigger sequence-specific degradation through the RNAi — the same as miRNA but with perfect complementarity to the target. In nature, siRNAs defend against and transposons. In the lab, synthetic siRNAs are a standard tool for knocking down . RNAi therapeutics (like Patisiran and Inclisiran) use modified siRNAs as drugs.
RNA Structure and the RNA World
Unlike , can fold into complex secondary and tertiary structures — because its 2'-OH group allows more diverse hydrogen bonding patterns. structures include:
- Stem-loops (hairpins): helical double-stranded regions with single-stranded loops
- Pseudoknots: more complex topologies where two stem-loops interlock
- Riboswitches: structures in the 5' UTR that change conformation in response to metabolite binding, controlling without
Ribozymes are catalytic RNAs. The ribosome itself — the machine that synthesizes all — is fundamentally an machine. The peptidyl transferase center that makes peptide bonds is composed of rRNA, not . This supports the world hypothesis: early life may have relied on for both information storage and catalysis, before and took over their respective roles.
RNA Stability and Degradation
is deliberately unstable — typical half-lives range from minutes (for rapidly regulated like immediate early response ) to hours (for housekeeping ). This instability allows the to rapidly change its output in response to signals.
Degradation :
- Deadenylation: Poly-A tail is shortened by deadenylases. When the tail is gone, decapping remove the 5' cap.
- 5'→3' decay: Once the cap is gone, the exoribonuclease Xrn1 degrades the rapidly.
- 3'→5' decay: The exosome complex can degrade from the 3' end.
- NMD (Nonsense-Mediated Decay): Special that detects and destroys mRNAs with premature stop codons — a quality control mechanism that prevents synthesis of truncated, potentially harmful .
Understanding stability is important for analysis: more stable transcripts accumulate to higher steady-state levels, so abundance reflects both rate and degradation rate. A that's rapidly but degraded quickly can have lower steady-state than a slowly but very stably.
The RNA Biology Revolution
Before the 2000s, was considered primarily a passive intermediate. The discovery of interference (RNAi, 1998 Nobel Prize in 2006), the non-coding explosion from ENCODE (2012), and the application of as medicine ( vaccines, 2021 Nobel Prize) have completely rewritten that view.
Today, biology intersects with:
- Transcriptomics (, single- )
- Epitranscriptomics ( modifications like m6A methylation)
- Structural biology (cryo-EM structures of ribosomes, spliceosomes)
- Therapeutics ( vaccines, siRNA drugs, ASO drugs)
- Diagnostics (liquid biopsy via circulating )
is not a secondary player in the . It's where most of the regulation actually happens.
RNA is a single-stranded nucleic acid that serves as an intermediary between DNA and protein. mRNA carries the genetic message; tRNA brings amino acids; rRNA forms the ribosome. RNA is chemically less stable than DNA — it degrades quickly, which is by design.
mRNA is a short-lived build artifact — compiled from DNA (source), executed once by ribosomes (runtime), then garbage collected. Its instability is a feature: the cell can adjust protein output by controlling mRNA half-life. tRNA is the ABI layer that maps 3-character codons to amino acids. rRNA is the virtual machine that runs the translation.