Principle of Illumina sequencing
Illumina sequencing (short read sequencing)
one of the Next Generation Sequencing (NGS) technologies
Most widely applied sequencing technique
Generates not one but millions of sequences at the same time
read = single sequence
high accuracy (>99.9%)
Nucleotides with reversible terminated 3’-OH group
Sequencing by synthesis - step by step
Assumptions:
Blue: Single stranded temple = sequence of interest
The DNA strand is fixed to a surface at the 5’-OH end.
Two terminal adapters (of known sequence) are ligated to both ends of the DNA strand.
A primer (oligonucleotide) is bound at one side.
Oligonucleotide as a free 3’ OH group.
Nucleotides with reversiblee terminating groups are equally frequent in the reaction.
DNA polymerase incorporates a single complementary nucleotide (matching wit the template).
Fluorescent dye acts as a terminator of polymerization (=strand synthesis).
Strand synthesis is terminated.
The excitation of the fluorescent dye happen at a specific wavelength.
The detection of the light emission can be used to identify the nucleotide.
Next, the fluorescent dye is removed by enzymatic cleavage.
The exposure of free 3’ OH group enables the elongation with the next ddNTP.
Which bases have been sequenced?
From top we see only the emitted light
DNA fragmentation prior to sequencing
Usually DNA is too large to be sequenced directly.
DNA is fragmented (enzymatically/mechanically) into smaller pieces.
Fragment sizes differ depending on the applied protocol.
Adapters with known sequences are added to the fragments.
Sequencing adapter
Binding of sequences to the flow cell
Flow cell is a slide on which both capture sequences are attached.
The attached capture sequences complement the adapters that have been ligated to the insert.
The entire workflow as a video:
Take-home message
Illumina uses
sequencing by synthesis
.
Billions of DNA fragments (
reads
) are sequenced in parallel on a flow cell
Large number of reads! But reads are short:
50 to 300 nucleotides
Clusters are created through
bridge PCR
amplification
Bases are detected by fluorescent labeled nucleotides
Strength:
high accuracy
Limitation:
short read length