TRANSCRIPTEnglish

Sanger DNA Sequencing, From Then to Now.

14m 32s2,000 words123 segmentsEnglish

FULL TRANSCRIPT

0:00

ClevaLab.

0:01

In 1977 Frederick Sanger described a method of DNA sequencing using chain-terminating Inhibitors. The

0:10

aim was to find out the sequence of nucleotides in a piece of DNA. This method became known as Sanger

0:16

sequencing. These chain-terminating Inhibitors are also called ddNTPs. DNA is made up of a chain

0:25

of four different nucleotides called dNTPs. To copy DNA and grow the DNA double strand.

0:32

DNA polymerase adds the complementary nucleotide. dNTP stands for deoxyribonucleoside triphosphate.

0:42

A closer look at its structure shows that a dNTP is one deoxyribose, a base and a triphosphate.

0:50

A nucleoside is a ribose sugar and base together. The base is one of four bases,

0:57

Guanine (G), Cytosine (C), Thymine (T) or Adenine (A). The sugar is deoxyribose because it has

1:04

one less oxygen than ribose. ddNTP is short for Dideoxyribonucleoside triphosphate. A ddNTP has

1:14

two oxygens less than ribose, as di- means two. The role of DNA polymerase is to add new bases to

1:23

a growing DNA strand. It does this by catalyzing a chemical reaction. The incoming dNTP's phosphate

1:31

group reacts with the bound dNTP's ribose oxygen. This results in the release of two

1:37

phosphate groups and the addition of dNTP to the strand. But, if a ddNTP gets added to the strand,

1:45

there is no ribose oxygen to add another dNTP. This lack of oxygen terminates the DNA chain.

1:52

It also makes sense to mention here the naming conventions 5' and 3'. 5' and 3' refer to the

2:01

positions of the carbon atoms in the deoxyribose of dNTP. They're numbered from the carbon linked

2:08

to the base to the phosphate. The oxygen needed to add new dNTPs to the DNA strand is bound to

2:15

the 3' carbon. So it's common to say that the DNA extends from the 3' end. The other

2:22

sticky part of the dNTP is the triphosphate.The triphosphate is bound to the 5' carbon. This end

2:29

of the dNTP is the start, and the 3' end is the finish. When you write down a sequence of DNA,

2:36

the order of nucleotides is always in the 5' to 3' direction. Also, note that the DNA polymerase only

2:44

adds a complementary base to the template DNA. So, C always pairs with G and A always with T.

2:52

So how does Sanger sequencing work? The original singer sequencing method is different from

2:58

the one used today. The original method was completely manual and used radioactive dyes.

3:04

Let's take a look at the original Sanger sequencing method. We need a primer,

3:09

DNA polymerase, dNTPs, DNA template and ddNTPs. One of the dNTPs, dATP, is labelled with a

3:20

radioactive tag. A total of four tubes, one for each ddNTP, are used. To start, the DNA,

3:27

primer and buffer are heated to 100 degrees. This separates the DNA into single strands. Remember,

3:34

this was before PCR existed. Heating up regular DNA polymerase inactivates it. So,

3:41

it gets added later. Next, the mixture cools to 67 degrees to allow the sequencing primers to

3:48

bind. Now we add DNA polymerase, all four dNTPs and one of the four ddNTPs to each tube. DNA

3:57

polymerase extends the DNA template. A ddNTP incorporates into the strand, terminating the

4:04

fragment. The ddNTP is at a lower concentration than the dNTPs, so this incorporation is random.

4:12

The result is a termination at each base, creating different-length fragments. All fragments in each

4:19

tube start with the same primer sequence and end in the same nucleotide. Low incorporation of the

4:26

ddNTP allows the sequencing of longer stretches of DNA. In the original Sanger method, sequencing of

4:34

up to 200 nucleotides was possible. Next, the four sequencing reactions get mixed together with a

4:41

loading dye. Each reaction is loaded in a separate lane of a polyacrylamide gel. The fragments move

4:48

through the gel at different speeds depending on their size. The smallest moves the fastest. This

4:55

type of gel can differentiate a single nucleotide difference in length. At this stage, the fragments

5:02

can't be seen. The loading dye tells you when the fragments have reached the end of the gel. To

5:08

visualize the fragments, the sequencing gel gets dried onto a paper support. Then, the radiation

5:14

from the dATPs in the fragments gets detected with X-ray film. This results in bands showing for each

5:22

fragment. The term used for reading a DNA sequence is "base calling". The DNA is read from 5' to 3'

5:30

to call the bases. So we start with the shortest fragment first. In this case, it's in the lane

5:37

with a ddTTP, so the first nucleotide is a "T". The next shortest is in the ddGTP lane and thus

5:45

is a "G". You continue up the gel based on size to read the whole sequence. So, on this gel, it would

5:52

read TGCATGCCA. The original Sanger sequencing method was very labour-intensive. It also took

6:03

four days to sequence 200 nucleotides from only a few samples. There was a great need to streamline

6:10

and automate this process. Applied Biosystems created the first commercial sequencing instrument

6:16

in 1987, the AB370A. Applied Biosystems had already shown that fluorescent dyes could replace

6:24

radioactive dyes. These are safer and cut out the time needed for X-ray film detection, which took

6:31

several days. In this instrument, the sequencing reaction had fluorescent sequencing primers. A

6:38

different coloured fluorescent dye labelled each of the four ddNTP reactions. After sequencing, the

6:44

four reactions could be mixed together and loaded in the same lane of the gel. The AB370A also had

6:52

a laser that scanned the bottom of the gel. This laser detected the fragments as they passed by.

6:58

The instrument fed the data into a computer to call the bases automatically. Sixteen samples

7:04

could be run on one gel with a read length of 450 nucleotides. The AB370A showed that sequencing

7:12

could be faster and more automated. Scientists started to think sequencing the whole human genome

7:19

could be within reach. In 1990 the U.S. government announced the Human Genome Project. This project

7:26

aimed to map and sequence all the genes in the human genome. By 1990 only <2% of the human genome

7:34

had been sequenced. Sequencing the human genome would have important implications for science

7:39

and medicine. In identifying disease-causing and associated genes to treat genetic disease. Kary

7:46

Mullis invented PCR in 1983. It wasn't until 1989 that Vincent Murray used Taq polymerase for Sanger

7:55

sequencing. In Sanger sequencing, the primer binds to the DNA, and the DNA polymerase extends the

8:02

fragment. But, as the primer is in excess. Most of the labelled sequencing primers are not extended

8:08

by DNA polymerase. With Taq polymerase, the DNA can be melted apart after the first extension.

8:15

Taq polymerase will survive this high heat. It can then be cooled again to anneal another sequencing

8:22

primer. These cycles of melting, annealing and extension repeat the same as in PCR. Many more

8:29

primers get incorporated into the fragments, increasing the fluorescent signal. But, as there

8:35

is only one primer. Only extra forward strands are made, and no reverse strands. So the number

8:41

of fragments increases by the same amount each cycle. This increase is linear over the cycles

8:47

and is called linear PCR. The method was later termed cycle sequencing. The higher fluorescent

8:54

signal also meant that less DNA was needed for each reaction. Another important advance was

9:00

in capillary electrophoresis. This is where a small amount of gel is in a fine tube. The DNA

9:07

is taken in one end, runs through the gel under an electric current, and gets detected by a laser

9:12

at the other end. The fine tube used in capillary electrophoresis allows heat to escape. A higher

9:19

current can be used without the gel overheating. Higher currents mean a faster run time and better

9:25

resolution. Beckman Coulter launched the first commercial capillary electrophoresis instrument

9:31

in 1989. This paved the way for the development of a capillary-based Sanger sequencing system,

9:38

the ABI PRISM 310. Applied Biosystems launched this system in 1995, and modern Sanger sequencing

9:47

was born. The ABI PRISM 310 had one capillary for electrophoresis in place of a PAGE gel. One sample

9:56

could be run in under three hours compared to 14 hours. The sequencing length was also improved

10:02

and could now sequence up to 600 base pairs. The capillary also allowed automation of the sample

10:08

loading. Up to 96 samples could be loaded in a plate on the system and left to run on its own.

10:15

Due to electrokinetic injection, low sample volumes and amounts of DNA are needed. This

10:22

is because DNA is pulled into the capillary by an electrical current. The current concentrates it at

10:29

the end of the capillary. The capillary then moves into a running buffer. Fragments pass through the

10:35

gel and separate based on size. Then the fragments pass by a laser at the end of the capillary. The

10:42

size and colour of the fragments get sent to a computer. The software then detects and calls

10:47

the bases. While fluorescent dNTPs were available. Sequencing was still performed with fluorescent

10:54

primers. This was because the peak heights were very even with fluorescent primers. Labelled

11:00

ddNTPs couldn't achieve this even peak height. Not until the introduction of BigDyeTerminators in

11:06

1997. With fluorescent primers, four reactions are needed. But, with fluorescent ddNTPs,

11:13

the sequencing reactions can all be in the same tube. Applied Biosystems continued to

11:19

improve its system. Demand continued to grow for the automation of Sanger sequencing. The Human

11:25

Genome Project was making slow progress. By 1998, only 6% of the human genome was sequenced. It was

11:33

in this year that Applied Biosystems launched the ABI PRISM 3700, which had 96 capillaries. At the

11:41

same time, they announced a partnership with The Institute of Genome Research, also known as TIGR.

11:48

TIGR was a not-for-profit institute headed by Craig Venter. Together they formed a new company

11:54

called Celera and purchased 230 ABI PRISM 3700s. Celera aimed to sequence the human genome faster

12:03

than the Human Genome Project. It planned to make money selling access to its sequence data. It also

12:09

planned to patent genes that could be useful for disease treatment. Profiting from sequencing the

12:15

human genome was controversial and upset many scientists. The race between public and private

12:22

sequencing of the human genome had begun. The ABI PRISM 3700 played a huge role in sequencing the

12:29

human genome. Each run of 96 samples took less than 2.5 hours and generated 800 base pairs of

12:37

sequence for each sample. With only 15 minutes of hands-on time by a technician, 1,536 samples could

12:45

be sequenced daily. With this instrument, the cost per base of sequencing was also reduced. with this

12:51

new technology, Celera produced a draft sequence of the human genome in three years. Publishing

12:57

their results in 2001. The Human Genome Project, also aided by ABI PRISM 3700, published its draft

13:05

genome at the same time in 2001. This modified Sanger sequencing method is still used today. But

13:13

why when there are newer technologies like Next Generation Sequencing (NGS). Let's look at how

13:18

they compare. Sanger sequencing remains the gold standard for sequencing. It is the method that all

13:24

other sequencing methods are compared against. This is because it's 99.9% accurate in calling

13:31

bases. NGS is 99 to 99.9% accurate but depends on the sequencing depth. Sanger sequencing is

13:41

more cost-effective for sample numbers under 20. It's also faster for this amount of samples. For

13:48

large sample numbers, NGS is more cost-effective and quicker to run. But, the sensitivity of Sanger

13:54

sequencing to detect a base within a background of other DNA is only 15 to 20%. Compared to NGS

14:01

with a sensitivity of 1%. Sanger sequencing also has a low sample coverage of one read per sample

14:08

of only 300 to 850 base pairs. In comparison, NGS can generate billions of reads per sample

14:16

of up to 16 Tb. So big that 128 human genomes can be sequenced in one run. So if you have less

14:25

than 20 samples or genes you'd like to sequence, Sanger sequencing is still the method of choice.

UNLOCK MORE

Sign up free to access premium features

INTERACTIVE VIEWER

Watch the video with synced subtitles, adjustable overlay, and full playback control.

SIGN UP FREE TO UNLOCK

AI SUMMARY

Get an instant AI-generated summary of the video content, key points, and takeaways.

SIGN UP FREE TO UNLOCK

TRANSLATE

Translate the transcript to 100+ languages with one click. Download in any format.

SIGN UP FREE TO UNLOCK

MIND MAP

Visualize the transcript as an interactive mind map. Understand structure at a glance.

SIGN UP FREE TO UNLOCK

CHAT WITH TRANSCRIPT

Ask questions about the video content. Get answers powered by AI directly from the transcript.

SIGN UP FREE TO UNLOCK

GET MORE FROM YOUR TRANSCRIPTS

Sign up for free and unlock interactive viewer, AI summaries, translations, mind maps, and more. No credit card required.

GET STARTED FREE SIGN IN