FORK-seq: replication landscape of the Saccharomyces cerevisiae genome by nanopore sequencing
Preprint posted on April 10, 2020 https://www.biorxiv.org/content/10.1101/2020.04.09.033720v1
To duplicate their genome on time, many Eukaryotes initiate DNA replication at multiple different sites on their chromosomes, known as origins of replication. When replication is initiated, the movement of the machinery required for replication forms structures known as a replication forks, which move bi-directionally from the origin. When they converge together, DNA replication can terminate (1). Most genome-wide approaches to study DNA replication are performed on cell populations. This means we lose resolution and important information on DNA replication events within individual cells. Here, Hennion and colleagues have developed a new protocol for examining single molecule DNA replication using the recently developed MinION nanopore sequencing technology from Oxford Nanopore Technologies (2). This technology can allow the sequencing of native single DNA strand in real time. When each DNA base enters and travels through the ‘nanopore’ channel in the flow cell, changes in the surrounding electrical field occur and corresponding electrical signature can be matched to a DNA base. Using the yeast Saccharomyces cerevisiae (S. cerevisiae) as a model, cells were labelled with bromodeoxyuridine (BrdU), an analogue of the DNA base thymidine (3), to track DNA replication on single strands of DNA by analysing location of BrdU incorporation and orientation of BrdU-abundance gradients. They term this approach ‘FORKseq’. Their data reveals a high-resolution and genome-wide picture of DNA replication events in yeast.
FORKseq takes a similar approach to the recently published D-NAscent method (4) which allows the examination of single molecule DNA replication events using BrdU incorporation and nanopore sequencing. The differences between the two techniques lie predominantly in their computational pipelines and their BrdU labelling strategies.
Figure 1. (1) The ‘Nanopore’ embedded in the membrane (blue). An ion current is flowed across the membrane. When the DNA library is applied to the membrane containing the Nanopore, the adaptors can bind to ‘tethers’ on the membrane surface. (2) A motor protein (green) helps to guide the DNA molecule through the pore. (c) As bases enter the Nanopore, the disrupted the electrical current can be read as patterns unique to each base allowing base calling to occur. Adapted from Leggett and Clark, 2017.
Nanopore sequencing technology can distinguishing between thymidine and its base analogue BrdU in a native single strand of DNA.
By sequencing primer extension products containing the presence or absence of BrdU incorporation, a difference between the electrical charge of BrdU and thymidine could be detected by the MinION sequencer using custom python scripts (Fig. 1). Next, by implementing two different machine learning algorithms (CNN and TM), the authors assessed abundance of BrdU incorporation within 100 bp windows, with their bioinformatics approach largely agreeing with average BrdU abundance detected by their mass spectrometry analysis confirming their analysis approach is viable.
FORKseq can determine replication fork direction
To look for replication fork progression, yeast cells were cultivated (pulsed) in BrdU and thymidine conditions for 2 mins then ‘chased’ using thymidine. When the resulting DNA was sequenced, the authors observed regions in which sharp transitions from low BrdU abundance to high BrdU abundance occurred (Fig.2).
Using their machine learning approach, they found these transitions revealed the direction of the replication fork, which they confirmed using a previously published approach known as OK-seq (4,5). They also show they can resolve these sites to within ~ 200 bps of the start and initiation site giving a high-resolution picture of replication events.
FORKseq can map individual replication initiation and termination sites in the yeast genome
Replication initiation and termination sites have been previously mapped throughout the yeast genome. Using FORKseq data, the authors largely confirm the positionings of these sites and additionally identified regions of initiation (9 %) and termination (18 %) that were likely missed previously due to a lack of sensitivity for sites of infrequent usage. Here, they reveal that these additional initiation sites are new sites of DNA replication initiation out with the known origin of replication supporting the model that yeast DNA replication can commence from canonical origins (91%) and non-canonical origin sites (9%). The additional termination sites were located within regions previously only associated with DNA replication initiation (Fig. 4-7).
FORKseq is adaptable for use in other Eukaryotic systems
The FORKseq approach relies on BrdU incorporation as a measure of DNA synthesis. BrdU has been successfully used widely in other Eukaryotic organisms to study DNA replication (3) and established published protocols are available for reference in the usage of this base analogue. Though the authors do acknowledge that organisms with larger genome could likely prove challenging to analyse due to restrictions on the throughput achieved by the MinION sequencer.
What I liked about this preprint:
DNA replication often studied in a population of cells as examining single molecules of DNA can be challenging meaning we are likely more familiar with a lower resolution representative of this process. I really enjoyed reading this article by Hennion and colleagues because of their approach to tackle this lack of resolution. They present their findings clearly, acknowledge their protocol in the context of the field and other techniques developed for similar purposes and employ clever strategies to maximise their datasets.
Questions for the authors:
- Your approach could be used to address how replication dynamics are affected by the presence of exogenous replication stress, such as the addition of chemicals like Hydroxyurea. Do you foresee any challenges or limitations for FORKseq in the analysis of such data?
- As mentioned, nanopore sequencing can be challenging to ensure reproducibility between experiments. What did you find most challenging about the sample preparation and/or the data analysis to minimise variation between experiments?
(1) Fragkos, M., Ganier, O., Coulombe, P & Mechali, M. DNA replication origin activation in space and time. Nature Reviews Molecular Cell Biology, 16 (2015)
(3) Cavanagh, B.L., Walker, T., Norazit, A. & Meedeniva, A.C. Thymidine analogues for tracking DNA synthesis. Molecules, 16(9) (2011).
(4) Muller, C.A., Boemo, M.A., Spingardi, P., Kessler, B.M., Kriaucionis, S., Simpson, J.T. & Nieduszynski, C.A. Capturing the dynamics of genome replication on individual ultra-long nanopore sequence reads. Nature Methods, 16 (2019).
(5) McGuffee, S.R., Smith, D.J. & Whitehouse, I. Quantitative, genome-wide analysis of Eukaryotic replication initiation and termination. Molecular Cell, 50(1) (2013).
(6) Petryk N, Kahli M, d’Aubenton-Carafa Y, Jaszczyszyn Y, Shen Y, Sylvain M, Thermes C, Chen CL, Hyrien O. Replication landscape of the human genome. Nature Comm., 7, 10208 (2016).
Leggett, R.M & Clark, M.D. A world of opportunities with nanopore sequencing. J Exp. Botany., 68, 20 (2017).
Posted on: 27th April 2020Read preprint
Also in the genomics category:
Endogenous retroviruses are a source of enhancers with oncogenic potential in acute myeloid leukaemia
|Selected by||Jesus Victorino|
Comparative analyses of two primate species diverged by more than 60 million years show different rates but similar distribution of genome-wide UV repair events
|Selected by||Kerryn Elliott|
A SARS-CoV-2-Human Protein-Protein Interaction Map Reveals Drug Targets and Potential Drug-Repurposing
|Selected by||Robert Mahen|
preListsgenomics category:in the
Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20
|List by||Maiko Kitaoka, Madhuja Samaddar, Miguel V. Almeida, Sejal Davla, Jennifer Ann Black, Gautam Dey|
A compilation of cutting-edge research that uses the zebrafish as a model system to elucidate novel immunological mechanisms in health and disease.
|List by||Shikha Nayar|