Lightning Fast and Highly Sensitive Full-Length Single-cell sequencing using FLASH-Seq
Preprint posted on July 14, 2021 https://www.biorxiv.org/content/10.1101/2021.07.14.452217v1
Understanding how differences in the DNA blueprint (genome) of a cell translate into differences in physical features (phenotype) can help us understand complex diseases like cancer (1). By investigating changes in gene expression levels i.e the ‘transcriptome’ of a cell, we can examine these effects. However, most transcriptomes examine populations of cells giving an overall picture of more common events. This means you miss information about what happens in individual cells, which is important, as individuals can have very different transcriptomes (2). It is possible to study the transcriptome of single cells using single-cell RNA sequencing (scRNA-seq), however most protocols are very time consuming, very expensive and, in some cases, lack sensitivity. Here, the authors modify a previously established scRNA-seq sample processing pipeline to create a protocol which is overall cheaper, more sensitive, and less time-consuming. Their approach ‘FLASH-seq’ will help improve the accessibility of scRNA-seq.
How does SMART-seq work?
FLASH-seq is based upon SMART-seq (switching mechanism at the 5’ end of the RNA template; 3). RNA is extracted from cells and converted into complementary DNA (cDNA) using a template switching reverse transcriptase (RT) enzyme. This enzyme adds a small sequence of bases (for example C nucleotides) to the 5’end of a full-length molecule of cDNA. It then stops and move to another molecule. This process is repeated until in theory all cDNA molecules now also have this small extra sequence of DNA at their 5’ end. These extra sequences, for example several C DNA bases, can now be recognised by another primer, allowing a second strand of DNA to be produced. Polymerase Chain Reaction (PCR) is then used to amplify the cDNAs. The cDNAs are then fragmented (‘broken up’) and prepared for sequencing using a next generation sequencing method. During this process an enzyme known as a transposase (tn5 transposase) adds a known short DNA sequence (Sequence Adaptor) required for the sequencing process. In a new protocol, it is now possible to also add Unique Molecular Identifies (UMIs) to cDNA molecules to enable researchers to find PCR duplicates (i.e duplicates from the same cDNA which enter different beads or flow cells during sequencing), count transcripts and help find rare variations of transcripts as some of these can be excluded during processing.
How does FLASH-seq improve the process?
Here, the authors develop their FLASH-seq protocol using human peripheral blood mononuclear cells (hPBMCs) from healthy individuals and from cultured HEK293T cells (an established line of human embryonic kidney cells). They sorted single cells using a flow cytometry approach into 96 well or 384 well plates. They compare FLASH-seq to established SMART-seq protocols.
They introduce several key modifications:
- Instead of using two steps to amplify the cDNA, they use one step to transcribe the RNA directly to cDNA (RT-PCR)
- They optimised the RT-PCR cycles
- They use an improved RT enzyme i.e less time is needed to make cDNA molecules
- They add extra nucleotides to the reaction to improve the number of cDNA molecules which have the extra 5′ sequences added by the template switching RT enzyme.
- Modified the primers used to copy the second strand to limit strand invasion. This is necessary as strand invasion results in shorter cDNA molecules as the RT process does not complete. This mean you would not get a complete cDNA molecule (4).
- No additional PCR amplification as the amplification during the RT-PCR reaction was adequate
- Modifications to the clean-up reaction after the RT PCR by limiting sample washing
These modifications improve:
- 8x more cDNA than other SMART-seq approaches
- Full coverage of sequence across gene bodies
- Works in a challenging cell line like hPBMCs
- Around $1 per cell
- Uses smaller volumes of reagents which cuts the costs
- FLASH-seq outperformed other SMART-seq methods at these volumes
- It is possible to also add UMI’s to cDNA molecules during the FLASH-seq protocol
- Adding additional space sequences in the primers used to amplify the second strand and add the UMI meant they found less artifactual strand invasion events
- Can be run in a 96 well or 384 well plate format
- Can be handled by most automated lab liquid handling robots
- Sequence ready libraries in about 4.5 hours
- A more ‘rapid’ version of FLASH-seq known as FLASH-seq LA (low amplification), is also available. This approach allows minimal RT-PCR cycles and very limited purification of PCR samples but still achieves high quality results.
Figure shows a selection of data from Hahaut et al. Fig. 1B shows how many genes were detected using different SMART-seq protocols (SS2 and SS3) compared with FLASH-seq (FS). Fig. 1D shows the correlation between gene expression levels across SS2, SS3 and FLASH-seq protocols. Fig. 1F shows the number of genes detected in hPBMCs using SS2, SS3, a commercial single cell kit SSsc (by Takara Bio) and the FS protocol using the same volume for the final reaction. Fig. 2A. shows an illustration of the workflow for the FLASH-seq low amplification (LA) approach compared to FS. Fig. 2F compares the number of genes detected using either the FS or FS-LA approach in hPBMCs. All figures were adapted from the preprint manuscript (CC-BY-NC-ND 4.0 International license).
What I liked about this preprint:
scRNA-seq is a new and powerful tool for looking at how individual cells behave. One major barrier however is the cost in addition to the time required to prepare samples. FLASH-seq now offers an opportunity for researchers to not only prepare high quality scRNA-seq samples rapidly but it reduces the costs significantly per cell. Though such experiments are still ‘costly’, this price reduction now means scRNA-seq technology will hopefully become much more accessible to a larger community of research labs.
Questions for Authors:
Q1: What was the most challenging part of the FLASH-seq protocol to optimise?
Q2: Here you have tested FLASH-seq on human cell lines. Do you anticipate any limitations when applying this technique to other experimental organisms, for example yeasts?
Q3: Do you see any sequencing artefacts related to the increased amount of C nucleotides during the RT-PCR step?
Q4: Alternative Polyadenylation (A)A events are also important perpetrators of gene expression alterations. Can you reliably detect APA events using FLASH-seq?
Q5: Did you notice any differences in the efficacy of samples prepared using Superscript IV versus Maxima H-Reverse Transcriptase?
- Zhang et al. Single-cell RNA sequencing in cancer research (2021). J. Experimental and Clinical Cancer Research
- L. Goldman et al. The impact of heterogeneity on Single-Cell sequencing. (2019). Frontiers in Genetics
- Picelli. Single-cell RNA-sequencing: The future of genome biology is now. (2017). RNA Biology
- T.P. Tang et al. Suppression of artifacts and barcode bias in high-throughput transcriptome analyses utilizing template switching. (2013). Nucleic Acids Research.
Posted on: 12th August 2021Read preprint
Also in the genomics category:
FAIR enough? A perspective on the status of nucleotide sequence data and metadata on public archives
|Selected by||Kristina Kuhbandner|
CDK9 and PP2A regulate the link between RNA polymerase II transcription termination and RNA maturation
|Selected by||Isabella Maudlin|
Origin, specification and differentiation of a rare supporting-like lineage in the developing mouse gonad
|Selected by||Martin Estermann|
preListsgenomics category:in the
20th “Genetics Workshops in Hungary”, Szeged (25th, September)
In this annual conference, Hungarian geneticists, biochemists and biotechnologists presented their works. Link: http://group.szbk.u-szeged.hu/minikonf/archive/prg2021.pdf
|List by||Nándor Lipták|
EMBL Conference: From functional genomics to systems biology
Preprints presented at the virtual EMBL conference "from functional genomics and systems biology", 16-19 November 2020
|List by||Jesus Victorino|
Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20
|List by||Maiko Kitaoka et al.|
A compilation of cutting-edge research that uses the zebrafish as a model system to elucidate novel immunological mechanisms in health and disease.
|List by||Shikha Nayar|