Lightning Fast and Highly Sensitive Full-Length Single-cell sequencing using FLASH-Seq
Preprint posted on 14 July 2021 https://www.biorxiv.org/content/10.1101/2021.07.14.452217v1
Article now published in Nature Biotechnology at http://dx.doi.org/10.1038/s41587-022-01312-3
Understanding how differences in the DNA blueprint (genome) of a cell translate into differences in physical features (phenotype) can help us understand complex diseases like cancer (1). By investigating changes in gene expression levels i.e the ‘transcriptome’ of a cell, we can examine these effects. However, most transcriptomes examine populations of cells giving an overall picture of more common events. This means you miss information about what happens in individual cells, which is important, as individuals can have very different transcriptomes (2). It is possible to study the transcriptome of single cells using single-cell RNA sequencing (scRNA-seq), however most protocols are very time consuming, very expensive and, in some cases, lack sensitivity. Here, the authors modify a previously established scRNA-seq sample processing pipeline to create a protocol which is overall cheaper, more sensitive, and less time-consuming. Their approach ‘FLASH-seq’ will help improve the accessibility of scRNA-seq.
How does SMART-seq work?
FLASH-seq is based upon SMART-seq (switching mechanism at the 5’ end of the RNA template; 3). RNA is extracted from cells and converted into complementary DNA (cDNA) using a template switching reverse transcriptase (RT) enzyme. This enzyme adds a small sequence of bases (for example C nucleotides) to the 5’end of a full-length molecule of cDNA. It then stops and move to another molecule. This process is repeated until in theory all cDNA molecules now also have this small extra sequence of DNA at their 5’ end. These extra sequences, for example several C DNA bases, can now be recognised by another primer, allowing a second strand of DNA to be produced. Polymerase Chain Reaction (PCR) is then used to amplify the cDNAs. The cDNAs are then fragmented (‘broken up’) and prepared for sequencing using a next generation sequencing method. During this process an enzyme known as a transposase (tn5 transposase) adds a known short DNA sequence (Sequence Adaptor) required for the sequencing process. In a new protocol, it is now possible to also add Unique Molecular Identifies (UMIs) to cDNA molecules to enable researchers to find PCR duplicates (i.e duplicates from the same cDNA which enter different beads or flow cells during sequencing), count transcripts and help find rare variations of transcripts as some of these can be excluded during processing.
How does FLASH-seq improve the process?
Here, the authors develop their FLASH-seq protocol using human peripheral blood mononuclear cells (hPBMCs) from healthy individuals and from cultured HEK293T cells (an established line of human embryonic kidney cells). They sorted single cells using a flow cytometry approach into 96 well or 384 well plates. They compare FLASH-seq to established SMART-seq protocols.
They introduce several key modifications:
- Instead of using two steps to amplify the cDNA, they use one step to transcribe the RNA directly to cDNA (RT-PCR)
- They optimised the RT-PCR cycles
- They use an improved RT enzyme i.e less time is needed to make cDNA molecules
- They add extra nucleotides to the reaction to improve the number of cDNA molecules which have the extra 5′ sequences added by the template switching RT enzyme.
- Modified the primers used to copy the second strand to limit strand invasion. This is necessary as strand invasion results in shorter cDNA molecules as the RT process does not complete. This mean you would not get a complete cDNA molecule (4).
- No additional PCR amplification as the amplification during the RT-PCR reaction was adequate
- Modifications to the clean-up reaction after the RT PCR by limiting sample washing
These modifications improve:
- 8x more cDNA than other SMART-seq approaches
- Full coverage of sequence across gene bodies
- Works in a challenging cell line like hPBMCs
- Around $1 per cell
- Uses smaller volumes of reagents which cuts the costs
- FLASH-seq outperformed other SMART-seq methods at these volumes
- It is possible to also add UMI’s to cDNA molecules during the FLASH-seq protocol
- Adding additional space sequences in the primers used to amplify the second strand and add the UMI meant they found less artifactual strand invasion events
- Can be run in a 96 well or 384 well plate format
- Can be handled by most automated lab liquid handling robots
- Sequence ready libraries in about 4.5 hours
- A more ‘rapid’ version of FLASH-seq known as FLASH-seq LA (low amplification), is also available. This approach allows minimal RT-PCR cycles and very limited purification of PCR samples but still achieves high quality results.
Figure shows a selection of data from Hahaut et al. Fig. 1B shows how many genes were detected using different SMART-seq protocols (SS2 and SS3) compared with FLASH-seq (FS). Fig. 1D shows the correlation between gene expression levels across SS2, SS3 and FLASH-seq protocols. Fig. 1F shows the number of genes detected in hPBMCs using SS2, SS3, a commercial single cell kit SSsc (by Takara Bio) and the FS protocol using the same volume for the final reaction. Fig. 2A. shows an illustration of the workflow for the FLASH-seq low amplification (LA) approach compared to FS. Fig. 2F compares the number of genes detected using either the FS or FS-LA approach in hPBMCs. All figures were adapted from the preprint manuscript (CC-BY-NC-ND 4.0 International license).
What I liked about this preprint:
scRNA-seq is a new and powerful tool for looking at how individual cells behave. One major barrier however is the cost in addition to the time required to prepare samples. FLASH-seq now offers an opportunity for researchers to not only prepare high quality scRNA-seq samples rapidly but it reduces the costs significantly per cell. Though such experiments are still ‘costly’, this price reduction now means scRNA-seq technology will hopefully become much more accessible to a larger community of research labs.
Questions for Authors:
Q1: What was the most challenging part of the FLASH-seq protocol to optimise?
Q2: Here you have tested FLASH-seq on human cell lines. Do you anticipate any limitations when applying this technique to other experimental organisms, for example yeasts?
Q3: Do you see any sequencing artefacts related to the increased amount of C nucleotides during the RT-PCR step?
Q4: Alternative Polyadenylation (A)A events are also important perpetrators of gene expression alterations. Can you reliably detect APA events using FLASH-seq?
Q5: Did you notice any differences in the efficacy of samples prepared using Superscript IV versus Maxima H-Reverse Transcriptase?
- Zhang et al. Single-cell RNA sequencing in cancer research (2021). J. Experimental and Clinical Cancer Research
- L. Goldman et al. The impact of heterogeneity on Single-Cell sequencing. (2019). Frontiers in Genetics
- Picelli. Single-cell RNA-sequencing: The future of genome biology is now. (2017). RNA Biology
- T.P. Tang et al. Suppression of artifacts and barcode bias in high-throughput transcriptome analyses utilizing template switching. (2013). Nucleic Acids Research.
Posted on: 12 August 2021Read preprint
Also in the genomics category:
Single-cell epigenomic reconstruction of developmental trajectories in human neural organoid systems from pluripotency
The brittle star genome illuminates the genetic basis of animal appendage regeneration
Double-strand breaks in facultative heterochromatin require specific movements and chromatin changes for efficient repair
preListsgenomics category:in the
preLights peer support – preprints of interest
This is a preprint repository to organise the preprints and preLights covered through the 'preLights peer support' initiative.
|preLights peer support
9th International Symposium on the Biology of Vertebrate Sex Determination
This preList contains preprints discussed during the 9th International Symposium on the Biology of Vertebrate Sex Determination. This conference was held in Kona, Hawaii from April 17th to 21st 2023.
Semmelweis Symposium 2022: 40th anniversary of international medical education at Semmelweis University
This preList contains preprints discussed during the 'Semmelweis Symposium 2022' (7-9 November), organised around the 40th anniversary of international medical education at Semmelweis University covering a wide range of topics.
20th “Genetics Workshops in Hungary”, Szeged (25th, September)
In this annual conference, Hungarian geneticists, biochemists and biotechnologists presented their works. Link: http://group.szbk.u-szeged.hu/minikonf/archive/prg2021.pdf
EMBL Conference: From functional genomics to systems biology
Preprints presented at the virtual EMBL conference "from functional genomics and systems biology", 16-19 November 2020
Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20
|Maiko Kitaoka et al.
A compilation of cutting-edge research that uses the zebrafish as a model system to elucidate novel immunological mechanisms in health and disease.