Benchmarking Single-Cell RNA Sequencing Protocols for Cell Atlas Projects
Posted on: 1 June 2019
Preprint posted on 13 May 2019
Article now published in Nature Biotechnology at http://dx.doi.org/10.1038/s41587-020-0469-4
Systematic comparative analysis of single cell RNA-sequencing methods
Posted on:
Preprint posted on 9 May 2019
RNAseq is now a practical, high-throughput and economically viable approach to determine cell fate and state at single cell resolution. As consortia form to generate tissue/organ ‘atlases’, these preprints compare RNA sequencing techniques.
Selected by Rob HyndsCategories: bioinformatics, cell biology, molecular biology
Background
Improvements to single cell (sc) and single nuclear RNA sequencing (snRNAseq) techniques now allow us to profile the phenotype of thousands of cells in an unbiased fashion. Rapid progress in the field has led to the identification of novel, rare cell types and also depictions of dynamic changes in cellular phenotype during development and regeneration.
In efforts to emulate the success of large-scale collaborative efforts to map cancer genomes, consortia such as the Human Cell Atlas are now being formed across the world to map the transcriptional landscapes of whole tissues, organs and organisms. However, the early cancer genome sequencing efforts were hindered by large numbers of groups undertaking independent sequencing efforts in parallel, using assorted protocols that generated diverse datasets between which it was difficult to compare mutation calls. Thus, benchmarking is of critical importance early on in such processes: it could be the difference between future investigators being able to use datasets from multiple parallel research efforts to increase the power of datasets and, in an extreme example, individual datasets being useful only for validation of the initial findings.
Since competing technologies vary in terms of cell/RNA capture, and consequently library complexity, as well as cost and scalability, these two preprints benchmark scRNAseq and snRNAseq technologies, comparing protocols side-by-side using reference samples with a view to informing future studies.
Approaches
Mereu et al. compared 13 techniques using a mixed sample containing 60% human peripheral blood mononuclear cells (PBMCs), 30% mouse colonic tissue, 6% HEK293T cell line, 3% NIH3T3 mouse embryonic fibroblast cell line and 1% dog MDCK cells. Ding et al. analysed 7 techniques (of which only Seq-Well and sci-RNA-seq were non-overlapping) in three independent samples: a 50:50 mix of human HEK293 and mouse NIH3T3 cell cultures, human PBMCs and mouse cortex tissue. The use of PBMCs in both studies produces cells of varying size, RNA quantity and doesn’t require enzymatic dissociation, while the use of cells from multiple species allowed the assessment of singlets vs. multiplets in both studies as barcodes that have a contribution from >1 species can be excluded. The use of PBMCs and mouse colon in combination allows assessment of ability to distinguish cell types and cell states, respectively, as PBMCs generate distinct clusters whereas the mouse colonic epithelium is a continuum of cell states along the crypt-villus axis.
Importantly for the interpretation of these data, low-throughput methods in which cells are sorted into multiwell plates typically generate an order of magnitude more reads per cell than higher throughput methods, making them suitable for deep characterisation of particular cell types or states. By contrast, high-throughput methods separate cells into individual wells or droplets with reagents and beads that will identify the RNA from that cell during analysis and achieve throughput at the expense of depth of interrogation.
Key Findings
Using downsampled data (i.e. reducing the number of cells analysed) to ensure fair comparison between techniques, both preprints demonstrate that the total number of genes detected across samples varies according to the technique used (Fig. 2B, Mereu et al.; Fig. 2D, Ding et al.). Unsurprisingly, lower throughput techniques generally afford deeper sequencing per cell and have reduced rates of doublets. 10X Chromium emerges as the leading candidate among the high-throughput methods in both preprints, even when datasets are substantially downsampled. When analyses were limited to known, cell type-specific ‘marker’ genes, Mereu et al. find that 83% are detected by all technologies but that the performance of different technologies varies widely. In particular, Quartz-seq2 and Smart-seq2 performed cell type identification well and so might be useful for future studies seeking to annotate poorly described tissues/organs.
As a result of protocol-dependent variables, the clustering of cells varied between experiments in both preprints (see Figure 4 from Mereu et al. below). For example, Ding et al. (Fig 5A) show that protocols differ in the proportion of cells within clusters and their ability to distinguish cell types, which is notable since PBMCs should have well demarcated population boundaries. Further, the ability to detect rare cell types in the mouse cortex varied, with pericytes only detected in one replicate of DroNc-seq and with oligodendrocyte precursors and microglia additionally missing from sci-RNA-seq datasets (Fig 6B, Ding et al.).
Figure 4 from Mereu et al.
t-SNE plots showing unsupervised clustering of human samples. Methods with good library complexity and marker detection were better clustered. Monocytes are a good example of a cell type for which some protocols clearly resolved CD14+ (cyan) and FCGR3A+ (black) subpopulations, whereas others could not.
Another key difference between protocols is mapping: more transcripts containing introns or intergene sequences were found in snRNAseq techniques due to the presence of recently transcribed, unprocessed mRNAs but wide variability was also seen between scRNAseq techniques (Fig. 1, Mereu et al.) with these sequences accounting for 7% of sequences in inDrops and 39.5% in sci-RNA-seq (Supp Fig 3., Ding et al). The proportion of anti-sense reads also varied between 31% in 10X Chromium vs 7.5% in sci-RNA-seq (Supp Fig 3., Ding et al). Finally, the detection of transcripts encoded by the mitochondria genome is of interest in studies that track mtDNA mutations to understand the lineage history of cells in a tissue. Ding et al. found some variability between protocols (Supp Fig. 4, Ding et al.) although for most techniques it was within a range expected based on bulk RNA sequencing. Of relevance for consortia performing disease characterisation, techniques such as 10X Chromium obtain short reads from the 3’ and 5’ ends of transcripts whereas other methods capture the full length. As such, the former provide less information on variants/splice junctions present in transcripts while the latter require greater sequencing depth to get equivalent numbers of transcripts represented.
The only sample type in either preprint that required enzymatic tissue dissociation was the mouse colon tissue (Mereu et al.). There were significantly more mouse colon cells in the reference snRNAseq dataset (which is less biased by sampling due to the isolation of nuclei) than in the digested samples. Samples that were dissociated but not subjected to viability selection also had greater representation of mouse cells, confirming that systematic biases can be introduced by sample preparation. Thus, for studies in which tissue composition is sought, it might be preferable to perform quality control steps in silico or to use snRNAseq, as is necessary in tissues such as brain where dissociation is not possible. Mereu et al. found nuclear RNA to be a reasonable surrogate for cytoplasmic profiles although the inclusion of introns caused some difficulties in comparisons between the two (Fig. 3C).
Both preprints generate new computational pipelines capable of handling input data from multiple scRNAseq techniques and which take on key challenges for the field, including how to remove low quality cells from data derived from differing protocols in a single analysis pipeline. Both preprints highlight the power of this approach: joint analyses are clustered primarily based on cell phenotypes, are able to resolve cell types and examples of cell state separation (e.g. T cell subpopulations) and identify rare cells, even when these were not detected by every individual technique. Ding et al. identify 10X chromium as the technique most consistent with combined datasets with regards to cell clustering. Deeper analysis within cell types did identify differences relating to the protocol-of-origin, however (Fig. 5E-H, Mereu et al.). Moreover, the ability of a technique to distinguish cells types was unrelated to its compatibility with other datasets and further downsampling led to reduced compatibility, leading the authors to suggest that consortia adopt minimum coverage thresholds to enable future dataset integration.
Conclusions
- These preprints provide a resource that can be integrated with cost and scale considerations for data-guided selection of techniques in future scRNAseq experiments.
- 10X Chromium performs well in both studies and has the added advantage that it requires the least time to perform experiments.
- The computational approaches in these preprints provide a framework for integration of datasets from alternative scRNAseq techniques and a means to assess and incorporate newer methodologies as they arise. These data will help consortia to establish high standards that allow the full benefits of parallel approaches to be realised.
Further reading
The Human Cell Atlas White Paper, The HCA Consortium (October 2017: https://www.humancellatlas.org/files/HCA_WhitePaper_18Oct2017.pdf
Benchmarking single cell RNA-sequencing analysis pipelines using mixture control experiments. Tian et al., Nature Methods (2019).
Thanks to Dr. Emilia Lim (@LimEmilia), Dr. Kyren Lazarus (@KyrenLazarus) and the authors of both preprints for thoughtful discussions about the data.
Questions for the authors
Q1: In Mereu et al. all samples were frozen before scRNAseq whereas in Ding et al. samples were prepared fresh. Is an unqueried aspect of benchmarking the ability to compare datasets that mix fresh and frozen samples? Would the pipelines handle this?
Q2: These preprints highlight some challenges of combining scRNAseq data to gain experimental power. What are the steps that individual laboratories can take to ensure that data from smaller scRNAseq experiments become useful in future, more powerful analyses?
Q3: Large consortia rightly involve experts in scRNAseq technology development and indeed these are not the first comparisons of scRNAseq methods, but the most comprehensive and recent. Does the temptation to use newer technologies as these emerge potentially create unforeseeable benchmarking issues?
Sign up to customise the site to your preferences and to receive alerts
Register hereAlso in the bioinformatics category:
Deep learning-based predictions of gene perturbation effects do not yet outperform simple linear methods
Benjamin Dominik Maier
Functional Diversity of Memory CD8 T Cells is Spatiotemporally Imprinted
Marina Schernthanner
Enhancer-driven cell type comparison reveals similarities between the mammalian and bird pallium
Rodrigo Senovilla-Ganzo
Also in the cell biology category:
Restoring mechanophenotype reverts malignant properties of ECM-enriched vocal fold cancer
Teodora Piskova
Germplasm stability in zebrafish requires maternal Tdrd6a and Tdrd6c
Justin Gutkowski
Leukocytes use endothelial membrane tunnels to extravasate the vasculature
Felipe Del Valle Batalla
Also in the molecular biology category:
Germplasm stability in zebrafish requires maternal Tdrd6a and Tdrd6c
Justin Gutkowski
Platelet-derived LPA16:0 inhibits adult neurogenesis and stress resilience in anxiety disorder
Harvey Roweth
Green synthesized silver nanoparticles from Moringa: Potential for preventative treatment of SARS-CoV-2 contaminated water
Safieh Shah, Benjamin Dominik Maier
preListsbioinformatics category:
in the‘In preprints’ from Development 2022-2023
A list of the preprints featured in Development's 'In preprints' articles between 2022-2023
List by | Alex Eve, Katherine Brown |
9th International Symposium on the Biology of Vertebrate Sex Determination
This preList contains preprints discussed during the 9th International Symposium on the Biology of Vertebrate Sex Determination. This conference was held in Kona, Hawaii from April 17th to 21st 2023.
List by | Martin Estermann |
Alumni picks – preLights 5th Birthday
This preList contains preprints that were picked and highlighted by preLights Alumni - an initiative that was set up to mark preLights 5th birthday. More entries will follow throughout February and March 2023.
List by | Sergio Menchero et al. |
Fibroblasts
The advances in fibroblast biology preList explores the recent discoveries and preprints of the fibroblast world. Get ready to immerse yourself with this list created for fibroblasts aficionados and lovers, and beyond. Here, my goal is to include preprints of fibroblast biology, heterogeneity, fate, extracellular matrix, behavior, topography, single-cell atlases, spatial transcriptomics, and their matrix!
List by | Osvaldo Contreras |
Single Cell Biology 2020
A list of preprints mentioned at the Wellcome Genome Campus Single Cell Biology 2020 meeting.
List by | Alex Eve |
Antimicrobials: Discovery, clinical use, and development of resistance
Preprints that describe the discovery of new antimicrobials and any improvements made regarding their clinical use. Includes preprints that detail the factors affecting antimicrobial selection and the development of antimicrobial resistance.
List by | Zhang-He Goh |
Also in the cell biology category:
November in preprints – the CellBio edition
This is the first community-driven preList! A group of preLighters, with expertise in different areas of cell biology, have worked together to create this preprint reading lists for researchers with an interest in cell biology. Categories include: 1) cancer cell biology 2) cell cycle and division 3) cell migration and cytoskeleton 4) cell organelles and organisation 5) cell signalling and mechanosensing 6) genetics/gene expression
List by | Felipe Del Valle Batalla et al. |
BSCB-Biochemical Society 2024 Cell Migration meeting
This preList features preprints that were discussed and presented during the BSCB-Biochemical Society 2024 Cell Migration meeting in Birmingham, UK in April 2024. Kindly put together by Sara Morais da Silva, Reviews Editor at Journal of Cell Science.
List by | Reinier Prosee |
‘In preprints’ from Development 2022-2023
A list of the preprints featured in Development's 'In preprints' articles between 2022-2023
List by | Alex Eve, Katherine Brown |
preLights peer support – preprints of interest
This is a preprint repository to organise the preprints and preLights covered through the 'preLights peer support' initiative.
List by | preLights peer support |
The Society for Developmental Biology 82nd Annual Meeting
This preList is made up of the preprints discussed during the Society for Developmental Biology 82nd Annual Meeting that took place in Chicago in July 2023.
List by | Joyce Yu, Katherine Brown |
CSHL 87th Symposium: Stem Cells
Preprints mentioned by speakers at the #CSHLsymp23
List by | Alex Eve |
Journal of Cell Science meeting ‘Imaging Cell Dynamics’
This preList highlights the preprints discussed at the JCS meeting 'Imaging Cell Dynamics'. The meeting was held from 14 - 17 May 2023 in Lisbon, Portugal and was organised by Erika Holzbaur, Jennifer Lippincott-Schwartz, Rob Parton and Michael Way.
List by | Helen Zenner |
9th International Symposium on the Biology of Vertebrate Sex Determination
This preList contains preprints discussed during the 9th International Symposium on the Biology of Vertebrate Sex Determination. This conference was held in Kona, Hawaii from April 17th to 21st 2023.
List by | Martin Estermann |
Alumni picks – preLights 5th Birthday
This preList contains preprints that were picked and highlighted by preLights Alumni - an initiative that was set up to mark preLights 5th birthday. More entries will follow throughout February and March 2023.
List by | Sergio Menchero et al. |
CellBio 2022 – An ASCB/EMBO Meeting
This preLists features preprints that were discussed and presented during the CellBio 2022 meeting in Washington, DC in December 2022.
List by | Nadja Hümpfer et al. |
Fibroblasts
The advances in fibroblast biology preList explores the recent discoveries and preprints of the fibroblast world. Get ready to immerse yourself with this list created for fibroblasts aficionados and lovers, and beyond. Here, my goal is to include preprints of fibroblast biology, heterogeneity, fate, extracellular matrix, behavior, topography, single-cell atlases, spatial transcriptomics, and their matrix!
List by | Osvaldo Contreras |
EMBL Synthetic Morphogenesis: From Gene Circuits to Tissue Architecture (2021)
A list of preprints mentioned at the #EESmorphoG virtual meeting in 2021.
List by | Alex Eve |
FENS 2020
A collection of preprints presented during the virtual meeting of the Federation of European Neuroscience Societies (FENS) in 2020
List by | Ana Dorrego-Rivas |
Planar Cell Polarity – PCP
This preList contains preprints about the latest findings on Planar Cell Polarity (PCP) in various model organisms at the molecular, cellular and tissue levels.
List by | Ana Dorrego-Rivas |
BioMalPar XVI: Biology and Pathology of the Malaria Parasite
[under construction] Preprints presented at the (fully virtual) EMBL BioMalPar XVI, 17-18 May 2020 #emblmalaria
List by | Dey Lab, Samantha Seah |
1
Cell Polarity
Recent research from the field of cell polarity is summarized in this list of preprints. It comprises of studies focusing on various forms of cell polarity ranging from epithelial polarity, planar cell polarity to front-to-rear polarity.
List by | Yamini Ravichandran |
TAGC 2020
Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20
List by | Maiko Kitaoka et al. |
3D Gastruloids
A curated list of preprints related to Gastruloids (in vitro models of early development obtained by 3D aggregation of embryonic cells). Updated until July 2021.
List by | Paul Gerald L. Sanchez and Stefano Vianello |
ECFG15 – Fungal biology
Preprints presented at 15th European Conference on Fungal Genetics 17-20 February 2020 Rome
List by | Hiral Shah |
ASCB EMBO Annual Meeting 2019
A collection of preprints presented at the 2019 ASCB EMBO Meeting in Washington, DC (December 7-11)
List by | Madhuja Samaddar et al. |
EMBL Seeing is Believing – Imaging the Molecular Processes of Life
Preprints discussed at the 2019 edition of Seeing is Believing, at EMBL Heidelberg from the 9th-12th October 2019
List by | Dey Lab |
Autophagy
Preprints on autophagy and lysosomal degradation and its role in neurodegeneration and disease. Includes molecular mechanisms, upstream signalling and regulation as well as studies on pharmaceutical interventions to upregulate the process.
List by | Sandra Malmgren Hill |
Lung Disease and Regeneration
This preprint list compiles highlights from the field of lung biology.
List by | Rob Hynds |
Cellular metabolism
A curated list of preprints related to cellular metabolism at Biorxiv by Pablo Ranea Robles from the Prelights community. Special interest on lipid metabolism, peroxisomes and mitochondria.
List by | Pablo Ranea Robles |
BSCB/BSDB Annual Meeting 2019
Preprints presented at the BSCB/BSDB Annual Meeting 2019
List by | Dey Lab |
MitoList
This list of preprints is focused on work expanding our knowledge on mitochondria in any organism, tissue or cell type, from the normal biology to the pathology.
List by | Sandra Franco Iborra |
Biophysical Society Annual Meeting 2019
Few of the preprints that were discussed in the recent BPS annual meeting at Baltimore, USA
List by | Joseph Jose Thottacherry |
ASCB/EMBO Annual Meeting 2018
This list relates to preprints that were discussed at the recent ASCB conference.
List by | Dey Lab, Amanda Haage |
Also in the molecular biology category:
2024 Hypothalamus GRC
This 2024 Hypothalamus GRC (Gordon Research Conference) preList offers an overview of cutting-edge research focused on the hypothalamus, a critical brain region involved in regulating homeostasis, behavior, and neuroendocrine functions. The studies included cover a range of topics, including neural circuits, molecular mechanisms, and the role of the hypothalamus in health and disease. This collection highlights some of the latest advances in understanding hypothalamic function, with potential implications for treating disorders such as obesity, stress, and metabolic diseases.
List by | Nathalie Krauth |
BSCB-Biochemical Society 2024 Cell Migration meeting
This preList features preprints that were discussed and presented during the BSCB-Biochemical Society 2024 Cell Migration meeting in Birmingham, UK in April 2024. Kindly put together by Sara Morais da Silva, Reviews Editor at Journal of Cell Science.
List by | Reinier Prosee |
‘In preprints’ from Development 2022-2023
A list of the preprints featured in Development's 'In preprints' articles between 2022-2023
List by | Alex Eve, Katherine Brown |
CSHL 87th Symposium: Stem Cells
Preprints mentioned by speakers at the #CSHLsymp23
List by | Alex Eve |
9th International Symposium on the Biology of Vertebrate Sex Determination
This preList contains preprints discussed during the 9th International Symposium on the Biology of Vertebrate Sex Determination. This conference was held in Kona, Hawaii from April 17th to 21st 2023.
List by | Martin Estermann |
Alumni picks – preLights 5th Birthday
This preList contains preprints that were picked and highlighted by preLights Alumni - an initiative that was set up to mark preLights 5th birthday. More entries will follow throughout February and March 2023.
List by | Sergio Menchero et al. |
CellBio 2022 – An ASCB/EMBO Meeting
This preLists features preprints that were discussed and presented during the CellBio 2022 meeting in Washington, DC in December 2022.
List by | Nadja Hümpfer et al. |
EMBL Synthetic Morphogenesis: From Gene Circuits to Tissue Architecture (2021)
A list of preprints mentioned at the #EESmorphoG virtual meeting in 2021.
List by | Alex Eve |
FENS 2020
A collection of preprints presented during the virtual meeting of the Federation of European Neuroscience Societies (FENS) in 2020
List by | Ana Dorrego-Rivas |
ECFG15 – Fungal biology
Preprints presented at 15th European Conference on Fungal Genetics 17-20 February 2020 Rome
List by | Hiral Shah |
ASCB EMBO Annual Meeting 2019
A collection of preprints presented at the 2019 ASCB EMBO Meeting in Washington, DC (December 7-11)
List by | Madhuja Samaddar et al. |
Lung Disease and Regeneration
This preprint list compiles highlights from the field of lung biology.
List by | Rob Hynds |
MitoList
This list of preprints is focused on work expanding our knowledge on mitochondria in any organism, tissue or cell type, from the normal biology to the pathology.
List by | Sandra Franco Iborra |