Simultaneous multiplexed amplicon sequencing and transcriptome profiling in single cells
Preprint posted on November 13, 2018 https://www.biorxiv.org/content/early/2018/11/13/328328
Article now published in Nature Methods at http://dx.doi.org/10.1038/s41592-018-0259-9
High-throughput targeted long-read single cell sequencing reveals the clonal and transcriptional landscape of lymphocytes
Preprint posted on September 24, 2018 https://www.biorxiv.org/content/early/2018/09/24/424945
Article now published in Nature Communications at http://dx.doi.org/10.1038/s41467-019-11049-4
Droplet-based single-cell RNA sequencing (scRNA-seq) is, and continues to be, an important tool for estimating changes in gene expression at single-cell resolution in a high-throughput manner. However, current techniques are largely limited to studying the 3’ ends of A-tailed mRNA transcripts at a transcriptome-wide level.
These two preprints by Saikia and Burnham et al. (on DART-seq) and Singh and Al-Eryani et al. (on RAGE-seq) tackle several limitations of current droplet-based scRNA-seq. For example, the diversity of B- and T-cell receptors (BCRs and TCRs respectively), generated by V(D)J recombination and the further random addition or removal of nucleotides, cannot be elucidated by traditional droplet-based scRNA-seq. The V(D)J regions of the B- and T-cell receptors are located at the 5’ end of their respective transcripts, whereas the combination of 3’ end capture, fragmentation and short-read Illumina sequencing in droplet-based scRNA-seq yields reads that are largely restricted to the 3’ end. The two preprints both tackle this problem, albeit slightly differently: DART-seq uses specific primers and modified Drop-seq beads to capture the heavy and light chain transcripts just downstream of the variable region, while RAGE-seq uses Oxford Nanopore technology to sequence the captured full-length BCR and TCR transcripts.
Additionally, Saikia and Burnham et al. used DART-seq to characterise non-A-tailed transcripts of dsRNA viruses, while Singh and Al-Eryani et al. used RAGE-seq to characterise alternative splicing and gene rearrangements in BCR transcripts, both of which would be extremely difficult with traditional droplet-based scRNA-seq.
DART-seq (Saikia and Burnham et al.)
In DART-seq, customised primers are ligated to a fraction of poly-dTs on Drop-seq beads, with an efficiency of 25-40%, which allows the specific capture and amplification of transcripts of interest. The ligation reaction can be titrated to leave many poly-dTs available, enabling the simultaneous capture of transcripts-of-interest and A-tailed transcripts.
The authors use their technology to specifically capture and sequence heavy and light chain antibody variable regions, while simultaneously obtaining whole-transcriptomic data from those same cells. Antibody variable regions are located at the 5’ end of their transcripts and traditionally cannot be sequenced with droplet-based scRNA-seq, which focuses on the 3’ end of transcripts. By designing primers just downstream of the V(D)J segments, the Ig recovery rate could be increased significantly compared to Drop-seq.
DART-seq was then used to study the B-cell antibody repertoire within human peripheral blood mononuclear cells (PBMCs). Of the 818 B-cells identified, 564 (67%) had immunoglobulin transcripts and the complete heavy and light chain CDR3 regions were obtained for 120 cells (15%). Clone-specific pairing was measured, and the highest frequency was observed between the most highly expressed heavy and light chain transcripts, which supports previous data.
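To make the idea of natively paired chains concrete, here is a minimal sketch of how heavy and light chain calls could be paired by cell barcode. It is an illustration only, not the authors' pipeline: the record layout (barcode, chain, CDR3), the chain labels "heavy" and "light", and the function names are all assumptions made for this example.

```python
from collections import Counter

def pair_chains(records):
    """Group (barcode, chain, cdr3) records into per-cell heavy/light pairs.

    Hypothetical input layout: chain is "heavy" or "light". If a cell has
    several records for the same chain, the last one seen is kept.
    """
    cells = {}
    for barcode, chain, cdr3 in records:
        cells.setdefault(barcode, {})[chain] = cdr3
    # Keep only cells in which both chains were recovered.
    return {
        barcode: (chains["heavy"], chains["light"])
        for barcode, chains in cells.items()
        if "heavy" in chains and "light" in chains
    }

def pairing_counts(pairs):
    """Count how many cells carry each (heavy CDR3, light CDR3) combination."""
    return Counter(pairs.values())
```

Cells with only one recovered chain are dropped from the pairing table, which mirrors why the number of cells with complete heavy and light chain information is smaller than the number of cells with any immunoglobulin transcript.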
The authors also applied their technology to study the infection of cells with T3D reovirus, an RNA virus with non-A-tailed transcripts. Infected cells were subjected to DART-seq, with beads modified to capture segments of the viral genome. By using primers designed to sequence the entire S2 transcript, the authors were able to analyse point mutations in the viral transcripts. Upon study of the cell transcriptomes, they found four distinct cell subpopulations, one of which was not found in the non-infected control.
RAGE-seq (Singh and Al-Eryani et al.)
In contrast to DART-seq, RAGE-seq uses targeted capture coupled with long-read sequencing with Oxford Nanopore. To obtain full-length BCR and TCR sequences, a capture bait library targeting all V, J and constant (C) region exons was used. The cDNA library is simultaneously sequenced via traditional short-read sequencing (via 10X Genomics), enabling 3’ expression profiling.
Figure 1 of Singh and Al-Eryani et al., made available under a CC-BY-NC-ND 4.0 International License
The authors validated the technology with a mix of Jurkat cells (T cells), Ramos cells (B cells) and monocytes (as a negative control). They noted a 13-fold enrichment of nanopore reads aligned to the BCR and TCR when compared to non-targeted capture, and were able to recover both full-length transcripts from a subset of cells: TCRα and β from 18.9% of Jurkat cells, and the Ig heavy and light chains from 31% of Ramos cells. By tracking the amino acid mutations present, the authors could plot and follow the evolution of Ramos cells undergoing somatic hypermutation.
The authors then analysed lymphocytes from a human lymph node, where they similarly identified T cells and B cells, and sequenced their respective TCR and immunoglobulin chains. They quantified differences in Ig heavy chain transcripts in different cell types, noting that naive and memory B cells have both membrane and secretory IGH isoforms, while plasmablasts and plasma cells only have the secretory form. In addition, many plasmablasts and plasma cells were assigned to IGHA1, which is consistent with differentiation to high-rate antibody secreting cells.
The lymph node analysis was combined with analysis of lymphocytes from the primary breast tumour of the same patient, to enable the tracking of clonally related T and B cells across different tissues. Seven clones were found to be shared between the tumour and lymph node, of which six were found within the CD8 T cell cluster. These clonally expanded cells have a discrete gene signature associated with active tissue-resident cytotoxic lymphocytes, but each clone also seemed to express unique sets of genes.
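The cross-tissue comparison can be sketched as a set intersection over clonotypes. The snippet below is a rough illustration under stated assumptions, not the method used in the preprint: it assumes each cell has already been assigned a clonotype key (here a hypothetical (V gene, CDR3) tuple), and the function names are invented for this example.

```python
from collections import defaultdict

def clone_sizes(cells):
    """Count cells per clonotype.

    cells: iterable of (barcode, clonotype_key) pairs, where clonotype_key
    is any hashable identifier (assumed here to be a (v_gene, cdr3) tuple).
    """
    sizes = defaultdict(int)
    for _barcode, key in cells:
        sizes[key] += 1
    return dict(sizes)

def shared_clones(tumour_cells, node_cells):
    """Return clonotypes seen in both tissues, with (tumour, node) cell counts."""
    tumour = clone_sizes(tumour_cells)
    node = clone_sizes(node_cells)
    # Dictionary key views support set intersection directly.
    return {key: (tumour[key], node[key]) for key in tumour.keys() & node.keys()}
```

Requiring an exact match on the clonotype key is a deliberately strict choice; a real analysis would need to decide how to handle sequencing errors and, for B cells, somatic hypermutation within a clone.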
What I like about these works
High-throughput droplet-based transcriptomic sequencing is an extremely powerful technique, but it currently suffers from certain limitations, such as the inability to sequence nucleotides located at the 5’ end of transcripts and the lack of sequencing strategies to pick up non-A-tailed transcripts.
The developments proposed by Saikia and Burnham et al. and Singh and Al-Eryani et al. go a long way towards solving these problems. The two groups of authors have come up with innovative and contrasting technologies that expand the single-cell transcriptomic toolkit. In particular, both preprints have demonstrated an ability to recover natively paired heavy and light chains of antibody variable regions, which are difficult to obtain with traditional droplet-based single-cell transcriptomic sequencing.
I’m also enthralled by the recent increase in utilisation of Oxford Nanopore sequencing, and I find the combination of Oxford Nanopore and Illumina sequencing in RAGE-seq (Singh and Al-Eryani et al.) exciting and potentially useful for many applications.
Posted on: 20th November 2018