Discovering the drivers of clonal hematopoiesis
Preprint posted on October 23, 2020 https://www.biorxiv.org/content/10.1101/2020.10.22.350140v1
Categories: bioinformatics, cancer biology, genomics
INTRODUCTION
Blood cells are generated throughout the life of every person. The process of producing blood cells (i.e. hematopoiesis) starts in the bone marrow, where stem cells divide to give rise to hematopoietic stem cells. Every person has a limited amount of hematopoietic stem cells (HSCs) that, as any other cell in our body, accumulate random mutations with time. Most of these mutations are expected to be neutral, but in some cases they confer an advantage to the HSC progenitors, so that their clonal progeny is over-represented in the total population of blood cells.
This phenomenon is called clonal hematopoiesis (CH) and can be identified by sequencing the DNA of blood cells: if a relatively large proportion of blood cells have a specific cell mutation, these are assumed to be a clonal population derived from a single HSC progenitor. Although this seems to be a normal phenomenon related with age, it has been also associated with risk of hematological cancer and death [1].
Recurrent mutations in specific genes have been found in CH suggesting these might be driver mutations, i.e. they provide a selective advantage to the cells leading to clonal expansion. A complete characterisation of gene specific mutations that might drive CH would be useful to investigate the mechanisms that lead to CH in the first place. Pich et al. tackle this problem by identifying signals of positive selection in blood somatic mutations, similar to cancer research studies.
About the preprint
For identifying positive selection in blood somatic mutations, the authors employed a clever approach, repurposing paired blood-tumour samples of more than 12,000 donors, divided in primary (N=~ 8,000; whole-exome level) and metastatic (N=~4,000; whole-genome level) cohorts.
These datasets were originally used to analyse mutations in the DNA sequence of the tumour sample, using the blood sample as control (variant calling). In this study, they analysed the mutations in the blood cells, using the tumour sample as control (reverse variant calling).
On the many genome somatic mutations obtained by reverse calling in both cohorts, they applied the IntOGen bioinformatic pipeline, which analyses the mutation data with seven different driver discovery algorithms combining the results to identify signals of positive selection [2]. Signals of positive selection can be: unexpected high recurrence of mutations; unexpected clustering of mutations in certain regions of the gene; or exceptionally high functional impact of the observed mutations.
After the IntOGen pipeline the authors filtered out few genes deemed as possible artifacts to reduce the probability of having false positives and kept only genes with previous evidence of being associated with CH, myeloid malignancies, or tumorigenesis in general, leading to 26 and 23 driver candidate genes in the metastasis and the primary cohorts, respectively (Fig 1).

Importantly, most genes already associated with CH were recovered, as well as previously found associations such as, CH being positively influenced by age and by the exposure to cytotoxic treatments. Interestingly, although a set of CH genes common to both cohorts was found, many genes were specific to each cohort. The authors point out that this might be due to differences in treatments (e.g. chemoterapy) ethnicity or lifestyle exposures between the donors of both cohorts. Mutations in some CH-related genes are indeed known to provide an advantage to hematopoietic cells under exposure to certain cytotoxic treatments.
The authors then enriched these datasets with a “targeted-cohort”, consisting of N=~24,000 paired blood-tumour samples where a subset of all protein coding genes (N=468) have been sequenced (mostly genes involved in tumour development). Interestingly, they showed that only 44 genes show positive selection signatures. Other genes appeared mutated but didn’t show selection signatures, suggesting these are just passenger mutations and do not drive CH.
This study shows the utility of repurposing massive datasets of paired blood-tissue samples for the study of CH and represents a step towards a targeted sequencing panel of CH drivers that could be widely used for screening and discovery of early stages of CH in patients.
Questions to the authors
- In some types of cancers, studies have shown some mutations seem to occur as a recurrent series of events. Do you think this could also be the case for CH driver genes?
- Is there any evidence for differences in CH drivers for different human populations?
References
[1] Genovese G, Kähler AK, Handsaker RE, et al. Clonal Hematopoiesis and Blood-Cancer Risk Inferred from Blood DNA Sequence. N Engl J Med. 2014;371(26):2477-2487. doi:10.1056/nejmoa1409405
[2] Martínez-Jiménez F, Muiños F, Sentís I, et al. A compendium of mutational cancer driver genes. Nat Rev Cancer. 2020;20(10):555-572. doi:10.1038/s41568-020-0290-x
Posted on: 23rd November 2020
doi: https://doi.org/10.1242/prelights.25824
Read preprintSign up to customise the site to your preferences and to receive alerts
Register hereAlso in the bioinformatics category:
Dissecting Mammalian Spermatogenesis Using Spatial Transcriptomics
Selected by | Martin Estermann |
The hourglass model of evolutionary conservation during embryogenesis extends to developmental enhancers with signatures of positive selection
Selected by | Irepan Salvador-Martinez |
1
The mutational landscape of human somatic and germline cells
Selected by | Kerryn Elliott |
Also in the cancer biology category:
EccDNA formation is dependent on MMEJ, repressed by c-NHEJ pathway, and stimulated by DNA double-strand break
Selected by | Ram |
1
ALC1 links chromatin accessibility to PARP inhibitor response in homologous recombination deficient cells
Selected by | Giuseppina D'Alessandro |
1
Endothelial cell invasiveness is controlled by myosin IIA-dependent inhibition of Arp2/3 activity
Selected by | Mariana De Niz |
1
Also in the genomics category:
Dissecting Mammalian Spermatogenesis Using Spatial Transcriptomics
Selected by | Martin Estermann |
EccDNA formation is dependent on MMEJ, repressed by c-NHEJ pathway, and stimulated by DNA double-strand break
Selected by | Ram |
1
Association analysis of repetitive elements and R-loop formation across species
Selected by | Ram |
1
preListsbioinformatics category:
in theSingle Cell Biology 2020
A list of preprints mentioned at the Wellcome Genome Campus Single Cell Biology 2020 meeting.
List by | Alex Eve |
Antimicrobials: Discovery, clinical use, and development of resistance
Preprints that describe the discovery of new antimicrobials and any improvements made regarding their clinical use. Includes preprints that detail the factors affecting antimicrobial selection and the development of antimicrobial resistance.
List by | Zhang-He Goh |
Also in the cancer biology category:
Single Cell Biology 2020
A list of preprints mentioned at the Wellcome Genome Campus Single Cell Biology 2020 meeting.
List by | Alex Eve |
ASCB EMBO Annual Meeting 2019
A collection of preprints presented at the 2019 ASCB EMBO Meeting in Washington, DC (December 7-11)
List by | Madhuja Samaddar, Ramona Jühlen, Amanda Haage, Laura McCormick, Maiko Kitaoka |
Lung Disease and Regeneration
This preprint list compiles highlights from the field of lung biology.
List by | Rob Hynds |
Anticancer agents: Discovery and clinical use
Preprints that describe the discovery of anticancer agents and their clinical use. Includes both small molecules and macromolecules like biologics.
List by | Zhang-He Goh |
Biophysical Society Annual Meeting 2019
Few of the preprints that were discussed in the recent BPS annual meeting at Baltimore, USA
List by | Joseph Jose Thottacherry |
Also in the genomics category:
EMBL Conference: From functional genomics to systems biology
Preprints presented at the virtual EMBL conference "from functional genomics and systems biology", 16-19 November 2020
List by | Jesus Victorino |
TAGC 2020
Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20
List by | Maiko Kitaoka, Madhuja Samaddar, Miguel V. Almeida, Sejal Davla, Jennifer Ann Black, Dey Lab |
Zebrafish immunology
A compilation of cutting-edge research that uses the zebrafish as a model system to elucidate novel immunological mechanisms in health and disease.
List by | Shikha Nayar |