Ancient genomic regulatory blocks are a major source for gene deserts in vertebrates after whole genome duplications
Preprint posted on September 25, 2019 https://www.biorxiv.org/content/10.1101/776369v1
Background & Summary
Whole Genome Duplication (WGD) events, which might have resulted from erroneous chromosome segregation during meiotic or zygotic cell division, have occurred in many lineages such as yeasts or plants . Having all genes duplicated at once is likely a source of rapid evolutionary change: from the incredible amount of raw material a set of new features could be acquired through specialization and subfunctionalization of the duplicated paralogs. Such potential source of variation was “inserted” into the vertebrate genome after the two rounds of WGD that took place more than 450 million years ago .
However, the extra copies of each gene (ohnologs) produced after a WGD event often disappear due to the lack of providing a selective advantage to the organism, making it hard to trace evolutionary paths back, and explaining why there are not four paralogs of each gene in humans. Microsyntenic associations are formed between a developmental transcription factor (trans-dev, as they are referred to in the preprint) and another gene (bystander) when the regulatory elements of the trans-dev gene are found in the introns of bystander genes; such microsyntenic associations are known as genomic regulatory blocks (GRBs). In this work, Touceda-Suárez et al. investigated how GRBs have evolved in vertebrates in the context of trans-dev genes.
The authors studied the dismantling of the GRB microsyntenic associations after the two rounds of WGD and found that the non-trans-dev gene disappeared while the intronic regulatory elements were kept. This “exon erosion” likely resulted in large intergenic regions full of non-coding regulators, and the authors postulate that this is an important mechanism of gene desert formation (Figure 1).
Figure 1.- Schematic representation of the formation of a gene desert. An ancient Genomic Regulatory Block (GRB) present in a common ancestor is kept in microsynteny after Whole Genome Duplication (WGD) at one of the loci, while at the other loci the bystander gene is lost in the duplicated region, giving rise to a gene desert in the vicinities of the trans-dev.
– Most microsyntenic associations (GRB pairs) common to chordates are present in humans. 131 out of the 156 syntenic gene pairs studied (84%) in 116 putative GRBs were present in the human genome in at least one copy.
– The majority of these GRB pairs are conserved in single copy. Despite the two rounds of WGD, most GRBs analyzed (present in humans) only kept the microsyntenic association between the trans-dev and the bystander gene in one place in the genome, even when there were more ohnologs maintained.
– The dismantling of the extra copies of GRB pairs involve preferential loss of the non-trans-dev gene. In most cases, only the trans-dev gene has been kept in more than one copy (70.1%).
– Exon erosion of the non-trans-dev gene is likely the mechanism of gene loss. In most of the GRBs analyzed, the absence of synteny in more than one trans-dev ohnolog could have been produced by genomic rearrangement or by the loss of one of the genes. However, the longer the intergenic region, the higher the number of putative regulatory elemements (as identified by ATAC-seq peaks). The number of putative regulatory elements increases to reach as many as there would be from the trans-dev gene to the next one after the bystander, which would have not been the case if the synteny had been lost by a genomic rearrangement event. For instance, the ohnologs isl2a and isl1 in zebrafish both maintain a highly conserved non-coding region located within an intron of the scaper gene in the first case, and within the gene desert originated in the second scenario.
– Large intergenic regions (gene deserts) arise upon the loss of large non-trans-dev genes. Trans-dev genes were enriched for large intergenic regions and, after dismantling of the bystander gene, the size of these intergenic regions would increase. The size of these larger intergenic regions reached approximately the distance between the trans-dev genes and the following gene after the bystanders in those cases where the bystander gene had been kept.
Why I liked this preprint
The regulation of gene expression in metazoans is coordinated to a large extent by distal enhancers, which are highly prevalent in vertebrates. Genes encoding developmental transcription factors usually lie near large intergenic regions in humans. These regions are commonly known as “gene deserts” and are an important source of regulatory elements. However, the evolutionary paths that these genomic blocks might have followed all the way through the vertebrate lineage until they became large intergenic regions remained largely unknown.
This research led by M. Irimia, J.L. Gómez-Skarmeta and I. Maeso shows a very likely mechanism by which these enigmatic genomic regions might have appeared in evolved vertebrates. A beautiful answer to an interesting question, which is the main reason why I liked this preprint.
Questions to the authors
– What has been the criteria used by the authors to define a gene as a transcription factor involved in development?
– What do the authors think could be the main reason behind the preferential loss of the bystander gene? Is it known what’s the proportion of trans-dev (retained in multiple copies) that acquired new functions or specialization?
– The authors show examples of trans-dev and bystander pairs in synteny, while contrasting it with non-trans-dev genes likely sharing bidirectional promoters. Did the authors explore the frequency of trans-dev and bystander genes sharing a bidirectional promoter? Would this GRB pairs be expected to undergo the same exon erosion phenomena?
– Do the authors think that having large intergenic regions -sort of specialized in trans-dev gene regulation- confer any evolutionary advantage over having regulatory elements at the intronic regions of a bystander?
Posted on: 14th October 2019 , updated on: 5th November 2019Read preprint
Also in the evolutionary biology category:
Six new reference-quality bat genomes illuminate the molecular basis and evolution of bat adaptations
|Selected by||Alexa Sadier, Alexa Sadier|
The tuatara genome: insights into vertebrate evolution from the sole survivor of an ancient reptilian order
|Selected by||Miguel V. Almeida|
Oxygenation properties of hemoglobin and the evolutionary origins of isoform multiplicity in an amphibious air-breathing fish, the blue-spotted mudskipper (Boleophthalmus pectinirostris)
|Selected by||Charlotte Nelson|
preListsevolutionary biology category:in the
Coronavirus | 2019-nCoV
Regularly updated list of preprints dealing with the ongoing coronavirus outbreak originating in Wuhan, China in Dec 2019
|List by||Gautam Dey, Zhang-He Goh|
SDB 78th Annual Meeting 2019
A curation of the preprints presented at the SDB meeting in Boston, July 26-30 2019. The preList will be updated throughout the duration of the meeting.
|List by||Alex Eve|
Pattern formation during development
The aim of this preList is to integrate results about the mechanisms that govern patterning during development, from genes implicated in the processes to theoritical models of pattern formation in nature.
|List by||Alexa Sadier|