Association analysis of repetitive elements and R-loop formation across species
Preprint posted on November 10, 2020 https://www.biorxiv.org/content/10.1101/2020.11.09.374124v1
Article now published in Mobile DNA at https://mobilednajournal.biomedcentral.com/articles/10.1186/s13100-021-00231-5
Categories: genomics, molecular biology
Context [1-3]
R-loops are non-canonical, three-stranded nucleic acid structures that form when an RNA strand hybridizes with its complementary DNA strand, displacing the other DNA strand. The factors that influence the formation and genome-wide distribution of R-loops include chromatin modifiers and proteins involved in transcription, splicing, replication, recombination, and DNA repair. Additionally, R-loops tend to form at repetitive elements and at skewed sequences (GC-skew and AT-skew). However, it is not clear whether the genome-wide distribution of R-loops shows a sequence bias that differs among species. The authors of the current preprint therefore reanalyzed published datasets to understand the cis-regulatory elements associated with genome-wide R-loop distribution.
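As a point of reference, GC-skew is conventionally computed as (G - C) / (G + C) over a sequence window, and AT-skew as (A - T) / (A + T). The sketch below only illustrates that definition and is not code from the preprint; the function names and the window and step parameters are hypothetical choices.

```python
def skew(seq: str, first: str, second: str) -> float:
    """Return (first - second) / (first + second) base counts; 0.0 if neither base occurs."""
    s = seq.upper()
    a, b = s.count(first), s.count(second)
    return (a - b) / (a + b) if (a + b) else 0.0


def gc_skew(seq: str) -> float:
    """GC-skew: (G - C) / (G + C)."""
    return skew(seq, "G", "C")


def at_skew(seq: str) -> float:
    """AT-skew: (A - T) / (A + T)."""
    return skew(seq, "A", "T")


def sliding_gc_skew(seq: str, window: int, step: int):
    """Return (window_start, GC-skew) pairs for full-length windows along seq."""
    return [
        (i, gc_skew(seq[i:i + window]))
        for i in range(0, len(seq) - window + 1, step)
    ]
```

A positive GC-skew means the strand carries more G than C within the window; the values are strand-specific, so the reverse complement has the opposite sign.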

Key findings
- The authors reanalyzed publicly available datasets generated in human cells (U2OS), fly (D. melanogaster embryos and S2 cells), and plants (A. thaliana seedlings) using different controls. They used R-loop profiles (DRIP-seq) and nascent RNA profiles (GRO-seq) for the study. They observed that R-loops in plants tend to be longer (~998 nucleotides) than those in humans and flies (~414-618 nucleotides). Across the species, R-loops tend to be enriched at gene promoters. Of note, plants harbor about 60% of their total R-loops at promoters (and 0.2% at introns). The authors also found a 70%, 24%, 39%, and 54% overlap between R-loops and transcribing regions in humans, fly embryos, S2 cells, and plants, respectively. However, further analysis showed that fly R-loops tend to lie in intergenic regions (~90%) and possibly form independently of transcription, suggesting the presence of trans R-loops.
- The authors report some species-specific differences. They show that human and plant R-loops were marginally enriched at ribosomal DNA, underrepresented at short interspersed nuclear elements (SINEs), and more strongly enriched at retrotransposons and satellite DNA. This contrasts with the fly genome, in which R-loops are underrepresented at satellite DNA. (They also noticed some differences in the genome-wide R-loop distribution between fly embryos and S2 cells, possibly reflecting the developmental stages.)
- Overall, all the species analyzed showed a positive correlation between repetitive genetic elements and the genome-wide distribution of R-loops. In human cells, telomeres, centromeres, ribosomal DNA, and retrotransposons are enriched for R-loops. In the fly genome, long interspersed nuclear elements (LINEs), long terminal repeats (LTRs), and low complexity regions are enriched for R-loops. In the plant genome, about half of the repeat families were enriched for R-loops, including LINEs, LTRs, and low complexity regions.
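To make the overlap percentages above concrete, here is a minimal sketch of how the fraction of R-loop peaks that overlap transcribed regions could be computed from BED-like intervals. It is an illustration under stated assumptions, not the authors' pipeline: this highlight does not state the preprint's exact overlap criterion, so "overlap" here means sharing at least one base, and the function names and the coordinates in the usage example are hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict


def merge_intervals(regions):
    """Merge overlapping or touching half-open (chrom, start, end) intervals per chromosome."""
    by_chrom = defaultdict(list)
    for chrom, start, end in regions:
        by_chrom[chrom].append((start, end))
    merged = {}
    for chrom, ivs in by_chrom.items():
        ivs.sort()
        out = [list(ivs[0])]
        for start, end in ivs[1:]:
            if start <= out[-1][1]:
                out[-1][1] = max(out[-1][1], end)
            else:
                out.append([start, end])
        # Store parallel sorted lists of starts and ends for binary search.
        merged[chrom] = ([s for s, _ in out], [e for _, e in out])
    return merged


def overlaps(merged, chrom, start, end):
    """Return True if [start, end) shares at least one base with a merged region."""
    if chrom not in merged:
        return False
    starts, ends = merged[chrom]
    # Index of the last merged region that starts before the peak ends.
    i = bisect_right(starts, end - 1) - 1
    return i >= 0 and ends[i] > start


def fraction_overlapping(peaks, regions):
    """Fraction of peaks that overlap any region (assumed criterion: >= 1 shared base)."""
    if not peaks:
        return 0.0
    merged = merge_intervals(regions)
    hits = sum(overlaps(merged, c, s, e) for c, s, e in peaks)
    return hits / len(peaks)


# Hypothetical toy coordinates, not data from the preprint.
rloop_peaks = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 10, 20)]
transcribed = [("chr1", 150, 300), ("chr1", 250, 400), ("chr2", 20, 30)]
fraction = fraction_overlapping(rloop_peaks, transcribed)
```

In this toy input only the first peak overlaps a transcribed region, so the fraction is one third. Merging the regions first keeps them disjoint, which is what lets a single binary search per peak decide the overlap.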
Perspective
Whether as a cause or a consequence, R-loops seem to play a crucial role in developmental pathways, cancer progression, and neurodegenerative diseases, and many researchers are therefore working to understand their precise physiological role. While most work in R-loop biology has focused on trans-acting factors, the authors here investigated cis-regulatory elements, or sequence determinants, associated with R-loop formation. They found strong correlations between R-loops and repetitive DNA sequences, reinforcing earlier studies.
(Note: I have only highlighted the key findings of the preprint, without commenting on the methodology. Readers are welcome to comment on the methodology if the preprint interests them.)
Acknowledgments: I am thankful to all the authors for their support, especially Chao Zeng for taking the time to comment on the preLight.
References:
- https://doi.org/10.1093/nar/gkw1054
- https://doi.org/10.1101/gr.158436.113
- https://doi.org/10.1038/s41477-017-0004-x
- https://doi.org/10.7554/elife.17548
- https://doi.org/10.1038/s41467-017-00338-5
- https://doi.org/10.1016/j.cub.2017.01.011
Posted on: 28th December 2020, updated on: 22nd January 2021
doi: https://doi.org/10.1242/prelights.25813