Endogenous retroviruses drive species-specific germline transcriptomes in mammals

Akihiko Sakashita, So Maezawa, Kris G. Alavattam, Masashi Yukawa, Artem Barski, Mihaela Pavlicev, Satoshi H. Namekawa

Preprint posted on March 11, 2020

A case of endogenous retroviral control of male germline genes in mammals

Selected by Petra Kovacikova


Transposable elements (TE) are sequences of DNA capable of changing their position in the genome by first being cut from their initial position and then pasted in a new place. They occupy a surprisingly large portion of mammalian genomes and have long been thought to be just “junk” DNA, often lacking coding potential for any protein except the transposase which is responsible for the jumping feature of these genes. Some of TEs are derived from endogenous retroviruses (ERV), retroviruses that integrated into the germline genome and have been inherited for generations. In fact, many have lost the ability to transpose completely. Nonetheless, they pose a threat to the host genome by introducing mutations in place of their integration, hence the cell tries to prevent such events by methylation or other silencing mechanisms of the chromatin region they lie in. This is especially true for germ cells that carry the heritable genome.

In this preprint, the authors envisage the possible mechanism for germline transcriptome divergence in mammals by looking at the role of endogenous retroviruses (ERV) in the male germline. Despite their presumably deleterious effects on the germline, this class of transposable elements is known to regulate germline expression patterns, mainly through influencing post‑transcriptional control or promoter function. This time, however, the authors present evidence for the evolutionarily young K family of endogenous retroviruses (ERVKs) being used as cis-regulatory elements, containing binding sites for the master regulator of male meiosis and driving transcription of genes important for the mitosis-meiosis transition in the germline.

Key findings:

Endogenous retroviruses are differentially expressed at the mitosis-meiosis transition in the male germline, reside in open chromatin, exhibit enhancer-like histone marks and bind a key spermatogenesis master regulator

To test the hypothesis whether ERVs contribute to dynamic transcriptomic changes during spermatogenesis, the authors analyzed published RNA-seq datasets from 5 representative stages of mouse spermatogenesis from postnatal day 7 to adulthood (Fig.1 a).

Figure 1 a.) Spermatogenesis timepoints assessed for expression of repetitive elements. b.) Heatmap visualization of differential expression of repetitive elements throughout spermatogenesis and two classes of elements. Reproduced with author’s permission from Fig1 in Sakashita et al 2020.


Differential expression of mouse consensus repetitive elements revealed two temporally regulated groups of ERVS: mitotic and meiotic (Fig.1 b). The most represented types of TEs in both groups were members of ERVK, ERV1 and ERVL families of endogenous retroviruses. Interestingly, all of them were enriched in intergenic or intronic regions of the mouse genome and didn’t have any significant coding potential. Genome-wide chromatin accessibility assays showed that only some part of ERV loci exhibit open chromatin. Clustering of accessible ERV loci across different timepoints in spermatogenesis again resulted in mitotic and meiotic groups. The most represented family of meiotic class, ERVK, showed strong enrichment for three subgroups: RLTR10, RMER17 and RLTR51. Likewise, the enrichment of ATAC- peaks for these subgroups turned out to be specific for late spermatogenesis, based on the comparison to ATAC- seq data from cell lines derived from other tissues. In addition to tissue specificity, many of accessible RLTR10 and RMER17 in pachytene stage mapped to sex chromosomes compared to autosomes. This would mean that they become active despite ongoing meiotic sex chromosome inactivation at this meiotic phase.

Even more telling about the function of these ERVK open chromatin loci is the fact that they carry the histone modification H3K27ac, typical of enhancer sequences. Enhancer-like function is further supported by the presence of the binding site for A-MYB, a key transcription factor that acts as a master regulator of male germline from early meiosis. Chromatin immunoprecipitation with A-MYB antibodies recovers the same loci as ERVK enhancer-like loci on both autosomes and sex chromosomes. A significant portion of genes located in ERVK enhancer-like loci proximity are highly

expressed in meiosis and are involved in processes like chromosome segregation, chromatin silencing and spermatogenesis. More importantly, some of these genes are differentially expressed in A-MYB mutant mouse, indicating that A-MYB is indeed the mediator of ERVK enhancer-like activation of transcription.

Evolutionary conservation of ERVs driving species-specific spermatogenesis needs?

Perhaps even more surprising is the orthology assignment of these ERVK-adjacent genes. It seems, that for the most part, they consist of evolutionary young genes that share very little sequence identity (Fig 2), even if compared to the rat within the same clade. How frequent then is this phenomenon of ERV-based expression regulation in evolution? Analysis of human testis expression and immunoprecipitation datasets pointed at members of ERV1 and ERVK families, bearing binding motifs for A-MYB, as possible enhancers of mitosis-meiosis transition genes. This led the authors to propose that ERVK-driven meiotic enhancers are a general feature of mammals responsible for divergence of transcriptomes in late spermatogenesis.

Figure 2 Mouse ERVK-adjacent genes exhibit low sequence conservation across mammalian species. Reproduced with author’s permission from Fig5 in Sakashita et al 2020.


Why is this work important?

The work of Sakashita and colleagues nicely demonstrates the innovative use of repetitive elements in genome regulation. While it builds on previous studies that focused on the function of endogenous retroviruses in male meiosis, it offers a thought-provoking view on the possible involvement of retroviral elements in an important task such as forming a species-specific germline transcriptome and eventually the gametes. I enjoyed this different take on underlying mechanisms of controlling dynamics of the germline transcription landscape, while exploring the avenue of non-coding repetitive elements’ purpose that we still know very little about.


Questions for the authors:

  1. Did the authors look into differential expression of repetitive elements in oogenesis? Would they expect to see the same trend with sharp difference between mitotic and meiotic clusters?
  2. Could the authors describe their algorithm of k-means clustering of ATAC-peaks (used for Fig 2a) in more detail? Did the selection of k=2 come out as the most suitable option for the analysis, or were there any subtle changes caused by increasing the k value that would point to a finer and/or gradual regulation rather than a very “sharp” line on mitosis-meiosis transition?
  3. Approximately one-fifth of the ERVK-adjacent genes involved in mitosis-meiosis transition are under control of A-MYB transcription factor (results of A-MYB mutant mouse data analysis). Do the authors have any hypothesis on which potential TFs could bind to ERVK enhancer-like loci in proximity of the remaining ERVK-adjacent genes?
  4. As the authors outlined in the discussion, one possibility of how the accessibility of ERVK enhancers loci chromatin is being so tightly regulated at the mitosis-meiosis boundary is by the function of KRAB domain zinc finger proteins. However, the ATAC-seq data shows that in pre meiotic period, the chromatin of “meiotic” ERVK loci is closed. Could it then be simply inaccessibility that prevents A-MYB from binding rather than suppression by KRAB-ZF?
  5. Looking at the Figure 5a in the paper (Fig2 in here), it seems there is a large fraction of ERVK-adjacent genes (Supplemental table 2) with sequence identity >70 % (upper cluster in the Fig 5a). Are these the genes associated with the GO terms “chromosome segregation” and “chromatin silencing”, (or possibly some of the other enriched GO terms) rather than spermatogenesis? Perhaps, it would be more evident for the reader to distinguish which subset of these genes the text and conclusions refer to if the clusters were labelled (also in the Suppl. Table2 which could include the GO terms?). However, the overall conclusion in the paper is that the ERVK-adjacent genes are not conserved in closely related species, let alone other mammals (“no unambiguous homolog found”). Is it the portion of A‑MYB dependent genes that is species-specific and forms the basis for this conclusion? If so, did the authors try to explore the genomic environment of ERVK loci in other mammalian species for which spermatogenesis ATAC‑seq data are available?
  6. How likely do the authors think this phenomenon of using ERVs as gametogenesis specific enhancers can be found outside of mammals? Or perhaps even in invertebrates?


Posted on: 27th May 2020


Read preprint (No Ratings Yet)

  • Author's response

    Satoshi H. Namekawa shared

    Thank you very much for highlighting our study. We appreciate your nice summary of the importance of this study.
    We are very excited about this study. Please see my answers to your questions:

    1. We have not examined oogenesis yet; however, we are interested in testing this possibility in future studies. We expect that TEs function at the mitosis-to-meiosis transition in oogenesis. Notably, some related studies suggested that specific types of TEs function as cis-regulatory elements in oogenesis (Franke V. et al. Genome Res 2017, and Brind’Amour J. et al. Nat Commun 2018).
    2. We used the trinity package for k-means clustering with k=5. The optimal value of k was determined based on the hierarchical clustering of accessible ERVs in spermatogenesis. After k-means clustering, we only focused on two major sub-clusters: mitotic- and meiotic-types.
    3. Unfortunately, we do not have any hypothesis for other TFs. However, this is an interesting question. We already confirmed that several TFs, whose binding motifs were detected on enhancer-like ERVK loci in spermatogenesis, were highly expressed in meiosis. These data suggest that a specific TF network acts on young ERVs to drive germline transcriptomes.
    4. We still do not know why the chromatin of “meiotic” ERVK loci is closed in pre meiotic period. One possibility is KRAB-ZF binding to these ERVK loci induces closed chromatin.
    5. As you expected, adjacent genes that have higher sequence identities were associated with clear GO terms such as “chromosome segregation (D1pas1Nek10Stag3Cenpw, and Cdk5rap2”) and “chromatin silencing (Morf4l1Gm14920, H2afb1, and Msl3l2)”. By contrast, specific GO terms were not detected from most of the genes whose names starting with “Gm” and “LOC” or ending with “Rik,” because these genes were not annotated/characterized. Therefore, we could not expect biological significances of these unannotated genes by GO analysis (may need knock-out experiments). However, since several unannotated genes were also dysregulated in A-myb mutants, they were regulated in an A-MYB dependent manner. In our recent finding, sequences of the mouse (Mus musculus) enhancer-like ERVKs were less conserved across species, even rats and other Mus genus. To explore species-specific adaptions of enhancer-like ERVs in the regulation of adjacent late spermatogenesis genes, we are planning to explore the genomic environment of ERVK loci in other species, including ATAC-seq analysis.
    6. We expect that the mechanism underlying gene expression, mediated by species-specific ERVs, is conserved across species, even in invertebrates. This is an important question to be addressed.

    Have your say

    Your email address will not be published. Required fields are marked *

    This site uses Akismet to reduce spam. Learn how your comment data is processed.

    Sign up to customise the site to your preferences and to receive alerts

    Register here

    preLists in the developmental biology category:

    Planar Cell Polarity – PCP

    This preList contains preprints about the latest findings on Planar Cell Polarity (PCP) in various model organisms at the molecular, cellular and tissue levels.


    List by Ana Dorrego-Rivas

    Cell Polarity

    Recent research from the field of cell polarity is summarized in this list of preprints. It comprises of studies focusing on various forms of cell polarity ranging from epithelial polarity, planar cell polarity to front-to-rear polarity.


    List by Yamini Ravichandran

    TAGC 2020

    Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20


    List by Maiko Kitaoka, Madhuja Samaddar, Miguel V. Almeida, Sejal Davla, Jennifer Ann Black, Gautam Dey

    3D Gastruloids

    A curated list of preprints related to Gastruloids (in vitro models of early development obtained by 3D aggregation of embryonic cells)


    List by Paul Gerald L. Sanchez and Stefano Vianello

    ASCB EMBO Annual Meeting 2019

    A collection of preprints presented at the 2019 ASCB EMBO Meeting in Washington, DC (December 7-11)


    List by Madhuja Samaddar, Ramona Jühlen, Amanda Haage, Laura McCormick, Maiko Kitaoka

    EDBC Alicante 2019

    Preprints presented at the European Developmental Biology Congress (EDBC) in Alicante, October 23-26 2019.


    List by Sergio Menchero, Jesus Victorino, Teresa Rayon, Irepan Salvador-Martinez

    EMBL Seeing is Believing – Imaging the Molecular Processes of Life

    Preprints discussed at the 2019 edition of Seeing is Believing, at EMBL Heidelberg from the 9th-12th October 2019


    List by Gautam Dey

    SDB 78th Annual Meeting 2019

    A curation of the preprints presented at the SDB meeting in Boston, July 26-30 2019. The preList will be updated throughout the duration of the meeting.


    List by Alex Eve

    Lung Disease and Regeneration

    This preprint list compiles highlights from the field of lung biology.


    List by Rob Hynds

    Young Embryologist Network Conference 2019

    Preprints presented at the Young Embryologist Network 2019 conference, 13 May, The Francis Crick Institute, London


    List by Alex Eve

    Pattern formation during development

    The aim of this preList is to integrate results about the mechanisms that govern patterning during development, from genes implicated in the processes to theoritical models of pattern formation in nature.


    List by Alexa Sadier

    BSCB/BSDB Annual Meeting 2019

    Preprints presented at the BSCB/BSDB Annual Meeting 2019


    List by Gautam Dey

    Zebrafish immunology

    A compilation of cutting-edge research that uses the zebrafish as a model system to elucidate novel immunological mechanisms in health and disease.


    List by Shikha Nayar