Mutation bias shapes gene evolution in Arabidopsis thaliana

J. Grey Monroe, Thanvi Srikant, Pablo Carbonell-Bejerano, Moises Exposito-Alonso, Mao-Lun Weng, Matthew T. Rutter, Charles B. Fenster, Detlef Weigel

Posted on: 6 July 2020 , updated on: 14 July 2020

Preprint posted on 18 June 2020

Article now published in Nature at http://dx.doi.org/10.1038/s41586-021-04269-6

Not so random: De novo mutations in Arabidopsis are biased due to cytogenetic features and have shaped gene evolution.

Selected by Facundo Romani

Categories: evolutionary biology, plant biology

Context

Mutations in the DNA are one of the main drivers of genome evolution in all organisms. These mutations include transitions and transversions (single nucleotide polymorphisms, SNPs), insertions and deletions (INDELS) that could impact regulatory regions and coding sequences and affect the fitness of the organisms. It was observed that mutation rates are influenced by the DNA sequence and epigenetic features in wild populations. However, this mutation bias is affected by strong selection. Lack of studies analysing large de novo mutation catalogues in plants not subject to strong selection limit our knowledge on whether this bias is independent of selection or not. Grey Monroe and colleagues reanalysed a collection of spontaneous mutations in A. thaliana and associated them with cytogenetic features (GC content, DNA methylation, histone marks, chromatin accessibility (ATAC-seq) and gene expression) to generate a regression model and compare it with natural variation.

Figure 2 from the pre-print. (A) Schematic representation of the regression model. (B-C) contribution of different cytogenetic features to the model. (D-E) Comparison of gene-level distribution compared predicted mutation rates and polymorphism in wild populations.

Major findings

The generated model weighed the contribution of each cytogenetic feature in mutation rate. Regions with high GC content had the lowest mutation rate, whereas chromatin accessibility showed the opposite trend. Histone modifications associated with active gene expression (such as H3K4me1, H3K27ac and H3H36me3) also showed lower mutation rates, whereas H3K9me1 and cytosine methylation were associated with high mutation rates. These correlations are consistent with works in mammals and yeast and suggest that the bias could be explained by the different target preference of the DNA mismatch repair machinery. In addition, the predictive model also has a similar gene-level distribution compared with polymorphisms in wild populations, with peaks in the transcription starting sites (TSS) and transcription termination sites (TTS). This suggests that the mutation bias observed in the natural population is a consequence of de novo mutation bias and not necessarily a product of selection.

Authors also analysed mutation bias in each gene feature (promoters, UTRs, exons, etc.). This is particularly interesting for coding regions which can have major impacts on fitness. They find that mutation frequency is correlated with functional constraints (synonymous vs non-synonymous mutations, gene expression level, etc.). Moreover, high mutation rates are anti-correlated with genes annotated with core biological function ontologies.

Future directions

The preprint questions many concepts generally accepted in the classic theories of evolution. It is also clear and concise regarding the problems that motivate the work and the answers that the authors provide with the existing data.

This preprint is a provocative piece with many important novel findings associated with features that are frequently passed over. Their findings will have a broad impact on the evolutionary biology community, not only plant biologist. The release of the preprint sparked a great and interesting discussion on social media between the authors and readers. In a very innovative initiative, the authors also open a Google docs file in order to receive feedback from the community. Many interesting questions remain open, particularly associated with genetic and epigenetic features that were not specifically covered in the regression model and downstream analysis, such as transposable and repetitive elements or nucleosome positioning. Also, there could be important differences between SNPs and INDELS that could be missed in the analysis when both events are combined in mutations as a whole (Lujan et al., 2014). Certainly, the preprint will open pathways to future works assessing the impact of mutation bias in evolutionary events. Recently, Boukas et al. (2020) have released another pre-print addressing similar questions in humans but focused on promoter region methylation and CpG islands.

References

Lujan, S. A., Clausen, A. R., Clark, A. B., MacAlpine, H. K., MacAlpine, D. M., Malc, E. P., Mieczkowski, P. A., Burkholder, A. B., Fargo, D. C., Gordenin, D. A., & Kunkel, T. A. (2014). Heterogeneous polymerase fidelity and mismatch repair bias genome variation and composition. Genome research, 24(11), 1751–1764. https://doi.org/10.1101/gr.178335.114

Boukas, L, Bjornsson H. T., Hansen K. D. (2020). Purifying selection acts on germline methylation to modify the CpG mutation rate at promoters. bioRxiv 2020.07.04.187880. https://doi.org/10.1101/2020.07.04.187880

Tags: arabidopsis, epigenetics, evolution

doi: https://doi.org/10.1242/prelights.22698

Read preprint

(No Ratings Yet)

Author's response

Grey Monroe shared

(FR) How was the experience of open the pre-prints for comments in a Google Docs? How the feedback from the community helped you to delineate the future version of the work?

Thank you for the thoughtful write up and questions!

This has been an exciting project to work on and when deciding how to proceed with publication of our findings we felt it would be important to expand the scale of peer review given that some of the results might be viewed by some as unorthodox. The preprint “phase” of publication, which has become so popular in the life sciences, provides an opportunity to seek input from the community before a paper even makes its way to an editor and formal peer review. In addition to the standard mechanisms of feedback after posting a preprint such as direct contacts, Twitter discussions, and comments on biorxiv, we decided to explore another option to make community peer review even simpler (and possibly even anonymous) – an open Google Doc of the manuscript in “comment-only mode”, where anyone could provide in-line comments.

We were at first unsure how well this experimental approach to peer review would go. No doubt there were some concerns that an open Google Doc might attract “trolls” or anyone using the anonymity of the internet to act in bad faith – but we have seen nothing of the sort. The comments we have received have all been constructive and many were very helpful. In addition to providing a direct outlet to facilitate community feedback, by posting the manuscript for all to comment, taking this open approach seems to have served as a message to the community that we are eager to benefit from feedback and we have been contacted by a number of researchers directly with constructive comments.

This feedback will considerably improve the future version of this work. One simple but valuable contribution from several researchers was pointing us in the direction of relevant references that we had overlooked. With a vast and ever-growing literature, crowdsourcing literature review is incredibly powerful and allows for gaps to be filled toward a more complete picture of the work. For example, we had missed a remarkable paper from 2004 that found functional bias in mutation hot and cold spots in humans (Chuang and Li 2004). In particular, genes involved in RNA processing tended to be associated with mutational “cold spots” in the genome, which we also found in our study.

Another direction we will examine based on community feedback is exploring the difference between mutation rates of single nucleotide variants, insertions, and deletions. A cursory comparison has revealed that single nucleotide mutations are more likely to be predicted by cytosine methylation than insertions and deletions, which is consistent with cytosine deamination being an important source of single nucleotide mutations but not indels.

Finally, we will use ideas based on feedback from the community to better articulate and improve our attempts to control for false positive calls (e.g., sequencing errors). Because DNA sequencing and mapping are imperfect, striking the right balance between filtering out false positives and keeping real variants is key for a project like this. We are now exploring new analyses to test how robust the results are here to such filtering to ensure that they are not an artifact of bias in the distribution of false positive calls that made it through our original filtering steps. One new step in our pipeline we are exploring is explicitly removing variants detected in an unexpectedly high number of independent lines as these may be more likely to be erroneous.

Overall, reaching out to the community and asking for input has been an incredibly valuable and positive experience. Not only is a more thorough and open peer review process good for the scientific literature as a whole, but when faced with surprising results like we were here, it brings peace of mind to know that the work has been rigorously examined by more than just a handful of reviewers and colleagues. We are extremely grateful to everyone for their time to read and respond to our recent preprint. We feel lucky to be part of such a generous community.

References
Chuang, Jeffrey H., and Hao Li. “Functional bias and spatial organization of genes in mutational hot and cold regions in the human genome.” PLoS Biol 2.2 (2004): e29.

Have your say Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Sign up to customise the site to your preferences and to receive alerts

Also in the evolutionary biology category:

Drift drives phenotypic evolution in a rapid island radiation

Jenna M. McCullough, Chad M. Eliason, Allison J. Shultz, et al.

Selected by 13 July 2026

Zoha Sadaqat

Discussion

Cell position is more important than cell shape or age for the acquisition of cell identity in the brown alga Ectocarpus

Denis Saint-Marcoux, Bernard Billoud, Sabine Chenivesse, et al.

Selected by 18 June 2026

Urvashi Goswami

Inhibition of the gut ceramidase Asah2 decelerates the vertebrate ageing rate

Ayami Takaochi, Kota Abe, Yuki Sugiura, et al.

Selected by 29 May 2026

Jeny Jose

Discussion

Also in the plant biology category:

A drought stress-induced MYB transcription factor regulates pavement cell shape in leaves of European aspen (Populus tremula)

Sijia Liu, Siamsa M. Doyle, Kathryn M. Robinson, et al.

Selected by 20 February 2026

Jeny Jose

Actin Counters Geometry to Guide Plant Cell Division

Camila Goldy, Samantha Moulin, Yutaro Shimizu, et al.

Selected by 26 November 2025

Jeny Jose

Discussion

The nucleus follows an internal cellular scale during polarized root hair cell development

Jessica M. Orr, M. Arif Ashraf

Selected by 04 September 2025

Jeny Jose

Discussion

preLists in the evolutionary biology category:

SciELO preprints – From 2025 onwards

SciELO has become a cornerstone of open, multilingual scholarly communication across Latin America. Its preprint server, SciELO preprints, is expanding the global reach of preprinted research from the region (for more information, see our interview with Carolina Tanigushi). This preList brings together biological, English language SciELO preprints to help readers discover emerging work from the Global South. By highlighting these preprints in one place, we aim to support visibility, encourage early feedback, and showcase the vibrant research communities contributing to SciELO’s open science ecosystem.

Mutation bias shapes gene evolution in Arabidopsis thaliana

Context

Major findings

Future directions

References

Share this:

Have your say Cancel reply

Sign up to customise the site to your preferences and to receive alerts

Also in the evolutionary biology category:

Drift drives phenotypic evolution in a rapid island radiation

Cell position is more important than cell shape or age for the acquisition of cell identity in the brown alga Ectocarpus

Inhibition of the gut ceramidase Asah2 decelerates the vertebrate ageing rate

Also in the plant biology category:

A drought stress-induced MYB transcription factor regulates pavement cell shape in leaves of European aspen (Populus tremula)

Actin Counters Geometry to Guide Plant Cell Division

The nucleus follows an internal cellular scale during polarized root hair cell development

preLists in the evolutionary biology category:

SciELO preprints – From 2025 onwards

November in preprints – DevBio & Stem cell biology

October in preprints – DevBio & Stem cell biology

October in preprints – Cell biology edition

Biologists @ 100 conference preList

‘In preprints’ from Development 2022-2023

preLights peer support – preprints of interest

EMBO | EMBL Symposium: The organism and its environment

9th International Symposium on the Biology of Vertebrate Sex Determination

EMBL Synthetic Morphogenesis: From Gene Circuits to Tissue Architecture (2021)

Planar Cell Polarity – PCP

TAGC 2020

ECFG15 – Fungal biology

COVID-19 / SARS-CoV-2 preprints

SDB 78th Annual Meeting 2019

Pattern formation during development

Also in the plant biology category:

SciELO preprints – From 2025 onwards

‘In preprints’ from Development 2022-2023

The Society for Developmental Biology 82nd Annual Meeting

CSHL 87th Symposium: Stem Cells

SDB 78th Annual Meeting 2019