The genomic basis of colour pattern polymorphism in the harlequin ladybird

Mathieu Gautier, Junichi Yamaguchi, Julien Foucaud, Anne Loiseau, Aurelien Ausset, Benoit Facon, Bernhard Gschloessl, Jacques Lagnel, Etienne Loire, Hugues Parrinello, Danny Severac, Celine Lopez-Roques, Cecile Donnadieu, Maxime Manno, Helene Berges, Karim Gharbi, Lori Lawson-Handley, Lian-Sheng Zang, Heiko Vogel, Arnaud Estoup, Benjamin Prud'homme

Preprint posted on June 13, 2018

How do ladybirds get their spots? A new study uses modern genomics to solve an old puzzle.

Selected by Fillip Port

Generations of children and biologists have marveled at the seemingly endless variations of colour and pattern on the back of ladybirds. As far as biologists are concerned, much of the attention has focused on the question of how such large phenotypic variation is encoded in the genome. Classic genetic experiments in the 1930s suggested that colour variation in ladybirds is encoded at a single locus, but the identity of that locus has remained enigmatic. The existence of a single colour pattern locus is puzzling, given that more than 200 colour patterns have been described, raising the question of what kind of mechanism can support the stable existence of so many phenotypes in a single species. A new preprint by the labs of Arnaud Estoup and Benjamin Prud’homme now offers fresh insights and presents strong evidence that variation in the cis-regulatory region of a single gene encoding a transcription factor is responsible.


The preprint

The authors started out by producing a new genome assembly of the harlequin ladybird Harmonia axyridis. For this they used a MinION sequencer, a device the size of a USB stick, capable of producing extremely long sequencing reads. They then used conventional short-read sequencing at high depth to assay the genomic variation present in 14 samples, each containing many ladybirds of various colours. Knowing the frequency of the different colour morphs in each sample allowed them to ask which genetic variants are likely to be associated with the different patterns. This strongly suggested the importance of a single locus, in agreement with the genetic experiments done 80 years earlier.
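The logic of relating pooled allele frequencies to morph frequencies can be illustrated with a toy sketch. This is not the authors' statistical model, which the summary above does not describe. It simply assumes that each pool gives one allele frequency per variant and one known frequency of a given colour morph, and ranks variants by the absolute Pearson correlation between the two. The variant names and all numbers are made up for illustration.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant series carries no association signal
    return cov / sqrt(var_x * var_y)


def rank_variants(allele_freqs, morph_freq):
    """Rank variants by |correlation| between allele and morph frequency across pools."""
    scores = {
        name: abs(pearson(freqs, morph_freq))
        for name, freqs in allele_freqs.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: five pools (the study used 14), invented values.
morph_freq = [0.1, 0.3, 0.5, 0.7, 0.9]  # frequency of one colour morph per pool
allele_freqs = {
    "variant_near_locus": [0.15, 0.35, 0.45, 0.75, 0.85],  # tracks the morph
    "variant_elsewhere": [0.50, 0.20, 0.60, 0.30, 0.50],   # unrelated
}

ranking = rank_variants(allele_freqs, morph_freq)
for name, score in ranking:
    print(f"{name}: |r| = {score:.2f}")
```

In this toy setting the variant whose frequency tracks the morph frequency across pools ranks first. A real analysis would also have to account for sampling noise within pools and for shared population history between samples, both of which this sketch ignores.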

The region identified by the genome-wide association study contains two genes, GATAe and pannier, which both encode transcription factors. Neither of these genes has so far been implicated in animal colouration, so the authors performed RNAi in ladybirds to directly test for a role in this process. While knocking down GATAe had no effect on colour patterning, animals injected with dsRNA against pannier lacked all dark pigmentation. This strongly suggested that pannier is the long-sought colour pattern gene in ladybirds. But how does genomic variation at this locus lead to all the different colour patterns?


Dark pigmentation in ladybirds requires expression of pannier. Top row: Naturally occurring colour patterns. Bottom row: Animals in which pannier expression was inhibited by RNAi. From Figure 2 of Gautier, Yamaguchi et al. 2018.


To get at this question, the authors first investigated the expression of pannier in ladybirds of different colour morphs. They found a strong correlation between the level of expression and the amount of dark cuticle, with the dark areas expressing high levels of pannier and light areas expressing low levels. These differential expression patterns appear to be encoded in the non-coding sequences surrounding pannier, which are highly diverse in animals of the different colour morphs. Intriguingly, genomic variation at this locus includes at least one very large inversion, which is expected to suppress recombination, a mechanism that could contribute to the stable existence of multiple alleles.


My take

Besides the iconic object of study, what I really love about this preprint is how the authors use modern genomics to shed light on a very old biological puzzle. The price and availability of sequencing have changed dramatically over the last 15 years, making large-scale sequencing projects feasible for individual labs. Furthermore, new sequencing technologies producing long sequencing reads make de novo genome assembly significantly easier and more accurate. Here, Gautier, Yamaguchi et al. take full advantage of these developments to re-address a long-standing question that seemed intractable for many years. In the course of doing so they show how genes are repurposed during evolution and how extensive variation at a single locus can drive a large degree of phenotypic variation in animals.


Future directions

This study takes full advantage of the first half of the genomic revolution: our ever-increasing ability to read genomes. What I anticipate next is harnessing the other half, our novel ability to rewrite the genetic code. Genome engineering using TALE nucleases has already been described in ladybirds. The CRISPR/Cas system, which in many respects is easier to handle, has been adopted in a large variety of species, and should hopefully also work in ladybirds. Genome engineering should allow precise modifications to the cis-regulatory region surrounding pannier and reveal which sequences are responsible for its differential expression and hence the development of the different colour patterns. This should lead to a more detailed picture of how genomic variation at a single locus can give rise to the more than 200 variations of the red and black pattern we all love.

Tags: evolution, genomics, ladybird, pannier

Posted on: 21st June 2018
