The genomic basis of colour pattern polymorphism in the harlequin ladybird

Mathieu Gautier, Junichi Yamaguchi, Julien Foucaud, Anne Loiseau, Aurelien Ausset, Benoit Facon, Bernhard Gschloessl, Jacques Lagnel, Etienne Loire, Hugues Parrinello, Danny Severac, Celine Lopez-Roques, Cecile Donnadieu, Maxime Manno, Helene Berges, Karim Gharbi, Lori Lawson-Handley, Lian-Sheng Zang, Heiko Vogel, Arnaud Estoup, Benjamin Prud'homme

Preprint posted on 13 June 2018

How do ladybirds get their spots? A new study uses modern genomics to solve an old puzzle.

Selected by Fillip Port

Generations of children and biologists have marveled at the seemingly endless variations of colour and pattern on the back of ladybirds. As far as biologists are concerned, much of the attention has focused on the question how such large phenotypic variation is encoded in the genome. Classic genetic experiments in the 1930s have suggested that colour variation in ladybirds is encoded at a single locus, but the identity of that locus has remained enigmatic. The existence of a single colour pattern locus is puzzling, given that more than 200 colour patterns have been described, raising the question what kind of mechanism can support the stable existence of so many phenotypes in a single species. A new preprint by the labs of Arnaud Estoup and Benjamin Prud’homme now offers new insights and presents strong evidence that variation of the cis-regulatory region of a single gene encoding a transcription factor is responsible.


The preprint

The authors started out by producing a new genome assembly of the harlequin ladybird Harmonia axyridis. For this they used a MinION sequencer, a device the size of an USB stick, capable of producing extremely long sequencing reads. They then used conventional short-read sequencing at high depth to assay the genomic variation present in 14 samples, each containing many ladybirds of various colours. Knowing the frequency of the different colour morphs in each sample allowed them to ask, which genetic variations are likely to be associated with the different patterns. This strongly suggested the importance of a single locus, in agreement with the genetic experiments done 80 years earlier.

The sequence identified by the genome-wide association study encodes two genes, GATAe and pannier, which both encode transcription factors. Neither of these genes has so far been implicated in animal colouration, so the authors performed RNAi in ladybirds to directly test for a role in this process. While GATAe had no effect on colour pattering, animals injected with dsRNA against pannier lacked all dark pigmentation. This strongly suggested that pannier is the long sought colour pattern gene in ladybirds. But how does genomic variation at this locus leads to all the different colour patterns?


Dark pigmentation in ladybirds requires expression of pannier. Top row: Naturally occuring colour pattern. Bottom row: Animals in which pannier expression was inhibited by RNAi. From Figure 2 of Gautier, Yamaguchi et al. 2018.


To get at this question the authors first investigated the expression of pannier in ladybirds of different colour morphs. They found that there is a strong correlation between the level of expression and the amount of dark cuticle, with the dark areas expressing high levels of pannier and light areas expressing low levels. These differential expression patterns appear to be encoded in the non-coding sequences surrounding pannier, which are highly diverse in animals of the different colour morphs. Intriguingly, genomic variation at this locus includes at least one very large inversion, which is expected to suppress recombination, a possible mechanism that could contribute to the stable existence of multiple alleles.


My take

Besides the iconic object of study, what I really love about this preprint is how the authors use modern genomics to shed light on a very old biological puzzle. The price and availability of sequencing has changed dramatically over the last 15 years, making large-scale sequencing projects feasible for individual labs. Furthermore, new sequencing technologies producing long sequencing reads make de novo genome assembly significantly easier and more accurate. Here, Gautier, Yamaguchi et al., are taking full advantage of these developments to re-address a long standing question that seemed intractable for many years. In the course of doing so they show how genes are repurposed during evolution and how extensive variation at a single locus can drive a large degree of phenotypic variation in animals.


Future directions

This study takes full advantage of the first half of the genomic revolution: Our ever increasing ability to read genomes. What I anticipate to happen next is to harness the other half – our novel ability to rewrite the genetic code. Genome engineering using TALE nucleases has already been described in ladybirds. The CRISPR/Cas system, which in many respects is easier to handle, has been adopted in a large variety of species, and should hopefully also work in ladybirds. Genome engineering should allow precise modifications to the cis-regulatory region surrounding pannier and reveal, which sequences are responsible for its differential expression and hence the development of the different colour patterns. This should lead to a more detailed picture of how genomic variation at one single locus can give rise to the more than 200 variations of the red and black pattern we all love.

Tags: evolution, genomics, ladybird, pannier

Posted on: 21 June 2018

Read preprint (No Ratings Yet)

Have your say

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Sign up to customise the site to your preferences and to receive alerts

Register here

preLists in the evolutionary biology category:

EMBO | EMBL Symposium: The organism and its environment

This preList contains preprints discussed during the 'EMBO | EMBL Symposium: The organism and its environment', organised at EMBL Heidelberg, Germany (May 2023).


List by Girish Kale

9th International Symposium on the Biology of Vertebrate Sex Determination

This preList contains preprints discussed during the 9th International Symposium on the Biology of Vertebrate Sex Determination. This conference was held in Kona, Hawaii from April 17th to 21st 2023.


List by Martin Estermann

EMBL Synthetic Morphogenesis: From Gene Circuits to Tissue Architecture (2021)

A list of preprints mentioned at the #EESmorphoG virtual meeting in 2021.


List by Alex Eve

Planar Cell Polarity – PCP

This preList contains preprints about the latest findings on Planar Cell Polarity (PCP) in various model organisms at the molecular, cellular and tissue levels.


List by Ana Dorrego-Rivas

TAGC 2020

Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20


List by Maiko Kitaoka et al.

ECFG15 – Fungal biology

Preprints presented at 15th European Conference on Fungal Genetics 17-20 February 2020 Rome


List by Hiral Shah

COVID-19 / SARS-CoV-2 preprints

List of important preprints dealing with the ongoing coronavirus outbreak. See for additional resources and timeline, and for full list of bioRxiv and medRxiv preprints on this topic


List by Dey Lab, Zhang-He Goh


SDB 78th Annual Meeting 2019

A curation of the preprints presented at the SDB meeting in Boston, July 26-30 2019. The preList will be updated throughout the duration of the meeting.


List by Alex Eve

Pattern formation during development

The aim of this preList is to integrate results about the mechanisms that govern patterning during development, from genes implicated in the processes to theoritical models of pattern formation in nature.


List by Alexa Sadier