Close

Evolution of tandem repeats in putative CSP to enhance its function: A recent and exclusive event in Plasmodium vivax in India

Manoswini Dash, Veena Pande, Aparup Das, Abhinav Sinha

Preprint posted on 28 November 2023 https://doi.org/10.1101/2023.11.28.568961

Is variation in repeats driving PVX_086150 evolution?

Selected by Alejandra Herbert Leffler's Lab

Categories: evolutionary biology

Background

Plasmodium vivax is the second most common underlying parasite responsible for causing malaria in humans. Malaria control and eradication programs need to redirect their efforts to target this species.Implementing targeted strategies requires understanding the parasites’ genetic features and the evolutionary processes that have molded this species. DNA repeats are common in the protein-coding genes of Plasmodium species. This underlying genetic feature can result in changed or novel protein functions over time, thus influencing parasite dynamics (Davies, et al., 2017). Despite harboring high variation within these repeat sites , few show signs of being under selection and most are following the expected neutral evolution.
Plasmodium proteins’ functions are poorly characterized, which is also true for repeat proteins. The circumsporozoite (CSP) gene is one example in Plasmodium containing variable repeats of clinical importance as some of these are the target of malaria vaccines. The repeat sequence motif in CSP is known to be variable both between and within parasite species, showing differences between strains of P. falciparum (Neafsey et al., 2015). Thus, CSP is an excellent protein model to understand the effects of repeat protein’ variation and its evolutionary significance. P. vivax possesses a locus, PVX_086150, which is a putative CSP gene in this species. The protein encoded by this gene structurally resembles the P. falciparum CSP protein, containing a hypervariable repeat region. In this preprint, the authors analyze the evolution of the putative P. vivax CSP loci and speculate on the selection forces that have allowed the expansion of the repeat region.

Key findings of the preprint.

Qualitative characterization of repeats in PvpuCSP (PVX_086150)
After confirming infection by P. vivax in samples and obtaining sequences of the target loci CSP,the authors screened for repeats using Tandem Repeats Finder (TRF) in PvpuCSP (PVX_086150) sequences. The repeat units were grouped into RATs, repeat allotypes, and PRMs, protein repeat motifs. The consensus of the repeat was found to be AGG[GA/TG]TAA[C/T]GC[C/T], and the variations produced four RATs encoding two PRMs (R-D-N-A; R-C-N-A). The order of the repeats 1-4 was also characterized showing that in most samples the repeats started with R1 and ended with R3, except in 3 isolates (G3948-GUJ, K13-ODI, D2115-DEL). The authors also found variation in the number of repeats within their samples ranging from 2 to 30.

Haplotype diversity and neutrality tests
After characterizing the repeats variation and structure, the authors created haplotypes from the 71 isolates. The haplotypes were found to vary across populations and were “mostly population specific”; furthermore, 5 haplotypes were shared among all analyzed populations. Overall diversity in the PvpuCSP (π= 0.00047, θW=0.00084) was smaller compared to the repeats’ region (π= 0.00513, θW=0.01762), thus leading the authors to conclude that gene diversity is mostly driven by the repeats’ region. Describing diversity among populations in a putative vaccine candidate locus is necessary for efficient vaccine development.

Selection tests and recombination
Tests of neutrality suggested some deviations from neutral evolution, but these were statistically nonsignificant. The authors conducted positive selection tests suggesting there might be some evidence to support directional selection, and results from site tests showed that most sites under positive selection fall within the repeat region. In contrast, branch tests suggested diversifying selection in two branches. Complementary recombination tests showed breaks at the start and end of the repeat region and no pattern was found to be related to any population repeat number of rearrangements.

The authors speculate that the selection of repeats could enhance the surface area for binding proteins targeted for degradation. Finally, the lack of repeat units in PvpuCSP orthologs species could be interpreted as a sign that repeat expansion in P. vivax is a recent evolutionary event, specific to this species populations, which could be targeted by new malaria elimination strategies.

What we like about the preprint

Malaria elimination requires targeting other Plasmodium species besides P. falciparum and this includes P. vivax. Recognizing and identifying evolutionary features that separates P. vivax from other malaria-causing species is necessary to implement targeted elimination.

Questions for the authors

-Do you think that statistically controlling for the number of sites would change any of your interpretations? In other words, do you think your results could be partially explained because repeats increase the power of detecting positive selection using site tests?

-Could you functionally validate that the repeats expansion on PvpuCSP are selected due to their role in protein degradation?

-If you were able to follow up and do a longitudinal collection of samples, do you expect consistency in branch tests where evidence of positive selection is found in the “newest” samples”?

-Given what you found, would you recommend this locus for a vaccine? What other applications would you suggest given your results?

References

Davies HM, Nofal SD, McLaughlin EJ, Osborne AR. Repetitive sequences in malaria parasite proteins. FEMS Microbiol Rev. 2017 Nov 1;41(6):923-940. doi: 10.1093/femsre/fux046. PMID: 29077880; PMCID: PMC5812512.
Neafsey DE, Juraska M, Bedford T, et al. Genetic diversity and protective efficacy of the RTS, S/AS01 malaria vaccine. N Engl J Med 2015;373:2025–37.

 

Posted on: 18 March 2024 , updated on: 19 March 2024

doi: https://doi.org/10.1242/prelights.36592

Read preprint (No Ratings Yet)

Author's response to our questions

Abhinav Sinha shared

-Do you think that statistically controlling for the number of sites would change any of your interpretations? In other words, do you think your results could be partially explained because repeats increase the power of detecting positive selection using site tests?

A perpetual dilemma exists in understanding the interplay between the evolutionary conservation and rapid alterations of the repeat region within a protein-coding gene and its association with biological processes. When the evolution of repeats is in a phase of rapid change, marked by the incorporation of new repeats into the gene, it signifies an ongoing event. During this phase, there is a notable presence of strongly relaxed purifying selection and positive selection, facilitating changes, as demonstrated in previous research (Persi et al., 2016, https://www.nature.com/articles/ncomms13570#Sec10), a phenomenon we also observed in the PvpuCSP locus. However, once the repeats are integrated and assume a defined function, they experience maintenance through robust purifying selection. Importantly, this phenomenon is not attributable to technical bias; rather, it is rooted in the genuine existence of repeats that are unique to P. vivax. Therefore, in a way the presence of repeats in PvpuCSP gene might increase selection power, at the current evolutionary dynamics.

– Could you functionally validate that the repeats expansion on PvpuCSP are selected due to their role in protein degradation?
It would be challenging and fascinating endeavor to functionally validate the role of repeats in protein degradation and their selection based on repeat expansion. It is worth a try using alternative parasite model phylogenetically closer to P. vivax and amenable to in vitro culture, circumventing the challenges associated with long-term culture issues specific to P. vivax. Nevertheless, I believe first it is essential to validate the role of PvpuCSP as an E3 ubiquitin ligase in vitro and in vivo. A knock-out study, maybe in P. berghei, might provide valuable insights into the significance of this locus.

-If you were able to follow up and do a longitudinal collection of samples, do you expect consistency in branch tests where evidence of positive selection is found in the “newest” samples”?
Yes, I believe there would be a signature of positive selection in the newest samples, if it is an ongoing process. In other words, it could be verified if it is an ongoing process by looking into newer samples. At the same time, one would be more careful and exercise caution in choosing samples as the selection dynamics get influenced by many factors.

-Given what you found, would you recommend this locus for a vaccine? What other applications would you suggest given your results?
It’s too soon to suggest the locus as a vaccine candidate since there hasn’t been a dedicated study to explore its potential in that regard. However, it’s worth investigating. Rather, based on the current understanding, which is again of course still at early stage, I would recommend exploring the locus as a therapeutic target because of its biological function in parasites. Many ubiquitin ligases are being evaluated for their therapeutic efficacy. Putative CSP as an E3 ubiquitin ligase may hold significance as a promising drug target

Have your say

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Sign up to customise the site to your preferences and to receive alerts

Register here

preLists in the evolutionary biology category:

‘In preprints’ from Development 2022-2023

A list of the preprints featured in Development's 'In preprints' articles between 2022-2023

 



List by Alex Eve, Katherine Brown

preLights peer support – preprints of interest

This is a preprint repository to organise the preprints and preLights covered through the 'preLights peer support' initiative.

 



List by preLights peer support

EMBO | EMBL Symposium: The organism and its environment

This preList contains preprints discussed during the 'EMBO | EMBL Symposium: The organism and its environment', organised at EMBL Heidelberg, Germany (May 2023).

 



List by Girish Kale

9th International Symposium on the Biology of Vertebrate Sex Determination

This preList contains preprints discussed during the 9th International Symposium on the Biology of Vertebrate Sex Determination. This conference was held in Kona, Hawaii from April 17th to 21st 2023.

 



List by Martin Estermann

EMBL Synthetic Morphogenesis: From Gene Circuits to Tissue Architecture (2021)

A list of preprints mentioned at the #EESmorphoG virtual meeting in 2021.

 



List by Alex Eve

Planar Cell Polarity – PCP

This preList contains preprints about the latest findings on Planar Cell Polarity (PCP) in various model organisms at the molecular, cellular and tissue levels.

 



List by Ana Dorrego-Rivas

TAGC 2020

Preprints recently presented at the virtual Allied Genetics Conference, April 22-26, 2020. #TAGC20

 



List by Maiko Kitaoka et al.

ECFG15 – Fungal biology

Preprints presented at 15th European Conference on Fungal Genetics 17-20 February 2020 Rome

 



List by Hiral Shah

COVID-19 / SARS-CoV-2 preprints

List of important preprints dealing with the ongoing coronavirus outbreak. See http://covidpreprints.com for additional resources and timeline, and https://connect.biorxiv.org/relate/content/181 for full list of bioRxiv and medRxiv preprints on this topic

 



List by Dey Lab, Zhang-He Goh

1

SDB 78th Annual Meeting 2019

A curation of the preprints presented at the SDB meeting in Boston, July 26-30 2019. The preList will be updated throughout the duration of the meeting.

 



List by Alex Eve

Pattern formation during development

The aim of this preList is to integrate results about the mechanisms that govern patterning during development, from genes implicated in the processes to theoritical models of pattern formation in nature.

 



List by Alexa Sadier
Close