Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 4;6(2):lqae032.
doi: 10.1093/nargab/lqae032. eCollection 2024 Jun.

Properties and predicted functions of large genes and proteins of apicomplexan parasites

Affiliations

Properties and predicted functions of large genes and proteins of apicomplexan parasites

Tiffany Fang et al. NAR Genom Bioinform. .

Abstract

Evolutionary constraints greatly favor compact genomes that efficiently encode proteins. However, several eukaryotic organisms, including apicomplexan parasites such as Toxoplasma gondii, Plasmodium falciparum and Babesia duncani, the causative agents of toxoplasmosis, malaria and babesiosis, respectively, encode very large proteins, exceeding 20 times their average protein size. Although these large proteins represent <1% of the total protein pool and are generally expressed at low levels, their persistence throughout evolution raises important questions about their functions and possible evolutionary pressures to maintain them. In this study, we examined the trends in gene and protein size, function and expression patterns within seven apicomplexan pathogens. Our analysis revealed that certain large proteins in apicomplexan parasites harbor domains potentially important for functions such as antigenic variation, erythrocyte invasion and immune evasion. However, these domains are not limited to or strictly conserved within large proteins. While some of these proteins are predicted to engage in conventional metabolic pathways within these parasites, others fulfill specialized functions for pathogen-host interactions, nutrient acquisition and overall survival.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
(A) Probability density function estimation of log size distribution of proteins in all genomes, smoothed with Gaussian kernel and compared with the ‘average eukaryotic protein size distribution’. (B) Genomic overview including genome size, GC content, number of CDSs, average protein length and number of large proteins normalized over genome size in Mb.
Figure 2.
Figure 2.
(A) Histogram showing distribution of protein size relative to number of total proteins in all parasite proteomes divided into size bins of 250 amino acids. (B) Distribution of protein size ratio over large proteins in all parasites.
Figure 3.
Figure 3.
GC content distribution by CDS length for whole genome and large genes of apicomplexan and outgroup species studied.
Figure 4.
Figure 4.
Log10-transformed scatterplots of TPM expression level by protein length across (A) the whole genome and (B) large proteins of each species, with trendlines for each.
Figure 5.
Figure 5.
Frequency of occurrence of significantly overrepresented (P < 0.001) GO classes in large proteins of all parasitic genomes and outgroups determined via hypergeometric distributions (GO annotation glossary is provided in Supplementary Table S3).
Figure 6.
Figure 6.
UpSet plots illustrating (A) the top 30 ortholog groupings of species in this study (top: total number of shared proteins among different organisms; bottom left: number of shared genes per organism) and (B) ortholog intersections in large proteins.
Figure 7.
Figure 7.
Top 10 most common conserved domains among large proteins of controls (A) S. cerevisiae and (B) E. histolytica as well as select Apicomplexa (C) T. gondii and (D) P. falciparum.

Similar articles

References

    1. Maier A.G., Matuschewski K., Zhang M., Rug M. Plasmodium falciparum. Trends Parasitol. 2019; 35:481–482. - PubMed
    1. World Health Organization World Malaria Report 2022. World Health Organization. 2022; 14–16.
    1. Ord R.L., Lobo C.A. Human babesiosis: pathogens, prevalence, diagnosis and treatment. Curr. Clin. Microbiol. Rep. 2015; 2:173–181. - PMC - PubMed
    1. Renard I., Ben Mamoun C. Treatment of human babesiosis: then and now. Pathogens. 2021; 10:1120. - PMC - PubMed
    1. Bloch E.M., Day J.R., Krause P.J., Kjemtrup A., O’Brien S.F., Tobian A.A.R., Goel R. Epidemiology of hospitalized patients with babesiosis, United States, 2010–2016. Emerg. Infect. Dis. 2022; 28:354–362. - PMC - PubMed