Comparative Proteomic Analysis of <em>Toxoplasma gondii</em> RH Wild-Type and Four SRS29B (SAG1) Knock-Out Clones Reveals Significant Differences between Individual Strains

Kai Pascal Alexander Hänggeli; Andrew Hemphill; Norbert Müller; Manfred Heller; Uldry Anne-Christine; Sophie Braga-Lagache; Joachim Müller; Ghalia Boubaker

doi:10.20944/preprints202305.1482.v1

Submitted:

19 May 2023

Posted:

22 May 2023

You are already at the latest version

Abstract

In T. gondii, as well as in other model organisms, gene knock-out using CRISPR-Cas9 is a suitable tool to identify the role of specific genes. The general consensus implies that only the gene of interest is affected by the knock-out. Is this really the case? In a previous study, we have generated knock-out (KO) clones of TgRH88_077450 (SRS29B; SAG1) which differed in the numbers of integrated dihydrofolate-reductase-thymidylate-synthase (MDHFR-TS) drug-selectable marker. Clones 18 and 33 had a single insertion of MDHFR-TS within SRS29B. Clone 6 was disrupted by the insertion of a short unrelated DNA-sequence, but the marker was integrated elsewhere. In clone 30, the marker was inserted into SRS29B and several other MDHFR-TS copies were found in the genome. KO and wild-type (WT) tachyzoites had similar shape, dimensions and vitality. This prompted us to investigate the impact of the genetic engineering as such on the overall proteome patterns of the four clones as compared to the respective WT. Comparative shotgun proteomics of the five strains was performed. Overall, 3236 proteins were identified. Principal component analysis of the proteomes revealed five distinct clusters corresponding to the five strains by both iTop3 and iLFQ algorithms. Detailed analysis of the differentially expressed proteins revealed that the target of the KO, srs29B, was lacking in all KO clones. Besides this protein, twenty other proteins were differentially expressed between KO clones and WT or between different KO clones. The protein exhibiting the highest variation between the five strains was srs36D encoded by TgRH_016110. The deregulated expression of SRS36D was further validated by quantitative PCR. Moreover, the transcript levels of three other selected SRS genes, namely SRS36B, SRS46, and SRS57 exhibited significant differences between individual strains. These results indicate that knocking out a given gene may affect the expression of other genes. Therefore, care must be taken when specific phenotypes are regarded as a direct consequence of the KO of a given gene.

Keywords:

genetic manipulation

;

pleiotropic effects

;

systems biology

Subject:

Biology and Life Sciences - Biochemistry and Molecular Biology

1. Introduction

Human toxoplasmosis, caused by the apicomplexan parasite Toxoplasma gondii is one of the most prevalent parasitic diseases with one-third of the human population on planet earth being chronically infected [1]. In Europe and North America, three main genotypes (I-III) prevail [2], but new and atypical genotypes, distinct from I-III, have been discovered, most notably in South America [3].

Since over three decades T. gondii has been exploited as an excellent molecular genetic model to investigate intracellular apicomplexan parasites [4]. T. gondii ME49, a type II strain, has been the first of several strains whose genomes have been sequenced (toxodb.org), and CRISPR-CAS mediated genome engineering [5] has led to the identification of a large number of fitness conferring genes [6]. It is widely assumed that the inactivation of a specific gene can cause a reproducible and identical impact on parasite biology, regardless of the strain or the genetic modification methods employed. This is largely evidenced by applying a single comparison frame "one modified clone versus wildtype parasites" in many reports.

However, since genetic engineering is prone to interfere with fundamental cellular functions, it is noteworthy investigating which impact this procedure as such has on the systems biology of the organism. A suitable strategy to investigate such effects is the analysis of differentially expressed proteins in transformed clones and respective non-transformed wildtype (WT) parasites. For instance, in Giardia lamblia, nearly 10 % of the overall proteome is altered upon transformation with a stable plasmid [7]. In order to investigate to what degree protein expression patterns other than the intended knockout (KO) are affected in T. gondii, a non-essential gene should be chosen whose KO has no impact on fitness, at least not during in vitro culture of tachyzoites.

T. gondii, as well as other apicomplexan protozoans, express a panoply of surface proteins involved in many aspects of host-parasite interactions [8]. The surface antigen 1 (sag1), sometimes designated as p29 or p30, is the predominant surface antigen of T. gondii [9]. It is a member of a family of closely related proteins called “sag-related sequence proteins (srs proteins)”, as evidenced three decades ago [10,11]. First genomic sequencing efforts have revealed the existence of a family of 161 SRS genes with two subfamilies with SAG1 and SAG2A as prototypic members in the Me49 strain [12]. Later, this number has been corrected to 109 SRS genes [13]. These genes are designated as SRS followed by a number. For instance, SAG1 is named SRS29B, SAG2A became SRS34A. Proteins encoded by SRS genes have one to four cysteine-rich SRS protein domains and a c-terminal glycosyl-phosphatidyl-inositol (GPI) anchor domain providing a flexible attachment on the parasite surface [13]. SRS29B KOs generated earlier by chemical mutagenesis [14] or more recently by genetic engineering [5,9,13] do not exhibit marked effects on proliferation during in vitro culture. Thus, SRS29B is a suitable candidate to test our hypothesis that generating a KO clone impacts on protein expression patterns other than the intended KO target protein.

In a previous study, using CRISPR-Cas9, we have generated in the type I strain T. gondii RH four SRS29B KO clones, which display different integration patterns of the dihydrofolate reductase-thymidylate synthase (MDHFR-TS) selection marker [15]. The clones differ regarding the copy numbers and the mdhfr-ts insertion pattern. Clones 18 and clone 33 both have a single-copy MDHFR-TS insertion within SRS29B. Clone 6 also has a single-copy MDHFR-TS insertion in the genome, although not within SRS29B. Instead, SRS29B was disrupted by the insertion of a short DNA sequence unrelated to the resistance marker. Clone 30 exhibits multiple copies of MDHFR-TS in the genome, with one copy disrupting SRS29B [15]. In the present study, we compare the proteome patterns of these four clones to their respective WT T. gondii RH using whole cell shotgun LC-MS/MS and show that – besides the intended KO – the expression patterns of other proteins are altered.

2. Results

Differential proteomic analysis of the Tg_SRS29B KO clones versus their corresponding T. gondii RH wildtype yielded 36476 unique peptides matching to 3236 T. gondii proteins. The complete dataset is available as supplemental Table S1. Overall analysis of the data by principal component analysis revealed four non-overlapping clusters of the biological replicates of WT, clone6/clone33, clone18, and clone 30 by both iLFQ and iTop3 algorithms. The biological replicates of clones 6 and 33 were closer to each other than to other strains (Figure 1).

A more detailed analysis revealed that twenty-one proteins were significantly downregulated between the WT and the KO clones or between the different KO clones, taking into account both Top3 and LFQ algorithms to enhance belief in our results.

Peptides belonging to protein srs29B or sag1 encoded by the open reading frame (ORF) TgRH88_077450 (Figure 2A) were unambiguously detected in WT tachyzoites, only. In three of the KO clones, C6, C18, and C33, sag1 peptides were detected in trace amounts (ca. three orders of magnitude lower than in the WT) in one of three biological replicates, only. In clone 33, sag1 peptides were not detectable (Figure 2 B; Table S1). The protein with the highest identity to srs29B, the syntenic srs29C encoded by TgRH88_077470 (E value 6e^-31) did not significantly differ between WT and KO clones (Figure 2B).

Altered expression levels were detected in KO clones not only for srs29B, the target protein of the KO, but also for fifteen other proteins (Table 1). Eight of these proteins were differentially expressed between WTand KO clones only, six proteins were also differentially expressed between individual KO clones. None of these proteins was differential in all KO clones as compared to the WT (Table 1).

Moreover, six proteins were differential between different TgRH_SRS29B KO clones only. These proteins are listed in Table 2.

Interestingly, the list of differentially expressed proteins comprised – besides the intended KO SRS29B – two other SAG-related sequences, namely srs16C, encoded by the ORF TgRH88_006930 and srs36D, encoded by TgRH88_016110. Note that srs16C had significantly lower abundances in clone 33 as compared to the WT, clones 18 and clone 30, srs36D was equally abundant in wildtype and clone 30, and had significantly lower expression levels in the other clones. The abundances srs36D and srs16C in WT tachyzoites were one and two magnitudes lower, respectively, compared to the abundance of srs2929B (Figure 3; ORF 006930, ORF 016110).

Amongst the four other proteins with differential expression between WT and KO clones and between clones was the protein encoded by TGRH88_012840. Annotated as an unspecific product in the TgRH88 database and as putative transmembrane protein in other strains, this protein had equal abundances in WT and clone 30 and exhibited lower levels in the other clones (Figure 3; ORF 012840). Due to high variations between biological replicates, the expression levels of ORF 012840 were significantly different only between WT and clone 6, and clone 30 and clone 6, respectively, while clone 18 expression levels exhibited no significant difference to the WT. This protein was expressed in similar amounts as srs36D in WT and clone 30 tachyzoites (Fig 3; ORF 012840). Another protein with similar levels in WT and clone 30 was the protein encoded by ORF 017250, while in the three other clones, this protein was not detectable. Its expression levels in WT and clone 30 tachyzoites was in the same range as seen for srs16C (Figure 3; ORF 017250). The protein encoded by ORF 004470 had lower levels in clone 33 as compared to WT and the other clones (Figure 3; ORF 004470). Another expression pattern was observed with the protein encoded by TgRH_021190, a cAMP-dependent protein kinase. This protein was expressed at significantly higher levels in clone 6 than in WT tachyzoites and the other clones (Figure 3; ORF 021190). Finally, the protein encoded by ORF 046380, a histone lysine methyltransferase set1 homolog, could be detected in tachyzoites from clones 18 and 30 only.

Since SRS proteins other than sag1 were differentially expressed in the KO clones, we investigated the relative abundance of SRS proteins in WT and KO clones. In WT tachyzoites, 37 SRS proteins were detectable in at least two biological replicates, and srs29B (sag1, encoded by ORF 077450) was the most abundant one, with nearly 50% of SRS proteins being srs29B. The second-most abundant SRS protein was srs34A (23 %; encoded by ORF 080370), followed by srs25 (encoded by ORF 024520), srs52A (ORF 054180), and srs57 (057630). The remaining 33 SRS proteins amounted to 12 %. In all KO clones, srs34A (sag2A) became the most abundant SRS protein representing between 40 % and 47% of total SRS proteins in clone 18 and 33, respectively. The second most abundant SRS protein in these KO clones was srs25, followed by srs57 and srs52A as third and fourth most-abundant SRS proteins in all clones. In clone 18, srs29C (ORF 077470) was the fifth most abundant SRS protein, in all other clones, srs20A (ORF 026130) took this place. A schematic overview of the relative abundances based on iBAQ values of the proteins is presented as pie charts in Figure 4, the corresponding data are given as supplemental Table S2. A complementary presentation of the relative abundances of the major SRS proteins in all strains is shown in Table S2 (second page).

The respective relative abundances of the differentially expressed srs16C (ORF 006930) were 0.1% in WT and clone 6, 0.3 % in clones 18 and 33 and less than 0.01 % in clone 33. The relative abundance of srs36D (ORF 016110) in WT tachyzoites was one magnitude higher than the abundance of srs16C amounting to 1%. This protein was equally abundant in clone 30 with 1.5% of total SRS proteins. In clone 18, it represented only 0.1%, and was not detectable in clone 6 and clone 33 tachyzoites (Table S2).

When it comes to less abundant SRS proteins, however, the difference of their relative abundances (see Table S2) between the strains became clearly visible, as visualized by plotting the ranking of the relative abundances against the ranking of the WT proteins (Figure 5).

To investigate whether this observation could be generalized to other groups of proteins, we plotted the relative abundance rankings of four other classes of excretory/secretory proteins, namely dense granule proteins (GRA; 15 proteins detected), microneme proteins (MIC; 16 proteins), rhoptry proteins (ROP; 34 proteins) and rhoptry neck proteins (RON; 9 proteins) based on the dataset presented in Table S3. As shown in Figure 5, the rankings of proteins of these groups were more homogeneous with only one permutation in the RON group.

These results prompted us to determine the relative levels of transcripts of ORFs encoding four selected SRS proteins. Besides ORF 016110 encoding the differentially expressed srs36D, ORF 016080 encoding srs36B (sag9) was included. This SRS protein was expressed not only in tachyzoites, but also in bradyzoites [16] of type II and III strains (profiles in ToxoDB). Furthermore, ORF 040330 encoding srs46 with constant expression levels in cell cultures (profiles in ToxoDB), and ORF 057630 encoding srs57 (sag3), one of the more abundant SRS proteins in all strains and an important virulence factor [17] were included. The full dataset is presented as supplemental Table S4. However, the mRNA levels of ORF 057630 were similar among all clones, except for clone 18 where the transcript levels were 1.5 times higher than in the other strains. Both ORFs 0161080 and 016110 had significantly lower transcript levels in all KO clones as compared to the wildtype. In the case of ORF 016080, clones 6 and 33 had significantly lower levels than clones 18 and 30, in the case of ORF 016110, the transcript levels in these two clones were significantly lower compared to clone 30 only. Significantly lower transcript levels were also noted for ORF 040330 between WT and clone 33 tachyzoites (Figure 7).

3. Discussion

Overall, upon application of very strict criteria for determining differential expression levels, SRS29B KO clones and WT tachyzoites vary by 21 out of 3236 proteins, which corresponds to 0.65%. This variability is far below the nearly 10% of differentially expressed proteins detected in WT and transfected clones of the phylogenetically distant protist Giardia lamblia, for which the same strict criteria had been applied [7]. However, as illustrated by the principal component analysis plots, the herein presented datasets on T. gondii WT and KO clones do not simply form two clusters of T. gondii WT proteome and KO proteome. Instead, WT and each KO clone cluster separately, with clone 6 (SRS29B disrupted by a short sequence) and 33 (SRS29B disrupted by resistance marker) closer together than the other clones. The proteomes from KO clones 18 and 30 (SRS29B disrupted by resistance marker; but multiple insertions of the resistance marker in the genome of clone 30) are separated from the WT proteomes by one principal component, only. Conversely, the proteomes of clone 33 (with a single integration of the resistance marker into SRS29b and supposedly similar to clone 18 in terms of targeted SRS29B gene editing [15]) and of clone 6 were separated from the clone 18 and WT proteomes by both principal components. A potential factor involved in these proteomic differences among seemingly identical SRS29B KO clones could be the high and divergent phenotypic plasticity of T. gondii, in particular with respect to parasite-host-interaction [18,19]. In particular, expression levels of SRS genes are highly variable, as shown by transcript analyses of subclones of a given parental strain during ten division cycles [20]. Therefore, it is not surprising to find – besides the intended KO of srs29B – five other SRS proteins as well as five putative transmembrane proteins within the set of differentially expressed proteins. Regarding the complete set of SRS gene products identified in the proteomes, we see differences in numbers of unambiguously identified proteins (30 to 38 depending on the strains), as well as fluctuations of the relative abundances of these proteins between the strains. These fluctuations are less pronounced in other groups of proteins involved in host-parasite interactions. Transcript analyses of four selected ORFs encoding SRS proteins revealing differences between individual KO and WT tachyzoites point into the same direction. In a previous study, double SAG1-SAG2-KOs were followed by the upregulation of SRS29C expression resulting in a reduction of virulence in mice [13]. In our single KOs, however, srs29C protein levels were not significantly affected.

Interestingly, variable expression of membrane proteins as a response to various biotic and abiotic stresses is a well-known phenomenon in the intestinal protozoan G. lamblia (see for review [21] and [7,22,23] for more recent proteomic investigations) and in Trypanosoma sp. [24,25] (both superkingdom Excavata), and may thus be a more general phenomenon amongst eukaryotes. These effects might be the result of epigenetic effects such as histone acetylation or methylation [26-28]. The fact that a histone methyltransferase homolog is amongst the differentially expressed protein in our dataset underlines this hypothesis. Moreover, genomic DNA modifications may result in antigenic variations. In Trypanosoma brucei, antigenic variation is triggered by DNA breaks and recombination events [29,30]. The variability among KO clones of the same gene seen is not restricted to protozoans. It has been shown that KO and WT mice exhibit variabilities in gene expression far beyond the intended KOs [31]. Last, but not least, clonal variation resulting from the selection procedures necessary for obtaining the desired transgenic cell lines may be responsible for the observed variability in gene expression and proteome composition [32,33].

Taken together, our data – in alignment with previously published findings – show that genetic modifications may result in pleiotropic effects in the resulting cell lines. Therefore, it is difficult to establish a causal relationship between the affected gene locus (e.g. via KO) and the function of the corresponding gene. For phenotypical analyses (if intended), add-backs should be performed not only with one, but several KO clones, and their respective proteomes resp. transcript profiles should be analyzed.

4. Materials and Methods

4.1. Chemicals

If not stated otherwise, all tissue culture media were purchased from Gibco-BRL (Zürich, Switzerland), and biochemicals from Sigma (St. Louis, MO). Primers for real-time PCR (RT-PCR) were purchased from Eurofins (Luxemburg, Luxemburg).

4.2. Host cell culture, maintenance and purification of tachyzoites

Tachyzoites of T. gondii RH WT and the four SRS29B KO clones were maintained in vitro in human foreskin fibroblasts (HFF; PCS-201-010™) as previously described [34].

For analyses of the proteomes, T75 cell culture flasks were seeded with HFF (2 x 10⁶) and were cultured at 37°C / 5% of CO₂. When approximately 80% confluent, HFF monolayers were infected with 1 x 10⁶ tachyzoites. After 72 hours of culture, heavily infected monolayers were scraped, syringe lysed with a 25G needle, and filtered through a 47 mm diameter polycarbonate disc filter membrane (pore size 3 µm). The resulting tachyzoite suspensions were centrifuged (15 min, 1000 × g, 4 °C). To remove the residual media, pellets were washed twice with PBS, and parasites were stored as pellets at -80°C.

4.3. Proteomics

Cell pellets were lysed in 100 μL 8M urea/100 mM Tris/HCl pH 8, reduced with 10 mM DTT for 30 min at 37°C, alkylated with 50 mM iodoacetamide for 30 min in the dark, and proteins precipitated with 5 volumes of acetone at -20 ◦C overnight. The pellets were suspended in 8M urea in 50 mM Tris/HCl pH 8 to an estimated protein concentration of 2 mg/mL. Effective protein concentration was determined with Qubit Protein Assay (Invitrogen). The urea concentration was reduced to 1.6 M by dilution with 20 mM Tris/HCl, 2 mM CaCl₂ and an aliquot corresponding to 10 μg protein was digested with sequencing grade trypsin (Promega) at room temperature overnight at a protein-to-protease ratio (w/w) of 50:1. Digestions were stopped with TFA at a final concentration of 1% (v/v).

The digests were analyzed by nano-liquid chromatography mass spectrometry system consisting of an Ultimate 3000 (ThermoFischer Scientific, Reinach, Switzerland) coupled to a timsTOF Pro (Bruker Daltonics, Bremen, Germany) through a CaptiveSpray source (Bruker, Bremen, Germany) with an end-plate offset of 500 V, a drying temperature of 200 °C, and with the capillary voltage fixed at 1.6 kV. A volume of 2µL (200ng) from the protein digest were loaded onto a pre-column (C18 PepMap 100, 5µm, 100A, 300µm i.d. x 5mm length, ThermoFisher) at a flow rate of 10µL/min with 0.05% TFA in water/acetonitrile 98:2. After loading, peptides were eluted in back flush mode onto a homemade C18 CSH Waters column (1.7 μm, 130 Å, 75 μm × 20 cm) by applying a 90-minute gradient of 5% acetonitrile to 40% in water / 0.1% formic acid, at a flow rate of 250 nl/min. The TimsTOF Pro was operated in data-dependent acquisition mode using Parallel Acquisition SErial Fragmentation (PASEF). The mass range was set between 100 and 1700 m/z, with 10 PASEF scans in an ion mobility window of 0.6 and 1.6 V s/cm2. The accumulation time was set to 2 ms, and the ramp time was set to 100 ms. Fragmentation was triggered at 20,000 arbitrary units (au), and peptides (up to charge 5) were fragmented using collision induced dissociation with energies set between 20 and 59 eV dependent on peptide precursor mass.

Mass spectrometry data were processed by MaxQuant [35] software, version 2.0.1.0, against the ToxoDB-52_TgondiiRH88_AnnotatedProteins and a contaminants fasta protein sequence database with matching between runs option using a matching time window of 0.7 min and an ion mobility window of 0.05 1/K0, and giving each cell line a non-consecutive fraction number in order to prevent over-interpretation, respectively. Strict trypsin cleavage rule was applied, allowing for up to three missed cleavages, variable modifications of protein N-terminal acetylation and oxidation of methionine, static modification of cysteine with carbamidomethylation. Precursor and fragment mass tolerances were set to 20 ppm. Peptide spectrum matches, peptide and protein group identifications were filtered to a 1% false discovery rate (FDR) based on reversed database sequence matches, and a minimum of two razor or unique peptides were required to accept a protein group identification. MaxQuant’s Intensity Based Absolute Quantification (iBAQ) values were used to calculate relative abundance by equalizing their sum in each sample. The comparison of protein abundance between groups was made using both MaxQuant’s Label Free Quantification (LFQ) values as well as Top3 values (sum of the 3 most intense peptide form intensities), as reported elsewhere [7,36]. Protein identifications from the contaminants database (e.g. trypsin or BSA) as well as proteins identified only by site were removed for statistical validation.

4.4. Data processing and statistics

Differential protein abundance tests between the different sample groups were done based on LFQ and Top3 protein intensities as described elsewhere. For each strain, three biological and three technical replicates were processed [36]. As in previously published studies [7,37], to enhance belief in our data, only proteins, which were differential between the strains by both LFQ and Top3 intensities were regarded as differentially expressed.

4.5. Quantification of transcripts by multiplex TaqMan – qPCR

Total RNA was extracted from tachyzoites using the NucleoSpin RNA isolation kit (Macherey-Nagel, Duren, Germany). Furthermore, 1 μg of total RNA template was reverse transcribed to cDNA by GoScript Reverse Transcription System (Promega, Amriswil Switzerland) using random hexamer primer under the following conditions: 5 min at 25 °C, 60 min at 42°C, 15 minutes at 70°C, reactions were then cooled down to 4°C. All q-PCRs were performed in a Bio-Rad CFX 96 QPCR instrument (Biorad) using the FastStart Essential DNA Probes Master (Roche, Basel, Switzerland). The total volume for qPCR was 10 μl, consisting of 5 μl of 1x SensiFast master mix (Bioline, Meridian Bioscience, Cincinnati, Ohio, USA ), 0.5 μM of reverse and forward primers, 0.1 μM of each probe, 0.3 mM dUTP, one unit of heat-labile Uracil DNA Glycosylase (UDG) and 2.8 μl of cDNA template (diluted 1:5). The primers and probes for four ORFs encoding SRS proteins and for two reference genes are listed in Table 3.

Quantitative PCRs were performed according to the following thermal profile: (1) initial incubation of 10 min at 42°C, followed by (2) denaturation step of 5min at 95°C and (3) 50 cycles of two-step amplification (10 s at 95°C and 20 s at 62°C). A negative control with double-distilled water was included for each experiment. Relative mRNA expression using multiple reference genes was determined as previously described [38,39] and presented as weighed data with values obtained from wildtype tachyzoites arbitrarily set as one.

4.6. Statistics

Evaluations of both proteomic and gene expression data were performed based on three biological replicates per individual strain. Strains were compared with each other and not KO strains versus wildtype.

As in previously published studies [7,37], to enhance belief in our data, only proteins which were found differential between the strains by both LFQ and Top3 intensities were regarded as differentially expressed. Briefly [35], peptide forms were normalized by variance stabilization and imputed at this level to form the reported iTop3 values, whereas LFQ values were imputed at the protein group level to form the reported iLFQ values. In both cases, missing values were replaced by the maximum likelihood estimation method if there were otherwise more than 1 identification in the group of replicates. All other missing values were replaced by a random number from a Gaussian distribution of width 0.3 x sample standard deviation and centered at the sample distribution mean minus 2.8x or 2.5 x sample standard deviation, at peptide form and protein group level respectively. Differential expression was tested with the moderated t-test, and p-values corrected for multiple testing by the Benjamini-Hochberg method. A significance curve was calculated such that a minimum log2 fold change of 1 (absolute value) was required, and a maximum adjusted p-value of 0.05 as asymptotically high log2 fold change. Twenty cycles of imputation were performed, and protein groups consistently recorded as differentially expressed were flagged as protein groups of interest. Relative mRNA expression data were analyzed by ANOVA followed by multiple t-tests using the software package R [40]. P-values of less than 0.05 after Bonferroni adjustment were considered to be statistically significant.

Supplementary Materials

The following supporting information can be downloaded at the website of this paper posted on Preprints.org, Table S1: Complete proteome dataset of tachyzoites from TgRH wildtype and four SRS29B knock-out clones.; Table S2: iBAQ values and relative abundances of SRS proteins from tachyzoites of TgRH wildtype and four SRS29B knock-out clones; Table S3: iBAQ values and relative abundances of GRA, MIC, RON and ROP proteins from tachyzoites of TgRH wildtype and four SRS29B knock-out clones; Table S4: Quantitative reverse-transcriptase PCR transcript level data of four selected SRS genes. .

Author Contributions

Data curation, J.M., A.-C.U., S.B.-L. and M.H.; formal analysis, J.M. and S.B.-L.; funding acquisition, A.H.; investigation, K.H.., G.B., J.M., K.H., M.H.; methodology, J.M.,S.B.-L., M.H.; project administration, A.H.; resources, A.H.; software, A.-C.U., S.B.-L. and M.H.; validation, K.H., G. B..; visualization, D.I. writing—original draft, J.M.; writing—review and editing, J.M., G.B., K.H. and A.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Swiss National Science Foundation (grant No. 310030_184662 and 310030_214897) and the Uniscientia Foundation.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

All abbreviations are explained in the text. Please note that names of genes or open reading frames are written in capital, names of proteins in lowercase letters.

References

Dubey, J.P. Toxoplasmosis of animals and humans, 2nd ed.; CRC Press: Boca Raton, 2010; pp. xvii, 313 p., 314 p. of plates. [Google Scholar]
Howe, D.K.; Sibley, L.D. Toxoplasma gondii comprises three clonal lineages: Correlation of parasite genotype with human disease. J Infect Dis 1995, 172, 1561–1566. [Google Scholar] [CrossRef] [PubMed]
Darde, M.L. Toxoplasma gondii, "new" genotypes and virulence. Parasite 2008, 15, 366–371. [Google Scholar] [CrossRef] [PubMed]
Kim, K.; Weiss, L.M. Toxoplasma gondii: The model apicomplexan. Int J Parasitol 2004, 34, 423–432. [Google Scholar] [CrossRef] [PubMed]
Sidik, S.M.; Hackett, C.G.; Tran, F.; Westwood, N.J.; Lourido, S. Efficient genome engineering of Toxoplasma gondii using CRISPR/Cas9. PLoS ONE 2014, 9, e100450. [Google Scholar] [CrossRef] [PubMed]
Sidik, S.M.; Huet, D.; Ganesan, S.M.; Huynh, M.H.; Wang, T.; Nasamu, A.S.; Thiru, P.; Saeij, J.P.; Carruthers, V.B.; Niles, J.C.; et al. A Genome-wide CRISPR screen in Toxoplasma Identifies essential apicomplexan genes. Cell 2016, 166, 1423–1435.e1412. [Google Scholar] [CrossRef]
Heller, M.; Braga, S.; Müller, N.; Müller, J. Transfection with plasmid causing stable expression of a foreign gene affects general proteome pattern in Giardia lamblia trophozoites. Front Cell Infect Microbiol 2020, 10, 602756. [Google Scholar] [CrossRef]
Templeton, T.J. Whole-genome natural histories of apicomplexan surface proteins. Trends Parasitol 2007, 23, 205–212. [Google Scholar] [CrossRef]
Lekutis, C.; Ferguson, D.J.; Grigg, M.E.; Camps, M.; Boothroyd, J.C. Surface antigens of Toxoplasma gondii: Variations on a theme. Int J Parasitol 2001, 31, 1285–1292. [Google Scholar] [CrossRef]
Tomavo, S. The major surface proteins of Toxoplasma gondii: Structures and functions. Curr Top Microbiol Immunol 1996, 219, 45–54. [Google Scholar] [CrossRef]
Boothroyd, J.C.; Hehl, A.; Knoll, L.J.; Manger, I.D. The surface of Toxoplasma: More and less. Int J Parasitol 1998, 28, 3–9. [Google Scholar] [CrossRef]
Jung, C.; Lee, C.Y.; Grigg, M.E. The SRS superfamily of Toxoplasma surface proteins. Int J Parasitol 2004, 34, 285–296. [Google Scholar] [CrossRef] [PubMed]
Wasmuth, J.D.; Pszenny, V.; Haile, S.; Jansen, E.M.; Gast, A.T.; Sher, A.; Boyle, J.P.; Boulanger, M.J.; Parkinson, J.; Grigg, M.E. Integrated bioinformatic and targeted deletion analyses of the SRS gene superfamily identify SRS29C as a negative regulator of Toxoplasma virulence. MBio 2012, 3. [Google Scholar] [CrossRef] [PubMed]
Kasper, L.H. Isolation and characterization of a monoclonal anti-P30 antibody resistant mutant of Toxoplasma gondii. Parasite Immunol 1987, 9, 433–445. [Google Scholar] [CrossRef] [PubMed]
Hänggeli, K.P.A.; Hemphill, A.; Müller, N.; Schimanski, B.; Olias, P.; Müller, J.; Boubaker, G. Single- and duplex TaqMan-quantitative PCR for determining the copy numbers of integrated selection markers during site-specific mutagenesis in Toxoplasma gondii by CRISPR-Cas9. PLoS ONE 2022, 17, e0271011. [Google Scholar] [CrossRef] [PubMed]
Kim, S.K.; Karasov, A.; Boothroyd, J.C. Bradyzoite-specific surface antigen SRS9 plays a role in maintaining Toxoplasma gondii persistence in the brain and in host control of parasite replication in the intestine. Infect Immun 2007, 75, 1626–1634. [Google Scholar] [CrossRef] [PubMed]
Dzierszinski, F.; Mortuaire, M.; Cesbron-Delauw, M.F.; Tomavo, S. Targeted disruption of the glycosylphosphatidylinositol-anchored surface antigen SAG3 gene in Toxoplasma gondii decreases host cell adhesion and drastically reduces virulence in mice. Mol Microbiol 2000, 37, 574–582. [Google Scholar] [CrossRef] [PubMed]
Frenal, K.; Soldati-Favre, D. Plasticity and redundancy in proteins important for Toxoplasma invasion. PLoS Pathog 2015, 11, e1005069. [Google Scholar] [CrossRef]
Possenti, A.; Di Cristina, M.; Nicastro, C.; Lunghi, M.; Messina, V.; Piro, F.; Tramontana, L.; Cherchi, S.; Falchi, M.; Bertuccini, L.; et al. Functional characterization of the thrombospondin-related paralogous proteins rhoptry discharge factors 1 and 2 unveils phenotypic plasticity in Toxoplasma gondii rhoptry exocytosis. Front Microbiol 2022, 13, 899243. [Google Scholar] [CrossRef]
Theisen, T.C.; Boothroyd, J.C. Transcriptional signatures of clonally derived Toxoplasma tachyzoites reveal novel insights into the expression of a family of surface proteins. PLoS ONE 2022, 17, e0262374. [Google Scholar] [CrossRef]
Prucca, C.G.; Rivero, F.D.; Luján, H.D. Regulation of antigenic variation in Giardia lamblia. Annu Rev Microbiol 2011, 65, 611–630. [Google Scholar] [CrossRef]
Müller, J.; Braga, S.; Uldry, A.C.; Heller, M.; Müller, N. Comparative proteomics of three Giardia lamblia strains: Investigation of antigenic variation in the post-genomic era. Parasitology 2020, 147, 1008–1018. [Google Scholar] [CrossRef] [PubMed]
Müller, J.; Braga, S.; Heller, M.; Müller, N. Resistance formation to nitro drugs in Giardia lamblia: No common markers identified by comparative proteomics Int J Parasitol: Drugs Drug Resist 2019, 9, 112–119.
Pays, E.; Vanhamme, L.; Perez-Morga, D. Antigenic variation in Trypanosoma brucei: Facts, challenges and mysteries. Curr Opin Microbiol 2004, 7, 369–374. [Google Scholar] [CrossRef] [PubMed]
Stockdale, C.; Swiderski, M.R.; Barry, J.D.; McCulloch, R. Antigenic variation in Trypanosoma brucei: Joining the DOTs. PLoS Biol 2008, 6, e185. [Google Scholar] [CrossRef] [PubMed]
Jublot, D.; Cavailles, P.; Kamche, S.; Francisco, D.; Fontinha, D.; Prudencio, M.; Guichou, J.F.; Labesse, G.; Sereno, D.; Loeuillet, C. A histone deacetylase (HDAC) inhibitor with pleiotropic in vitro anti-Toxoplasma and anti-Plasmodium activities controls acute and chronic Toxoplasma infection in mice. Int J Mol Sci 2022, 23, 3254. [Google Scholar] [CrossRef] [PubMed]
Sullivan, W.J., Jr.; Hakimi, M.A. Histone mediated gene activation in Toxoplasma gondii. Mol Biochem Parasitol 2006, 148, 109–116. [Google Scholar] [CrossRef] [PubMed]
Nardelli, S.C.; Che, F.Y.; Silmon de Monerri, N.C.; Xiao, H.; Nieves, E.; Madrid-Aliste, C.; Angel, S.O.; Sullivan, W.J., Jr.; Angeletti, R.H.; Kim, K.; et al. The histone code of Toxoplasma gondii comprises conserved and unique posttranslational modifications. MBio 2013, 4, e00922-00913. [Google Scholar] [CrossRef] [PubMed]
Li, B. DNA double-strand breaks and telomeres play important roles in Trypanosoma brucei antigenic variation. Eukaryot Cell 2015, 14, 196–205. [Google Scholar] [CrossRef] [PubMed]
Li, B.; Zhao, Y. Regulation of antigenic variation by Trypanosoma brucei telomere proteins depends on their unique DNA binding activities. Pathogens 2021, 10, 967. [Google Scholar] [CrossRef]
Eraly, S.A. Striking differences between knockout and wild-type mice in global gene expression variability. PLoS ONE 2014, 9, e97734. [Google Scholar] [CrossRef]
Slavov, N. Learning from natural variation across the proteomes of single cells. PLoS Biol 2022, 20, e3001512. [Google Scholar] [CrossRef]
Westermann, L.; Li, Y.; Gocmen, B.; Niedermoser, M.; Rhein, K.; Jahn, J.; Cascante, I.; Scholer, F.; Moser, N.; Neubauer, B.; et al. Wildtype heterogeneity contributes to clonal variability in genome edited cells. Sci Rep 2022, 12, 18211. [Google Scholar] [CrossRef]
Winzer, P.; Müller, J.; Aguado-Martinez, A.; Rahman, M.; Balmer, V.; Manser, V.; Ortega-Mora, L.M.; Ojo, K.K.; Fan, E.; Maly, D.J.; et al. In vitro and in vivo effects of the bumped kinase inhibitor 1294 in the related cyst-forming apicomplexans Toxoplasma gondii and Neospora caninum. Antimicrob Agents Chemother 2015, 59, 6361–6374. [Google Scholar] [CrossRef] [PubMed]
Cox, J.; Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 2008, 26, 1367–1372. [Google Scholar] [CrossRef] [PubMed]
Uldry, A.C.; Maciel-Dominguez, A.; Jornod, M.; Buchs, N.; Braga-Lagache, S.; Brodard, J.; Jankovic, J.; Bonadies, N.; Heller, M. Effect of sample transportation on the proteome of human circulating blood extracellular vesicles. Int J Mol Sci 2022, 23, 4515. [Google Scholar] [CrossRef] [PubMed]
Müller, J.; Schlange, C.; Heller, M.; Uldry, A.C.; Braga-Lagache, S.; Haynes, R.K.; Hemphill, A. Proteomic characterization of Toxoplasma gondii ME49 derived strains resistant to the artemisinin derivatives artemiside and artemisone implies potential mode of action independent of ROS formation. Int J Parasitol Drugs Drug Resist 2022, 21, 1–12. [Google Scholar] [CrossRef]
Vandesompele, J.; De Preter, K.; Pattyn, F.; Poppe, B.; Van Roy, N.; De Paepe, A.; Speleman, F. Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol 2002, 3, RESEARCH0034. [Google Scholar] [CrossRef]
Hellemans, J.; Mortier, G.; De Paepe, A.; Speleman, F.; Vandesompele, J. qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data. Genome Biol 2007, 8, R19. [Google Scholar] [CrossRef]
R_Core_Team R: A Language and Environment for Statistical Computing, Vienna, Austria, 2012.

Figure 1. Principal component analysis of proteome data set of TgRH_SRS29B KO clones and their corresponding WT Tg RH. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Methods”. Principal components of the iTop3 (left panel) and iLFQ (right panel) datasets are presented. For each strain, three biological replicates are shown. C, clone; PC, principal component; WT, wildtype.

Figure 2. The protein srs29B encoded by open reading frame (ORF) TGRH88_077450 is lacking in the respective KO clones. A, protein sequence of srs29B with peptides detected in WT samples highlighted in red. LFQ (black) and Top3 (white) values corresponding to ORF 077450 and ORF 077470 (srs29c). Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Material and Methods”. Mean values ± standard deviation correspond to three biological replicates.

Figure 3. Abundances of the six proteins with differential expression between WT and KO clones designated by their corresponding TgRH88 ORFs. Annotations and significant differences between strains are indicated in Table 1 and Table 2. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Materials and Methods”. LFQ (black) and Top3 (white) mean values ± standard deviation correspond to three biological replicates.

Figure 4. Relative abundances of SRS proteins in WT and KO clones designed by their names and the corresponding TgRH88 ORFs. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Materials and Methods”. The percentage values were calculated based on the iBAQ values of the respective proteins as listed in Table S2., 100 % being the total amount of SRS proteins for each strain. The numbers correspond to SRS proteins detected in at least two biological replicates in each strain.

Figure 5. Expected vs. observed ranking of relative abundances of SRS proteins in tachyzoites of WT and SRS29B KO clones. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Materials and Methods”. The relative abundances were calculated based on the iBAQ values of the respective proteins and ranked from 1 (the most abundant protein within a class) to n (the least abundant protein). The values are listed in Table S2, The expected ranking was the value of the WT (black line). To avoid distortions, Sag1 (srs29B; ORF 077450), the most abundant SRS protein in the wildtype, was omitted from the plot. Therefore, the plot of the WT ranking starts with 2. Without variation in rankings, all points together would form a straight line.

Figure 6. Expected vs. observed ranking of relative abundances of members of different classes of proteins of WT and SRS29B KO clones. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Materials and Methods”. The relative abundances were calculated based on the iBAQ values of the respective proteins and ranked from 1 (the most abundant protein within a class) to n (the least abundant protein). The values are listed in Table S3, The expected ranking was the value of the WT (black line). GRA, dense granule proteins; MIC, microneme proteins; ROP, rhoptry proteins; RON, rhoptry neck proteins. Without variation in rankings, all points together would form a straight line.

Figure 7. (A) mRNA expression levels of genes encoding four selected SRS proteins. Transcript levels were determined by multiplex TaqMan – qPCR. Bars represents the mean of three biological replicates ± standard deviation. Values superscribed by the same letters are not significantly different from each other (ANOVA, followed by multiple t-tests; p<0.05 after Bonferroni adjustment). The complete dataset including exact p values is presented as table S4.

Table 1. List of proteins differentially expressed (DE) between TgRH_SRS29B KO clones and their corresponding WT TgRH. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Materials and Methods”. Only proteins exhibiting significantly different expression levels by both Top3 and LFQ were considered. “WT/C up” indicates that the DE protein is significantly higher expressed in the WT than in the respective clone, while “down” indicates lower expression levels of DE in the WT. DE proteins with different levels also between clones are highlighted in italics.

GN	Annotation	WT/C6	WT/C18	WT/C30	WT/C33
TGRH88_003360	Unspecified product (putative transmembrane protein)	down		down
TGRH88_006930	SAG-related sequence srs16C				up
TGRH88_012840	Unspecified product (putative transmembrane protein)	up
TGRH88_016110	SAG-related sequence srs36D	up	up		up
TGRH88_017250	Unspecified product	up	up		up
TGRH88_021190	cAMP-dependent protein kinase	down
TGRH88_029080	Gamma-glutamyl hydrolase	up
TGRH88_029100	Unspecified product	down
TGRH88_036500	Multi-pass transmembrane protein	down			down
TGRH88_042260	Unspecified product (putative transmembrane protein)	up			up
TGRH88_043760	Putative translation elongation and release factors (GTPase)	down	down
TGRH88_046380	Histone lysine methyltransferase set1		down	down
TGRH88_055240	Putative myosin heavy chain	up
TGRH88_062920	Bifunctional dihydrofolate reductase - thymidylate synthase		down		down
TGRH88_073870	MaoC family domain-containing protein	up
TGRH88_077450	SAG-related sequence srs29B	up	up	up	up

Table 2. List of proteins differentially expressed (DE) between TgRH_SRS29B KO clones. Whole-cell shotgun LC-MS/MS and analysis of the data were performed as described in “Materials and Methods”. Only proteins with significantly different levels by both Top3 and LFQ were considered as DE. X/Y up means DE protein level significantly higher in X than in Y. DE proteins with different levels also between clones are highlighted in italics.

GN	Annotation	C6/ C18	C6/C30	C6/ C33	C18/C30	C18/C33	C30/ C33
TGRH88_004120	PRELI family protein					up
TGRH88_006930	SAG-related sequence srs16C					up	up
TGRH88_008350	Unspecified product ( putative iron-sulfur cluster assembly accessory protein)	down	down
TGRH88_009830	SAG-related sequence srs19F	down
TGRH88_012840	Unspecified product(putative transmembrane protein)	down	down
TGRH88_014370	Cathepsin cpc1	down				up
TGRH88_016110	SAG-related sequence srs36D	down	down		down		up
TGRH88_016870	Unspecified product			up			up
TGRH88_017250	Unspecified product		down		down
TGRH88_021190	cAMP-dependent protein kinase	up	up	up
TGRH88_046380	Histone lysine methyltransferase set1	down	down
TGRH88_076630	Unspecified product	down

Table 3. List of primes and probes for TaqMan-quantitative reverse transcriptase PCR used in this study. BHQ, black hole quencher; Cy5, cyanine 5; FAM, 6-carboxyfluorescein; HEX, hexachlorofluorescein.

Gene ID (ToxoDB)	Gene name	Forward primer (5’-3’)	Reverse primer (5’-3’)	Taqman - hydrolysis probe
TGRH88_014440	Glyceraldehyde-3-phosphate dehydro-genase GAPDH1	TGGAGGTTTTGGCGAT	ATGGCAGTTGGCTCCTT	HEX-TACCCCGGC-GAAGTCAGC-BHQ
TGRH88_016080	SAG-related sequence SRS36B (SAG 5D)	TCAGAGGTCACCCGAGT	TCGTGGTGTCCTGGTTAC	FAM-CACCATCCAG-GTGATCCAGCC-BHQ
TGRH88_016110	SAG-related sequence SRS36D (SAG 5C)	TGTATGGCAAGCGACAA	TGTGCACACCTCAGTTAGTG	FAM-ACGGGCAG-AGTGACAGCGCA-BHQ
TGRH88_040330	SAG-related sequence SRS46	TGCAGACGTACCGACAGT	TCTGCGTCGACGAGTG	FAM-TCTTCGCTG-CGGCAAAAACTT-BHQ
TGRH88_055140	Alpha tubulin TUBA1	GACGCCTTCAACACCTTC	TTGTTCGCAGCATCCT	Cy5-TTACCGCCA-CCTGTTCCACCC-BHQ
TGRH88_057630	SAG-related sequence SRS57 (SAG 3)	TGCGATCTTGGGAAC	TCAGGGTGCTCTTTGTTC	FAM-TGTTCGTCG-CCGCAGGGA-BHQ

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Comparative Proteomic Analysis of Toxoplasma gondii RH Wild-Type and Four SRS29B (SAG1) Knock-Out Clones Reveals Significant Differences between Individual Strains

Abstract

Keywords:

Subject:

1. Introduction

2. Results

3. Discussion

4. Materials and Methods

4.1. Chemicals

4.2. Host cell culture, maintenance and purification of tachyzoites

4.3. Proteomics

4.4. Data processing and statistics

4.5. Quantification of transcripts by multiplex TaqMan – qPCR

4.6. Statistics

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

MDPI Initiatives

Important Links

Subscribe