1. Introduction
The clinical manifestation of Myalgic Encephalomyelitis/Chronic Fatigue Syndrome (ME/CFS) is typically physical or mental post-exertional malaise, a persistent fatigue that is not alleviated by rest, together with other symptoms related to neurologic, autonomic, immunologic systems [
1]. Several pathological mechanisms have been proposed to explain the origin of the disease and its progression over time [
1,
2,
3,
4,
5]. Among these mechanisms, deleterious autoimmunity mostly driven by viruses is gaining traction in the literature [
6,
7,
8]. SARS-CoV-2 is the newest causative agent of ME/CFS given that long-COVID patients might also receive a diagnosis of this disease [
9,
10,
11,
12]. However, the Epstein-Barr virus (EBV) among other herpesviruses remain the usual culprits of ME/CFS and now of long-COVID [
13,
14,
15,
16]. EBV is a particular interesting virus given its strong potential for antigenic mimicry with human proteins, especially, the EBNA1 protein that contains highly repetitive glycine-alanine motifs [
17,
18]. This potential of eliciting autoimmunity motivated serological investigations in patients with ME/CFS to identify key pathological EBV antigens and peptides [
19,
20,
21,
22]. However, these efforts did not lead to the identification of specific anti-EBV antibodies with a high accuracy of distinguishing patients with ME/CFS from heathy controls (HCs).
These disappointing findings could be simply explained by disease heterogeneity, selection bias, and misdiagnosis [
23,
24]. An alternative explanation is an inadequate choice of antibodies under analysis [
25]. To overcome this limitation, a recent study performed a large screening of IgG antibody responses to more 3,000 EBV peptides in patients with ME/CFS and HCs [
26]. In a subsequent study on the same data, antibody responses to two peptides were identified as candidate biomarkers for the subgroup of ME/CFS patients whose disease started with an infection. Notwithstanding providing the basis for a good disease diagnostic tool, these antibody responses were included in simple statistical models based on linear functions between covariates and disease status. Therefore, previous study was likely to fail detecting alternative antibody responses with more complex statistical relationships with disease status. Also, the same study did not evaluate eventual problems related to data overfitting.
The present paper aimed at re-analyzing the same dataset with the objective of using a machine learning approach where the identified analytical limitations could be tackled. We also took the chance of re-evaluating the role of molecular mimicry between EBV and human antigens on ME/CFS.
4. Discussion
This paper demonstrated the difficulty of finding anti-EBV antibodies that could be used as general markers of ME/CFS. Such a difficulty was clear when analyzing the whole cohort of ME/CFS patients due to the subset of patients who did not know their disease trigger or report a non-infectious disease trigger. Similar difficulty was obtained in studies on IgG antibodies against common pathogens [
41] or multiple peptides derived from different herpesviruses [
19].
In the present work, the best-case scenario was obtained for the patients with an infectious disease trigger where a set of 26 EBV-related antibodies led to a good accuracy in both train and test datasets. This finding on this subgroup of ME/CFS patients is not surprising given that our previous study also led to a similar conclusion using a different statistical methodology [
27]. Overall, the present study reinforces the idea that patients with ME/CFS are very heterogenous as a clinical group and, as such, the quest of discovering a generic disease marker applicable to all patients is likely to fail. However, this problem can be surpassed by using an appropriate stratification (e.g., based on the hypothetical disease trigger) where specific disease biomarkers can emerge, as illustrated in this work.
Six of the selected 26 antibodies were associated with EBNA1_0005, EBNA3_0577, EBNA4_0566, EBNA6_0025, EBNA6_0488 EBNA6_0752 peptides. Antibodies against these antigens were found elevated in patients with multiple sclerosis (MS) [
42]. Such a result is another piece of evidence that some patients with ME/CFS and MS share not only symptoms [
43,
44], but also serological alterations possibly linked to EBV [
45]. In this line of thought, one could argue that rituximab that depletes the B-cell population and used in MS patients could be beneficial for this specific cohort of ME/CFS patients [
2]. Unfortunately, the only large-scale clinical trial on the use of this drug in ME/CFS patients led to disappointed results [
46]. A likely explanation for this finding is not targeting the specific cohort of ME/CFS patients with an infectious disease trigger .
Among the peptides recognized by the selected antibodies, EBNA6_0070 had the highest sequence homology with a human protein. This antigen was already discovered and amply discussed in our previous study [
27]. The replication of this finding using a different statistical approach provided further support for the hypothetical role of this peptide in ME/CFS. However, in contrast with our previous work, antibodies against this candidate antigen were not on the top most important antibodies for disease prediction. This result suggests that the potential pathological effect of this EBV antigen via molecular mimicry might not be as straightforward as our initial study suggested. To resolve this question, one could perform functional studies where purified antibodies against this EBV peptide, when transferred to a recipient animal model, could or not elicit ME/CFS-like disease.
The EBNA6_0488 peptide had the second highest sequence homology with human proteins. This homology was related to two possible human 10-mer peptides belonging to CTCF and AEBP1. The former protein is a master transcription factor due to its more than 50,000 possible binding sites and its role as a chromatin barrier element [
47,
48]. Besides that, the level of this transcription factor is inversely correlated with the levels of DNA methylation [
48]. CTCF in partnership with cohesion molecules is also important in many immunological pathways, such as interferon gamma production in Th1 cells and establishment and maintenance of regulatory T cells in visceral adipose tissue and skeletal muscle [
49,
50]. In this scenario, we speculate that the increased quantity of antibodies against EBNA6_0488 results in a cross-reactive antibody response to the CTCF peptide, thus, reducing the abundance of this transcription factor. This putative reduction could lead to altered gene expression and DNA methylation patterns, and abnormal immunological processes, including the maintenance of deleterious autoimmunity in check. This speculation is in line with findings from altered gene expression and DNA methylation profiles in ME/CFS patients [
51,
52,
53,
54]. As an extreme case, one study identified more than 12,000 CpG sites with altered DNA methylation levels in patients with ME/CFS compared to HCs [
55]. Immunological abnormalities are also reported by many studies in ME/CFS patients (reviewed in ref. [
7,
56]). An alternative interpretation is that the increased levels of antibodies against EBNA6_0488 resulted from a putative CTCF overexpression during the disease progression in the cohort of ME/CFS patients with an infectious disease trigger. An overexpression of this transcription factor could be the result of a stress-induced response to restore homeostatic equilibrium within cells. However, altered gene expression was not reported for CTCF by any gene expression studies published so far. This negative reporting could be explained by not performing any patient’s stratification when analyzing data from these studies.
With respect to AEBP1, this protein is a ubiquitous transcriptional repressor involved in the regulation of adipogenesis, mammary gland development, inflammation, macrophage cholesterol homeostasis, and atherogenesis [
57]. Interestingly, mutations on AEBP1-encoding gene were implicated with the onset of Ehlers-Danlos syndrome (EDS) [
58,
59]. Patients with EDS hypermobility-type can also receive a diagnosis of ME/CFS [
60]. On the other hand, patients with a diagnosis of ME/CFS also show EDS as a co-morbidity [
61]. In fact, the presence of EDS in a suspected case of ME/CFS has not been considered as an exclusionary condition for the respective disease diagnosis [
62]. However, genome-wide association studies of ME/CFS did not report any genetic markers located on the AEBP1 gene [
63,
64,
65,
66,
67]. In this scenario, antibody responses to EBNA6_0488 with the potential of being cross-reactive with AEBP1 should alter the regulation of biological processes where this protein is involved. In particular, the deficient regulation of inflammatory processes is particularly relevant for ME/CFS given the general idea that established ME/CFS translates into a persistent low-grade inflammatory process in patients [
5]. Given that endothelial disfunction is also observed in patients with ME/CFS [
68,
69,
70], such a disfunction could result from damaged endothelial cells via persistent low-grade inflammation in response to EBNA6_0488 mimicking an AEBP1 peptide. Hence, the identification of this molecular mimicry brings an unexpected link between EBV and AEBP1. As alluded above for CTCF, current gene expression studies did not highlight AEBP1 at the top of the proteins with the most significant differential abundance between patients with ME/CFS and HCs. The lack of patient’s stratification is once again a possible reason to not detect an altered abundance of AEBP1 in ME/CFS patients when compared to HCs.
Interestingly, the maximum sequence homology of the peptides recognized by the 26 selected antibodies and human proteins was not associated with the importance of the same antibodies in disease prediction. Moreover, antibodies against EBNA1_430 that contains a peptide mimicking a peptide from the human Anoctamin-2 protein [
39,
40] had a low importance in predicting ME/CFS patients. These results suggest that eventual molecular mimicries due to antibody reactivity between EBV and human antigens has a minor role in the underlying pathological mechanism in such a subset of patients. However, we cannot rule out the possibility of an eventual mimicry based on three-dimensional molecular structure of the respective peptides, but not at the level of their amino-acid sequence. We cannot also rule out that molecular mimicries based on sequence homology might be elicited by peptides from EBV proteins other than the ones evaluated in this study. This might be the case of two EBV peptides from BPLF1 and BHRF1 proteins that were able to elicit an immune response by self-reactive T cell-clones derived from patients with MS [
71].
An interesting perspective of the above results can be given by the so-called danger theory [
72]. The theory is based on the premise that the immune system is activated by danger or damage signals sent by infected (or stressed) cells to the immune system. These danger signals are independent of the intrinsic nature of antigens (self or non-self) seen by the immune system. As a corollary, autoimmune responses and autoimmune diseases are then understood as unintended consequences of persistent danger signals that ultimately include chronic presentation of multiple self and nonself antigens. This explains why chronic and low-grade infections by herpesviruses are among the most documented triggers of autoimmune diseases. In this scenario, the theory predicts exactly the lack of correlation between importance of the selected antibodies in predicting ME/CFS and the degree of molecular mimicry between the EBV peptides and human proteins.
The basic question of applying the danger theory to ME/CFS pathogenesis lies upon understanding which danger signals are at the core of the disease. According to the original proponents of the danger theory, general danger signals are the heat shock proteins (HSP), the vasoactive intestinal polypeptide (VIP), the cytokines TNFα and IL1β, among others [
73]. A brief discussion about some of these danger signals in the context of ME/CFS is given below; a more comprehensive discussion of this topic will be done in a near future.
HSPs are highly conservative proteins in nature and they are produced in response to many different cellular stresses. In theory, antigens derived from these proteins were thought to belong to the so-called immunological homunculus, a limited set of dominant self antigens that allow the immune system to have picture of the self [
74]. However, there is no consensus whether HSPs are indeed signaling danger or simply key regulatory and resolution elements of a stress or immune response [
72,
75]. This alternative interpretation of the functional role of HSPs might explain the lack of consistency in HSP-related responses across studies where patients with ME/CFS and HCs were challenged with physical exercise [
76,
77,
78]. In addition, antibodies against endogenous and microbial HSP65 peptides were at the same level in patients with ME/CFS and HCs with the exception of a higher seroprevalence to a HSP65 peptide derived from
Chlamydia pneumoniae in the former [
79].
VIP is a neuromodulator present in the gut and the anterior chamber of the eye [
73]. On the one hand, it has the capacity of activating dendritic cells [
80] (thus, its suggestion as a possible danger signal). On the other hand, the binding of VIP to its receptor in immune cells also leads to anti-inflammatory actions [
81]. In this line of thought, a loss of tolerance to VIP, other vasoactive neuropeptides or their receptors were hypothesized to be at the genesis of ME/CFS [
82]. However, a follow-up study showed an elevated expression of VPACR2 – the VIP receptor – in immune cells and an increased frequency of the regulatory Foxp3+CD4+ T cells in ME/CFS patients in comparison to HCs [
83]. Given the capacity of VIP inducing the generation of these regulatory cells [
84], this finding is more in line with VIP as a regulation mediator in the context of ME/CFS.
TNFα and IL1β are the two classical pro-inflammatory cytokines. According to a systematic review [
85], 20%-25% of the studies reported elevated levels of these cytokines in patients with ME/CFS when compared to HCs. However, the same systematic review did not perform a meta-analysis of the published data. Therefore, it is unclear whether the lack of significant findings related to these two cytokines is a consequence of insufficient statistical power due to reduced sample sizes used in the respective studies. It is worth noting that patients with ME/CFS from Italy had a higher frequency of an allele variant associated with elevated levels of TNFα (rs1800629:G>A) [
86]. However, this finding was not replicated by another study with German patients [
87].
In summary, this study provided a list of possible EBV peptides whose associated IgG antibody responses could be used in the diagnosis of suspected ME/CFS cases who reported an infection at their symptom’s onset. Two of these peptides had high sequence homologies with human proteins, but the corresponding antibody responses were not the most important ones for disease prediction. This finding suggested that the role of EBV on eventual ME/CFS-related autoimmunity should be reconsidered under alternative theories about the self-nonself discrimination problem.