4. Discussion
Our results demonstrate that, in the majority of cases (92%), GC can be classified into one of five molecular subtypes. We did not find GC EBV(+) that were also dMMR and this is consistent with other published data [
28,
29]. A hierarchical approach is necessary, since a proportion of GC dMMR is also p53m (38%) and some dMMR cases (12%) have also aberrant expression of E-cad or β-cat. This hierarchical approach also proved useful in cases with test failure, since we were still able to reach a specific diagnostic category in 40% of such cases. GC NOS/indeterminate represents 8% of our total; this was due to test failure for MLH1 (two cases), E-cad (three cases) and β-cat (one case).
In our cohort, GC EBV(+) has a prevalence of 6%. This compares favorably with Setia et al. (2016), who found EBV(+) in 5% of their cases, with the TCGA (9%) and with Ramos
et al. (10%) [
2,
5,
8]. To determine the EBV status, we chose EBER ISH. In the context of a classification for widespread adoption, the use of IHC may be more desirable, however, the sensitivity (44%) and specificity (93%) of IHC for the latent membrane protein 1 (LMP1) compares unfavorably with EBER ISH (sensitivity 94% and specificity of 69%) [
30]. Other techniques, such as sequencing, microRNA and droplet digital PCR may be more specific but EBER ISH is considered the gold standard for detecting and localising latent EBV in FFPE tissue [
31,
32,
33]. The role of EBV in GC remains uncertain and it is possible that it has an involvement in a larger number of GC cases through a “hit and run” mechanism [
34]. What has been clear from the comprehensive molecular assessments of the TCGA is that in a smaller group of GC patients, the tumour retains EBV-related molecular pathways which correlate with specific outcomes and response to treatment [
35,
36]. For these reasons, the use of EBER-ISH is appropriate to recognise these cases.
Systematic sequencing in different tumour sites (for instance endometrium and colorectum) has shown presence of independent molecular pathways with broad commonalities such chromosomal instability (CIN), CpG island methylator phenotype (CIMP) and defects of double stranded DNA repair (MMR deficiency) [
37]. In addition, a smaller proportion of cases seem to have oncogenic drivers specific to the tumour site. For instance, HER2 amplification in breast cancer or POL-E mutation in endometrial carcinoma [
38,
39]. In this context, GC has a small number of cases where EBV appears to be the oncogenic driver; these tumours have extremely high CIMP and are molecularly, genetically and epigenetically distinct form all other types [
40]. Unlike the ACRG study, the TCGA group recognised this as unique category with good prognosis, distinct pathological features and an association with good response to immune checkpoint therapy [
41,
42]. We believe that GC EBV(+) should be recognised as a distinct category as the first step in the hierarchical approach.
The proportion of dMMR tumours in our cohort is 20%. Similar proportions were found in other studies [
2,
3,
5,
6,
7,
8], with prevalence ranging between 16% (Setia et al.) and 24% (Ahn et al.). In our study, the predominant cause of dMMR was loss of MLH1 (94%), followed by MSH2 (6%). This is concordant with published data, which identifies hypermethylation of MLH1 promoter as the most common mechanism of dMMR in GC, MSH2 mutation being present only in a minority of cases [
43].
Absence of IHC staining for the MMR enzymes MLH1, PMS2, MSH2 and MSH6 is widely accepted as a surrogate marker of mismatch repair status and correlates well with previously used surrogate markers such as microsatellite instability and sequencing [
18]. While in our diagnostic routine we assess MMR status using all four IHC makers, for this work we used only two of the biomarkers (MLH1 and MSH2), principally in order to save tissue sections. Since MLH1 promoter methylation and germline mutation in MLH1 and MSH2 are the most frequent causes of MMR deficiency in gastric cancer, some institutes use only MLH1 and MSH2 also in the diagnostic routine [
44]. In our experience, the use of all four markers aids interpretation and we would not advocate implementing GC classification using only two of these biomarkers. For GC dMMR may be desirable to perform MLH1 promoter methylation studies to help distinguish syndromic patients from sporadic cases; this distinction is not part of our molecular classification. Within the GC dMMR group, we identified two cases (12%) that showed also aberrant expression of E-cad or β-cat. Similar cases were also described in the cohort assessed by Setia et al.[
5].
In this study, GC EMT represents 14% of the cohort. This category comprises all cases with aberrant expression of E-cad and/or β-cat. The majority of these cases (10/11 or 91%) are of Laurén diffuse type. Prevalence of GC EMT is slightly higher than that found by Ramos et al. (9%) [
8] but similar to that that found in the ACRG study (15%) [
3,
4] and in the study of Ahn et al. (15%) [
6]. However, it is much lower than the incidence found in other studies which ranged between 20% and 29% [
2,
5,
7]. One case in our GC EMT cohort was of Laurén intestinal type. On review, this case shows tubular morphology. There are reports of tubular carcinoma of the stomach with aberrant E-cad expression [
6,
7,
8]. It is difficult to argue that tubular carcinoma shows morphological evidence of EMT, since it has well developed epithelial structures nevertheless, in comprehensive genomic studies, it has gene expression profiles more similar to tumours with more classic EMT [
3,
4]. This may signify that GC EMT may not be the most appropriate term to identify this group of tumours.
In order to identify cases of GC EMT, we used β-cat in addition to E-cad. Others who attempted molecular classification of GC with on-slide biomarkers limited testing to E-cad only [
5,
6,
7,
8]. EMT is linked to loss of cell-to-cell adhesion and this is in most cases due to a defect in E-cad. A prototype tumour with EMT is lobular carcinoma of the breast (LBC), which can be associated with sporadic or familiar defects in CDH1, the gene encoding for E-cad. Nevertheless, in a small proportion (10-15%) of LBC, the defect lies in accessory molecules of the cadherin-catenin complex which includes α-catenin, β-catenin and p120 catenin [
45]. The addition of IHC for β-cat has proven helpful in understanding loss of cell-to-cell adhesion [
46,
47] and both markers are now used routinely in breast pathology, providing more accurate diagnosis of LBC [
48,
49].
There are strong similarities between LBC and diffuse GC. A proportion of diffuse GC is hereditary and part of a rare autosomal dominant syndrome that was first described in 1998 and is characterised by increased risk of diffuse GC and LBC [
50]. The most common underlying defect in this syndrome is mutation in CDH1 gene, followed by mutation in the CTNNA1 gene that encodes for α-catenin [
51]. Alpha catenin defects result in destabilisation of the cadherin-catenin complex with increases degradation of these molecules and leads to abnormal localisation of β-cat [
52,
53]. For these reasons we added β-cat to our panel. Others have shown that aberrant expression of β-cat is linked with defective cadherin-catenin complex also in GC but this marker is yet to be in routine use [
54].
In our study, 64% of GC EMT (7/11) show loss of both E-cad and β-cat expression, 18% (2/11) show only aberrant E-cad expression and in the remaining 18% (2/11) E-cad is indeterminate while β-cat show aberrant expression. Our experience supports published evidence that the assessment of EMT is greatly facilitated by the use of both markers [
55]. In addition, a promising new immunotherapy targeting, CLDN18.2, appears particularly effective in diffuse GC. CLDN18.2 is a constituent of tight junctions and becomes accessible to the immune system when tight junctions are not functional, as in the case of GC EMT. Zolbetuximab, a humanized monoclonal antibody, selectively binds CLDN18.2 on tumour cells and mediates antibody-dependent cell-mediated cytotoxicity (ADCC); this treatment is more effective in tumours with higher expression of CLDN18.2 [
56,
57,
58]. The correct identification of GC EMT is important not only because of its associated with poorer prognosis but also for the selection of specific immune therapy.
CLDN18.2 has potential to become an important predictive biomarker in view of its effectiveness in clinical trials [
56,
57,
58]. There is some uncertainty over the threshold for positivity. Some trials considered a case positive for CLDN18.2 if there was moderate or strong intensity in ≥40% TC, others in ≥70% TC or in ≥ 75% TC [
59]. For the purpose of this analysis we considered a threshold for positivity ≥40%. It is possible that some of the molecular subgroups may have a better response to this new drug, nevertheless this requires further work.
GC p53m represents 23% of our cohort, which is lower than that found by Ramos et al. (35%) and the ACRG (36%). It is worth noting that in both studies there is no indeterminate group. If the p53m cases of the GC NOS group are added to the GCp53m group, the total (27%) approaches that of these two studies [
3,
8]. In our study, the proportion of p53m cases is considerably lower than those in the work of Setia et al. (51%) and Ahn et al. (49%) [
5,
6].
The number of cases in the GC p53wt category in our study (29%) compares well with that of Ramos and colleagues (29%), Ahn et al. (21%) and the ACRG study (26%) but is very different to that reported by Setia et al. (7%) [
3,
5,
6,
8]. These differences are not surprising, since p53 mutation is assessed by IHC in three of these studies and relies on sequencing in the ACRG study. In addition, staining parameters and interpretative algorithms are different, and these are known to impact significantly on results. For example, both Setia et al. and Ahn et al. did not recognise cytoplasmic staining as aberrant expression linked to mutation. We used more recent guidelines which have been developed for p53 assessment in endometrial carcinoma [
21].
Eight of our cases (10%) are Her2 positive (IHC 3+ and DDISH amplified). Other studies reported a prevalence (between 3% and 5%) [
3,
6]. Our study is small but we found no association with particular molecular subtypes. However, all cases were of Laurén intestinal type, confirming previous observations [
5,
6,
7,
8] and highlighting the important of the Laurén model to determine further downstream tests.
We found a total of 16 PD-L1 positive cases in our cohort. These results should be interpreted with caution as PD-L1 staining in tumour tissue can be heterogeneous and smaller samples (such as those in our TMAs) may have significant sample bias. In addition, we chose a positivity threshold of CPS 5 in view of the approval of nivolumab in GC. There is a second clinical threshold for GC at CPS 10 for the use of pembrolizumab. Our study showed that the GC EBV(+) is enriched with PD-L1 positive cases (40% vs 21% in GC EBV(-)) and this is in keeping with other published data [
2,
5,
6]. Some authors have linked this to high numbers of lymphoid cells in the stroma of EBV(+) GC [
60]. In our study, the majority (81%) of PD-L1(+) GC is of the Laurén intestinal type and only 3/16 cases (19%), signifying no preferential expression in either subgroups. There is a variation of percentage of PD-L1 overexpression in the literature, ranging between 28-65% for the intestinal type and between 19-54% for the diffuse-type. Comparison is difficult because of the different thresholds for positivity that have been used [
61,
62,
63,
64].
This study shows that a molecular classification is possible to implement using existing histopathology resources and expertise. While this study has a very small number of cases and further work using larger cohorts is needed, the correlation between molecular subgroups and prevalence of key predictive oncology biomarkers is apparent. The clinical value of such association in the context of busy clinical units cannot be overlooked, especially when some of these predictive biomarkers are send away tests with long turnaround times. Knowledge of the molecular subgroup and hence an understanding of the prognosis as well as the prevalence of specific predictive tests, would greatly assist the multi disciplinary team (MDT) discussions and aid treatment decision making while awaiting the result of predictive tests.
There is limited data on the biological behaviour of each of the molecular subtypes of GC. An understanding of prognosis is difficult in consideration of the fact that some of these subgoups are enriched for responders to immunotherapy. The relative aggressiveness of each subgroup has been estimated from published papers [
2,
3,
4,
5,
6,
8] and we have attempted to collate available data in
Figure 4, which provides an indication on the aggressiveness of each subtype. This picture may change when considering relative abundance of responders in each category and the effectiveness of existing biological therapy (for instance herceptin, pembrolizumab and nivolumab) [
24,
65,
66] and new biological drugs (for instance zolbetuximab) [
56,
57,
58].
This study has a number of limitations. We have no outcome data for our cohort of patients therefore assumption on the prognostic significance of these categories are inferred from other studies. The study uses TMAs which may be suboptimal when assessing heterogenous tumours and the relatively small cohort means that some subgroups have very low number of events. The tissue used might have been affected from suboptimal pre-analytics, since these are derived from gastric resection specimens where fixation and processing was not controlled. Finally, the study was not designed to assess accuracy of interpretation of the biomarkers used.