1. Introduction
Cancer heterogeneity refers to the diversity of cancer cells within a tumor as well as the differences between tumors in different individuals [
1,
2]. It is a fundamental characteristic of cancer and can have important implications for diagnosis, treatment, and prognosis. Tumor heterogeneity includes genetic mutations, epigenetic changes, and cellular level signaling pathway differences [
3,
4]. This causes subpopulations of cancer cells within a tumor with different characteristics, such as different growth rates, treatment response, and metastasis potential [
5]. The presence of heterogeneity can make developing effective cancer therapies challenging, as effective treatments against one cancer cell subpopulation may be ineffective against others. Breast cancer tumors have intratumor heterogeneity (ITH) and tumors with high ITH have demonstrated therapeutic resistance [
6]. Also, Human epidermal growth factor receptor 2 (HER2) expression can be heterogeneous in breast cancer tumors. HER2 is a tyrosine kinase receptor which is encoded by the ERBB2 gene as a cancerogenic gene. HER2 is an orphan receptor characterized by a constitutively activated conformation and lacks specific ligands or ligand binding activity. The formation of HER2 homodimers leads to phosphorylation of the tyrosine kinase domain, thereby activating various downstream cancer signaling pathways, ultimately enhancing cancer cell proliferation, tumor formation, invasion, and related processes [
7,
8].
HER2-positive breast cancer is more aggressive than other breast cancer types, but it can be treated with targeted therapies such as trastuzumab and pertuzumab, which specifically target the HER2 protein [
9,
10]. Overall, the prognosis of patients with HER2-positive early breast cancer has significantly improved in recent years due to advances in targeted therapies. Currently, the primary application of HER2 assessment lies in the improvement for patient survival to anti-HER2 therapy in the neoadjuvant and adjuvant settings. HER2 heterogeneity is defined by the coexistence of at least two distinct cellular clones with differing HER2 statuses within the same tumor, and the heterogeneity is the coexistence of different tumor cell subpopulations with varying levels of HER2 protein expression [
11]. The definition of HER2-positive cancer is typically determined primarily immunohistochemistry (IHC) and in situ hybridization (ISH). ISH involves the simultaneous detection of HER2 gene amplification and chromosome 17 (CEP17) to assess HER2 gene status in tumor tissue. If the IHC test reveals a score of 3+, indicating strong HER2 protein expression, or if the IHC test shows a score of 2+ along with confirmed HER2 gene amplification via ISH, the cancer is classified as HER2-positive [
12]. HER2 heterogeneity is a well-established phenomenon in breast cancer, signifying the potential for distinct regions within the same tumor to exhibit varying levels of protein expression or genetic amplification [
7,
11]. Tumor heterogeneity refers to the diversity of cancer cells within a tumor or among tumors, encompassing variations in genetic mutations, gene expression patterns, and cellular characteristics. This heterogeneity can occur spatially, with different regions of a tumor exhibiting distinct features, and temporally, as cancer cells evolve and adapt over time [
2,
13,
14]. Heterogeneous HER2 amplification was observed in 11%–40% of tumor cell populations in HER2-positive breast cancers [
8,
15,
16,
17]. On the geographic (spatial) distribution, HER2 heterogeneity has been described in a “clustered”form or “mosaic” pattern characterized by pockets of highly amplified cells [
8,
18,
19,
20]. HER2 heterogeneity may contribute to inaccurate HER2 status assessment and affect therapeutic decision-making, and validation of techniques to identify HER2 heterogeneity in the clinic and concurrent development of agents to effectively treat tumors with nonuniform HER2 expression is needed [
21,
22]. Dual-color in situ hybridization (DISH) is also a molecular diagnostic technique used to assess the status of the HER2 gene in breast cancer and other malignancies [
23,
24,
25]. DISH probes are designed to bind to the HER2 gene and CEP17 and can be visualized under a microscope. DISH allows for the simultaneous detection of both the HER2 gene and CEP17, providing comprehensive information on gene amplification, and allows for quantitative analysis, enabling the precise measurement of gene amplification levels.
Recently, Seol et al. reported that the presence of HER2 heterogeneity associated with decreased disease-free survival (DFS) was treated with only 25% adjuvant trastuzumab therapy [
26]. In contrast, Shen et al. revealed that HER2 heterogeneity detected with a HER2 gene protein assay (GPA) was associated with poor overall survival (OS) and increased distal metastasis in patients with HER2-positive breast cancer [
21,
27]. In the report, HER2 gene amplification was evaluated with HER2 GPA, which combines HER2 IHC and DISH for simultaneous evaluation at both the protein and gene at the tissue level. However, The DISH method frequently presents with non-specific background deposition, including debris as subtle silver deposits, which elude automated analysis. Furthermore, the DISH method yields manual counting dependent on the measurer, requiring training for signal counting and susceptible to inter-measurer variability in interpretation [
28,
23]. Recently, Laurence et.al. investigated the prevalence of a range of gene expression distributions in three different tumor types from the Cancer Genome Atlas (TCGA), and studied a genes distribution shape to model heterogeneity of transcriptomic data [
29]. Also, their results indicate that prognostic genes identified based on consideration of the shape of the distribution were different from those identified through more standard assumptions. histogram analysis is the most common and popular method of characterization of ITH in imaging data [
30]. Radziuviene et al. demonstrated automated image analysis of HER2 FISH histograms to refine their definition of genetic heterogeneity, and HER2 ITH could also be identified by histograms based on HER2 FISH amplification [
31]. It is necessity to develop novel methods capable of analyzing HER2 ITH within cancer tissues automatically, and to make unbiased analyses, rather than relying on manual procedures. Also, to assess more clear-cut heterogeneity of HER2, the novel method is necessary to evaluate the HER2 copy number quantification, as HER2 gene copy numbers are present in each of those cancer cells. In additional, it is desirable that the method proves beneficial in predicting prognosis and anti-HER2 therapies sensitivity. We developed a novel method to confirm the shape of HER2 FISH distributions with HER2 gene copy number within a tumor sample, and analyze HER2 ITH automatically. Also, we conducted our method to investigate the association between HER2-targeted therapy and HER2 heterogeneity. This study defined HER2 ITH using Gaussian mixture modeling (GMM) analysis of histograms based on HER2 FISH distributions. Recently A GMM classifier demonstrated potential use for clinical validation of markers and determination of target populations to improve molecular stratification of patients with breast cancer [
32].
We aimed to develop a novel method to analyze HER2 ITH automatically, and determine the clinical association of high HER2 heterogeneity with poor prognosis due to resistance to postoperative adjuvant therapy with HER2-targeted agents in primary breast cancer, through the application of GMM analysis.
2. Materials and Methods
2.1. Patients selection and breast tumor samples
This study consecutively recruited 251 patients with HER2-positive invasive breast carcinoma (mean age: 56.6 years; range: 28–97 years) treated with adjuvant or neoadjuvant HER2-targeted therapies. These patients underwent mastectomy or breast-conserving surgery from February 2009 to January 2018 at Osaka University Hospital, Osaka, Japan. All 251 patients received 12 months of adjuvant trastuzumab.
Surgery was performed on 185 patients, followed by anti-HER2 adjuvant chemotherapy, and 66 patients received neoadjuvant chemotherapy followed by surgery (anthracycline and taxane-based chemotherapy, n = 146; taxane-based chemotherapy, n = 87; anthracycline-chemotherapy, n = 5). Both anti-HER2 adjuvant therapy and hormonal therapy were given to 162 patients with hormone receptor-positive cancer (aromatase inhibitors, n = 99; tamoxifen, n = 45; or goserelin + tamoxifen, n = 18). We collected clinical–pathological characteristics from patients’ electronic medical records, and
Table 1 summarized the details. Tumor tissues (surgical specimens) were obtained at surgery, and tumor tissues from patients who received neoadjuvant chemotherapy were obtained before the treatment by vacuum-assisted core needle biopsy. Solid intratumoral tissues were fixed in 10% buffered formalin and embedded in paraffin (FFPE). The Ethics Committee of Osaka University approved the study protocol.
2.2. HER2 FISH assay
All tumor tissues underwent HER2 FISH assay. HER2 status, including HER2 gene signals and ratios, was determined by FISH using the PathVysion HER-2 DNA probe Kit (Abbott Laboratories, Abbott Park, IL). Regarding the kits, the tumor FFPE tissue slides were dried and 20 µL DAPI counterstain solution was applied. Then, a fluorescence microscope was used to capture both orange-fluorescent signals corresponding to HER2/neu and green-fluorescent signals corresponding to CEP17. For the histograms of HER2 FISH, all of the fluorescence images were automatically identified by histograms based on HER2 FISH amplification for the protocol of Duet-3/Setup Station/SOLO2 system (BioView, Rehovot, Israel). Meanwhile, for counting the value of HER2 FISH, a fluorescence microscope was used to count the number of those fluorescent signals under the manual or semi-automated protocols analysis. First, samples were analyzed with semi-automated protocol and then samples failed semi-automated protocol analysis were analyzed with manual protocol. The number of cells counted was more than 60 in semi-automated protocols, and it was 20 in the manual. The ratio between the HER2/neu and the CEP17 signal count was calculated. HER2 was considered positive when a tumor contained more than two genes per cell for the FISH assay of HER2. Experienced molecular pathologists used the 2018 HER2 ASCO/CAP updated guidelines to analyze the HER2 status.
2.3. Assessment of HER2 intratumoral heterogeneity
We evaluated HER2 intratumoral heterogeneity (ITH), which was manifested by the shape of HER2 FISH distributions with HER2 gene copy number within a tumor sample, as shown in
Figure 1A. We used scanned image data from FISH diagnostic reports. We extracted the FISH count information by reading the histogram and performing image processing analysis, such as OCR, because a few scanned FISH images were insufficient for image analysis. Then, the heterogeneity was analyzed by Gaussian mixture modeling (GMM) of histograms based on HER2 FISH. Parameters were estimated by fitting a mixed distribution with mean 𝜇 𝜎, and 𝜋 was the weight of each mixed component:
Gaussian distribution f(x) to the distribution of signal counts, where N was the normal and standard deviation.
Here, the distributions of FISH signal counts 2 and those >2 are assumed to be normal (π , respectively). Then, 1/ 2 was used as a measure to calculate the diversity. This is because a large 1/ 2 means a FISH count of <2, but the average FISH count is >2.
Figure S1 shows two representative cases where imaging for a diagnostic report was fitted with a mixture of Gaussian distributions. HER2 ITH, which was manifested by the shape of HER2 FISH distributions were classified into two groups: the biphasic grade graph (high heterogeneity [HH]) group and the monophasic grade graph (low heterogeneity [LH]) group for each tumor in
Figure 1B. Both groups were evaluated in terms of DFS and OS. All patients had been receiving HER2 treatment. The median of 10 years after annual follow-up was obtained. We used Python (v.3.10.6) and the Python libraries “opencv” (v.4.7.0.72), “pillow” (v.9.0.1), “pyOCR” (v.0.8.3), and “pytesseract” (v.0.3.10) for diagnostic report image processing.
Figure 1 shows the representative histograms of HER2 FISH distributions.
2.4. Statistical analyzes
All patients was classified into the HH and LH groups, and the HH groups was compared to the LH groups In clinicopathological parameters. The chi-square test was used to assess the associations between HER2 heterogeneity and clinicopathological parameters. DFS was calculated from the date of diagnosis of primary breast cancer to the date of diagnosis of metastatic breast cancer, and OS was calculated from the date of diagnosis of metastatic breast cancer to the date of death from any cause, censoring at last known follow-up. The Kaplan–Meier method was used to calculate t disease-free survival (DFS) rates. Differences in DFS were evaluated by the log-rank test. The Cox proportional hazard model was used for univariate and multivariate analyses of various clinicopathological and biological factors for DFS. R (v.4.2.3; 14) was used for all statistical analyses, and statistical significance was set to P < 0.05 for two-sided tests.
2.5. Data availability
We obtained Medical Ethical Committee approval (ethical board approval number of Osaka University: 22080 (T1). This is an observational study conducted with patient enrollment based on the opt-out method for comprehensive informed consent. Informed consent was obtained in the form of opt-out on the web-site, and those who rejected were excluded.
4. Discussion
This study aimed to develop our novel method to analyze HER2 intratumoral heterogeneity (ITH) automatically, and to clarify the clinicopathological characteristics and prognosis of patients with breast cancer with HER2 ITH (Figure S1). We revealed an 18.3% frequency of HH tumors. HH tumors were significantly associated with high ER positivity, PgR positivity, and lower HER2 FISH ratio than LH tumors. This study aimed to develop our novel method to analyze HER2 intratumoral heterogeneity (ITH) automatically, and to clarify the clinicopathological characteristics and prognosis of patients with breast cancer with HER2 ITH (Figure S1). We revealed an 18.3% frequency of HH tumors. HH tumors were significantly associated with high ER positivity, PgR positivity, and lower HER2 FISH ratio than LH tumors. Thus, luminal HER2 (ER+ and PgR+) tumors have the highest frequency (28%) of HH tumors (Figure S3) in our study, and these observations are essentially consistent with the concept of tumor heterogeneity, where a biological diversity of cancer cells was found within a tumor [
33]. In
Table 2, HH tumors were not significantly associated in pretreatment LN metastasis (0.098). The reason is that the HER2 ITH may contribute to the therapeutic effect (suppression of metastatic recurrence) of HER2 treatment and may has little to do with cancer invasion, metastasis, and the progression to LN metastasis. High proliferative cancer cells in both HH and LH tumors may be susceptible to pretreatment LN metastasis.
Further, the present study demonstrated a difference in DFS and OS rates between patients with HH and LH tumors, indicating that the presence of high heterogeneity has prognostic significance (
Table 3). High heterogeneity was an independent prognostic factor after adjustment for other factors, including the type of anti-HER2 adjuvant chemotherapy. Highly heterogeneous breast cancers may show poorer prognosis because of their association with different biological characteristics. On the other hand, the ratio of HER2 FISH, and HER2 IHC (3+, or 2+) could not predict prognosis in patients with HER2-positive breast cancer. In additional, we investigated whether the level of HER2 FISH amplification is associated with the prognosis of LH, and LH tumors were divided into the ex-high amplification LH group (Ex-High LH) and a high amplification group LH (High LH). As a result, in HER2-positive LH tumors, our study revealed that the ratio of HER2 FISH (mean number of HER2 FISH signals/cell) (Ex-High LH vs. High LH) was not associated with prognosis of DFS rates and OS rates (Figure 2C and 2D).
Similarly, Shen et al. reported a significantly poorer prognosis in patients with HH tumors than in those with LH tumors, to be evaluated with HER2 GPA, which combines HER2 IHC (the protein) and DISH (HER2 gene amplification) at the tissue level [
27,
21]. In addition, Hou et al. and Horii et al. revealed that HER2 ITH with HER2 GPA method is an independent factor predicting incomplete response to anti-HER2 neoadjuvant chemotherapy [
34,
35]. However, HER2 GPA method as well as DISH yields manual counting and demands advanced technical expertise and specialized equipment. Their methods lacks standardized staining and interpretation methods, making it susceptible to false positives influenced by background factors such as debris. Also, the HER2 GPA was not analyzed with the distribution of HER2 gene copy number within a tumor sample.
The strength of our method is able to analyze HER2 ITH within cancer tissues automatically and to make unbiased analyses of HER2 ITH, rather than relying on manual procedures. Also, our method can evaluate the HER2 copy number quantification within a tumor sample and assess more clear-cut heterogeneity of HER2, because HER2 gene copy numbers are present in each of those cancer cells within a tumor sample. Automated analysis of HER2 expression typically involves utilizing computer algorithms and image processing techniques to assess HER2 IHC status in tissue samples. Miglietta et.al. reported that the assessment of HER2 IHC status had significant observer variability of pathologist [
18]. To accurately assess the state of HER2 IHC, alternative methods such as molecular examinations and digital pathology have been proposed [
36]. Since the widespread adoption of whole slide imaging, Digital image analysis has emerged as a swift, cost-effective, objective, and reproducible scoring methodology for evaluating HER2 IHC status and the efficacy of anti-HER2 therapies through immunohistochemistry [
37,
38,
39,
40]. However, digital image analysis for evaluating the heterogeneity of HER2 IHC, which correlates with anti-HER2 treatment efficacy, is yet to be developed. Our study has the limitations not to compare HER2 IHC with HER2 gene copy number within a tumor sample, unlike HER2 dual-color imaging which combines HER2 IHC and HER2 gene amplification. However, we believe that the prognosis in patients of HER2-positive breast cancers can be estimated more accurately by adding our HER2 FISH distributions analysis of heterogeneity.
Besides, another strength of our analysis is an easy, low-cost method only to read histograms of FISH diagnostic reports and perform image processing analysis. The findings of this study provide valuable insights into the efficacy of our novel method of HER2 ITH analysis, and the highlighting is its potential as a clinically useful of predicting prognosis and anti-HER2 therapies sensitivity. And then, the clinical significance of identification of our analysis for the prediction of the prognosis seems to deserve further investigation of multicenter research.
Furthermore, our analysis has the potential to accurately assess ITH not only in HER2-positive breast cancers but also in HER2-low. It is hypothesized that HER2-low breast cancers commonly represent more heterogeneous population than HER2-positive breast cancers [
41]. Recently, trastuzumab deruxtecan (T-DXd) treatment resulted in significantly longer relapse-free survival and OS than chemotherapy among patients with HER2-low metastatic breast cancer [
42]. This study reported that HER2-low tumors constituted a heterogeneous population that varied in prognosis and sensitivity to systemic treatments. T-DXd, effectively targets tumor cells that express low levels of HER2 and can deliver a potent cytotoxic payload through the bystander effect to neighboring tumor cells heterogeneously expressing HER2, unlike many other approved HER2-targeted therapies [
43,
44]. Therefore, in the HER2-low as well as HER2-positive, highly heterogeneous breast cancers might be responsive to treatment of T-DXd that exhibits potent therapeutic efficacy through the delivery of cytotoxic payload. In addition, Mosele et. al. reported that HER2 is a determinant of sensitivity to T-DXd, although modest anti-tumor activity was also observed in a small subset of patients whose cancer did not express HER2 [
45]. The study revealed that T-DXd anti-tumor activity increased when HER2 expression was high, modest anti-tumor activity was also observed in patients with HER2 IHC 0, and suggested that very low levels of HER2 could allow uptake of T-DXd and/or that partially mediated drug efficacy. That is to say, the level of HER2 IHC is extremely low or absent, which cannot have implications for T-DXd treatment decisions, and our HER2 FISH distributions analysis without HER2 IHC, might accurately identify the heterogeneity of HER2-low breast or absent, and predict T-DXd efficacy of breast cancers. Hence, our analysis may be useful for predicting the effectiveness of not only standard HER2-targeted therapies but also of the new drug against highly heterogeneous tumors. In the future, we are going to investigate whether our method our analysis can accurately identify the heterogeneity of HER2-low breast cancers or not.
Meanwhile, it has been confirmed that HER2-positive gastric cancer responds to HER2-targeted therapy as well [
46]. The positivity rate of HER2 in gastric cancer is lower compared to breast cancer, resulting in lower effectiveness of HER2-targeted therapies [
47,
48]. Moreover, there is a higher probability of HER2 heterogeneity in gastric cancer compared to breast cancer [
49,
50]. Our study might have clinical practice impact that our novel analysis clinically t has the potential to become a new classification of HER2 ITH and predict prognosis in patients with HER2-positive gastric cancer as well as breast cancer, as a discriminative, inexpensive and easy-to perform method.
Author Contributions
Tomonori Tanei: Conceptualization, Visualization, Formal analysis, Investigation, Methodology, Project administration, Writing original draft; Shigeto Seno: Investigation, Methodology, Software, Visualization, Writing; Review & Editing; Yoshiaki Sota: Data curation, Methodology, Validation, Visualization, Writing - Review & Editing, Project administration; Kaori Abe: Writing - Review & Editing; Nanae Masunaga: Writing - Review & Editing; Masami Tsukabe: Writing - Review & Editing; Tetsuhiro Yoshinami: Writing - Review & Editing; Tomohiro Miyake: Writing - Review & Editing; Masafumi Shimoda: Writing - Review & Editing; Hideo Matsuda: Writing - Review & Editing; Kenzo Shimazu: Writing - Review & Editing, Supervision, fouding acuistration.