1. Introduction
Breast cancer is the most common type of cancer and also one of the leading causes of cancer-related mortality in women globally [
1]. Five-year survival rates vary dramatically on the stage at presentation (being 99% in stages 0–IA, and 27% only in stage IV) [
2]. In the last decade, the management of breast cancer shifted from primary surgical treatment to neoadjuvant systemic treatment (NST) in highly proliferative subtypes (human epidermal growth factor receptor 2 (HER2) positive, triple negative (TN), and some luminal B-like) [
3,
4,
5]. This concept assumes that NST eradicates potential microscopic metastases and the evidence that achieving a pathologic complete response (pCR) confers improved long-term survival. In patients without pCR achievement, the outcome could be further improved by tailoring the post-neoadjuvant treatment [
6].
Adaptive immune response has an important role in cancer growth modulation through the mechanisms of avoiding immune destruction of cancer cells and promoting inflammation [
7]. Most notably, a high density of tumour-infiltrating lymphocytes (TIL) has been linked with improved outcomes in colon, ovarian, lung, and breast cancer [
8,
9,
10,
11,
12]. In early breast cancer, patients with high TIL density had higher probability of pCR after NST [
12], irrespective of breast cancer subtype [
13]. However, only in TN and HER2-positive subtypes higher TIL density contributed to prolonged progression-free survival (PFS) and overall survival (OS), while in the luminal B-like subtype that was not the case [
13,
14]. Stromal TIL consists of T cells, B cells, and natural killer cells in various proportions; in breast cancer, typically 75%, 20%, and 5%, respectively. The TIL subtypes further dictate the entire TIL action, thus steering cancer progression toward a more or less favorable clinical outcome [
15,
16,
17]. In-depth research of T-cells in TIL in breast cancer patients revealed that CD8+ T-cell infiltration has been associated with a favorable clinical outcome, however, cancer cells could develop several resistance mechanisms against CD8+ T-cell anti-tumour activity [
18,
19,
20,
21,
22,
23,
24]. They could secrete into the tumour microenvironment immunosuppressive cytokines (interleukin 6 (IL-6), IL-17, and tumour growth factor beta (TGF-β)), which has been associated with a poor clinical outcome [
25,
26]. Moreover, these cytokines also increase the levels of tumour-associated macrophages, regulatory T cells and myeloid-derived suppressor cells in the tumour microenvironment, which further limit CD8+ T-cell tumour infiltration and activity [
8,
25,
27,
28,
29,
30]. This coupled with the upregulation of programmed death ligand 1 (PD-L1) gene expression, further contributing to immunosuppression [
31]. Programmed death receptor 1 (PD-1) is a cellular receptor expressed on antigen-presenting T cells. Binding with its ligands PD-L1 or PD-L2, which are normally expressed on antigen-presenting cells and aberrantly on tumour cells, leads to an inhibition of the effector functions of T-cells. PD-1+ TILs were found frequently in HER2+ and basal-like tumours (part of the TN subtype). In basal and luminal B-like subtypes, PD-1+ TIL was associated with a worse prognosis [
32,
33,
34], however, in some studies, it significantly correlated with higher TIL expression and an improved probability of pCR [
34,
35].
Regulatory T cells in TIL have been associated with a worse prognosis, irrespective of the breast cancer subtype [
8,
36,
37,
38]. Regulatory T-lymphocytes are a subpopulation of CD4+ T-lymphocytes with the immune phenotype CD4+ CD25+ forkhead box P3 (FOXP3+). In a healthy tissue environment, they suppress and modulate the immune responses in order to prevent autoimmune reactions. In breast cancerogenesis, the fraction of FOXP3+ T-lymphocytes increases considerably with the progression from healthy breast tissue to ductal carcinoma in situ and eventually to invasive ductal carcinoma [
39,
40]. In the breast cancer microenvironment, FOXP3+ lymphocytes were strongly associated with both lower relapse free survival (RFS) and OS [
41,
42,
43]. FOXP3 expression in cytoplasma of tumor cells was of no prognostic significance, however a high infiltration of FOXP3+ lymphocytes accompanied by a cytoplasmic FOXP3+ tumor was the most detrimental phenotype [
37]. Other reports, however, depicted that CD4+ regulatory T cells seemed to be capable of inhibiting invasive breast cancer development from pre-invasive breast cancer by suppressing pro-tumorigenic T helper 2 cell responses [
44]. Yeong et al. has shown that a combination of high densities of FOXP3+, CD8+ T-cells and CD20+ B cells is associated with a favorable prognosis in TN subtype [
42,
45].
The above-noted conflicting findings clearly show that at evaluation of the percentage of FOXP3+ TIL, the percentage of CD8+ TIL should also be taken into consideration. Specifically, it is highly important to describe the ratio between the different subsets of TIL components to better understand the nature of tumour microenvironment regulation in terms of its pro- or anti-tumour effect [
46,
47].
Gene expression profiling was used to look at TIL in primary breast cancer. This showed that both CD4+ T-lymphocytes and follicular T-helper cells expressed CXCL-13. While their immune functions are currently less well understood, researchers believe CXCL-13-expressing T-helper cells play a significant role in attracting immune cells to the tumour microenvironment, regulating antigen-specific B-cell responses, as well as promoting the formation of tertiary lymphoid structures, thus generating germinal centers for local memory B-cell differentiation at the tumour location [
48,
49,
50]. Studies on the expression of CXCL13 genes showed that CXCL13 significantly correlated with improved disease-free survival [
15]. One study showed that CXCL13 expression was a significant predictor of a pCR [
48].
However, prospective studies evaluating the contribution of different pro- and anti-tumour TILs to pCR are extremely scarce. The goal of this study was to analyze TIL density and TIL subtypes in core biopsies of breast cancer patients before NST and investigate their possible predictive role for pCR when treated with NST in a whole cohort and according to the breast cancer subtypes. To that end, TIL subtypes were assessed as the proportion of (presumably) anti-tumour (CD8+, CXCL13+) and (presumably) pro-tumour (PD-1+, FOXP3+) immune cells in TIL specimens. Our first hypothesis was that patients with a high TIL density would have a higher probability of pCR after NST compared to those with a low TIL. The second hypothesis was that higher CD8+ TILs, higher CXCL13+ TILs, lower PD-1+ TILs and lower FOXP3+ TILs are associated with a higher probability of pCR. We also investigated TIL and its subtypes within different molecular subtypes of breast cancer and examine their association with other clinicopathologic parameters.
2. Materials and Methods
2.1. Patients and treatment
We conducted a prospective, non-interventional study in patients with early breast cancer who were treated with NST, followed by breast and axillary surgery. All patients were treated at the Institute of Oncology Ljubljana. Inclusion criteria were female older than 18 years, diagnosed with early breast cancer of stages IIA (cT2N0M0) to IIIB (cT4a-cN0-1M0), luminal B-like, HER2+-luminal, HER2+-non-luminal, and TN subtype, regardless of menopausal status, and intended for NST treatment.
Exclusion criteria included the luminal A-like subtype (progesterone receptors (PR) >20% and/or Ki-67 ≤10%), metastatic disease, patients not understanding Slovenian, and ineligibility for the NST. Before inclusion in the study, all patients provided informed consent. The Slovenian Medical Ethics Commission approved the study (No. 0120-133/2017-2, dated June 8, 2017). All procedures were carried out in accordance with the Helsinki Declaration and Good Clinical Practice.
During the diagnostic and staging procedures of breast cancer, patients underwent a clinical examination, laboratory blood analysis, bilateral mammography, breast ultrasound and ultrasound-guided core needle biopsy, radiopaque clip insertion in the breast tumour, ultrasound of tumour-side axillary and supraclavicular lymph nodes, fine-needle aspiration of any suspicious lymph node, and a breast MRI scan (unless mastectomy was planned). For metastatic disease exclusion, a contrast computer tomography scan of the thoracic and abdominal organs and bone scintigraphy were performed.
Patients were treated with NST according to European Society for Medical Oncology (ESMO) recommendations at the time [
3,
4]. Shortly, chemotherapy consisted of sequential therapy with anthracyclines and taxanes (4 cycles of doxorubicin or epirubicin/cyclophosphamide followed by 12 cycles of weekly paclitaxel or 3–4 cycles of 5-fluorouracil/epirubicin/cyclophosphamide followed by 3–4 cycles of docetaxel). At the attending oncologist’s discretion, a dose-dense regimen was delivered. Similarly, in cases of contraindications, anthracycline-free schemas were used. Patients with HER2+ breast cancer tumours received anti-HER-2 therapy alongside taxane administration [
3,
4]. Breast surgery took place 3-6 weeks after the end of NST. Adjuvant radiation, endocrine and anti-HER2 therapy and postneoadjuvant capecitabine in TNBC were applied according to guidelines [
3,
4,
51].
2.1.1. Endpoints
The primary endpoint was the probability of pCR after NST, defined as the absence of invasive carcinoma in the breast (in situ carcinoma could be present) and in the axilla (ypT0/is ypN0) [
52].
2.2. Pathohistological Examinations
On the pre-treatment paraffin-embedded tumour block (core biopsy specimen), standard tumour characteristics (pathohistological subtype, grade, expression of estrogen receptors (ER) and PR, HER2 status, Ki-67) were performed. Patients were regarded as HER2+ if immunohistochemistry (IHC) was 3+, in the case of IHC 2+ fluorescent in situ histochemistry was performed, and ratio ≥2 was regarded as HER2+. Patients were regarded as luminal B-like, if ER ≥1%, PR ≤20%, Ki-67 >10% and HER2-. In the case of HER2+, ER 0% and PR 0% result, patients were regarded as HER2+-non-luminal, and in case of HER2+ and ER ≥1% and/or PR ≥1% they were regarded as HER2+-luminal. TN subtype was defined as ER 0%, PR 0% and HER2- [
3,
4]. For TIL density evaluation, one representative hematoxylin eosin (HE) stained tissue slice was used. TIL density was evaluated by two independent blinded pathologists, sub-specialized in breast cancer pathology, and the average of both TIL assessments was taken for the analysis. TIL density evaluation was performed in accordance with the guidelines of the International TIL Working Group 2014 and the International Immuno-Oncology Biomarker Working Group 2017 [
12,
53,
54]. Briefly, all cells in the intra-tumour stroma that exhibited the characteristics of mononuclear immune cells, were classified as TIL cells. TIL density was expressed as the percentage of the intra-tumour stroma surface area occupied by TIL and was expressed using a numerical variable ranging from 0% to 100%. TIL density was also dichotomised into a low TIL density group (0-59%) and a high TIL density group (60-100%), according to Denkert et al [
12]. For TIL phenotyping, additional tissue slices were stained separately for CD8, CXCL13, FOXP3, and PD-1 positive cells. The percentage of TIL in the intra-tumour stroma, occupied by the above-mentioned markers, was assessed and counted by a pathologist at 400x magnification, where the average cell percentage from four visual fields was used to determine the target cell percentage in each sample.
2.3. Assay Methods
Immunohistochemical examination of FOXP3, CD8, PD-1, and CXCL13 was performed on 2-4 µm thick fresh formalin paraffin-embedded tissue sections dried at 56 ⁰C for 2 hours, using a fully automated IHC staining system Ventana Benchmark Ultra (Ventana ROCHE Inc, Tucson, AZ, USA).
CD8, FOXP3 and PD1 epitopes were retrieved with heat induced epitope retrieval (HIER) using Cell Conditioning Solution 1 (Ventana ROCHE inc. Tucson, AZ, USA; cat. No. 950-124) for 88 minutes at 100⁰C, while CXCL13 epitope was retrieved with proteolysis using Protease I (Ventana ROCHE inc. Tucson, AZ, USA; cat. No. 760-2018) for 8 min at 37⁰C.
Retrieved epitopes were detected using monoclonal antibodies directed against CD8 (DAKO Agilent technologies Inc. Santa Clara, CA, USA; cat. No. M7103; clone C8/144B; diluted at 1:100), FOXP3 (Epitomics, Rocklin, CA, USA; cat. No. AC0304RUO; clone EP340; diluted at 1:200), PD1 (CellMarque, Rocklin, CA, USA; cat. No. 315M; clone MRQ22; diluted at 1:800) and CXCL13 (R&D systems Minneapolis, MN, USA; cat. No. MAB8012; clone 53602; diluted at 1:25).
All primary antibodies were incubated on board for 60 minutes at 37⁰C and visualized using the 3-step multimer detection system OptiView DAB IHC Detection Kit (Ventana ROCHE Inc. Tucson, AZ, USA; cat. No. 760-700) according to the manufacturer's instructions. Visualization of CXCL13 and PD1 was enhanced using OptiView Amplification Kit (Ventana ROCHE inc. Tucson, AZ; USA cat. No. 760 099).
Normal appendix was used as control for the 4 antigens. Optimal staining reactions were considered as follows: CD8 - strong distinct membranous staining in subepithelial and interfollicular cytotoxic T-cells, FOXP3 - moderate intensity distinct nuclear staining in regulatory T-cells, PD1 - strong distinct membranous staining in the germinal center associated helper T-cells and CXCL13 - strong distinct cytoplasmic staining in the germinal center associated follicular helper T-cells and follicular dendritic cells. All other cells must stain negatively for these antigens.
2.4. Statistical Analysis
2.4.1. Sample size
With the planned sample of 180 patients, a chi-square test with a significance level of 0.05 achieves 80% power to reject the null hypothesis of no association between TIL (high ≥60% or low) and pCR (yes or no) if Cohen’s effect size W is at least 0.21. This effect size corresponds to the difference in the probability of pCR between high and low TIL group of at least 29% if we assume that in population 30% achieve pCR (based on [
55]) and 12% belong in the high TIL group (based on [
12]). When analysing TIL as a numerical variable, we are able to detect even smaller effect sizes.
2.4.2. Data analysis
Numerical variables were presented with medians and interquartile ranges (IQR), and categorical variables with frequencies and percentages. In addition to the descriptive statistics, the association of each patient or tumour characteristic with pCR was evaluated by univariate logistic regression, where p values were adjusted using Holm’s method to control the family-wise error rate. The distribution of TILs was presented graphically using boxplots. The difference between TIL measurements by two pathologists was presented by Bland-Altman plot (supplement material). On bar plots for the proportion of pCR in separate TIL groups, 95% Clopper-Pearson’s exact confidence intervals (CIs) were presented.
The association between TIL and pCR was assessed in a multivariable logistic model, where we controlled for age, nodal stage, Ki-67 (numerical) and cancer subtypes (two groups based on different biology: combined HER2+ luminal, HER2+non-luminal and TN vs. luminal B-like subtype). The model meets the requirement for a sufficient number of events per variable with 59 events (achieved pCR) and 5 estimated coefficients (the selection of independent variables was predetermined and based on background knowledge). In exploratory analysis (supplement material), separate multivariable models were fitted to explore the association between pCR and separate TIL subtypes (CD8+, CXCL13+, FOXP3+, PD1+) using the same independent variables as in the main model for TIL. All logistic models were fitted using Firth’s bias reduction method, odds ratios (ORs) were reported together with 95% CIs. The discriminative ability of the multivariable logistic models was estimated by means of the ROC (receiver operating characteristic) curve analysis reporting the area under the ROC curve (AUC).
In the exploratory analysis (supplement material), association between TILs and various clinical and histopathologic characteristics (tumour grade and size, categorised Ki-67, nodal status, breast cancer subtype) was evaluated using Mann–Whitney or Kruskal-Wallis test for numerical TILs and Fisher’s exact test for dichotomised TIL, where p values were adjusted using Holm’s method to control the family-wise error rate.
An (adjusted) p value of less than 0.05 was considered statistically significant. The analysis was performed with the help of relevant open-source libraries in Python 3.8.5. and SPSS, IBM Co., v. 22.
4. Discussion
In our prospective non-interventional study, performed in early breast cancer patients undergoing NST, we have been investigated the possible role of pre-treatment TIL density and TIL subtypes on achieving pCR. Our data clearly confirmed the first hypothesis that patients with a high TIL density would have a higher probability of pCR after NST compared to those with a low TIL.
On the other hand, our second hypothesis, that higher CD8+ TILs, higher CXCL13+ TILs, lower PD-1+ TILs and lower FOXP3+ TILs are associated with a higher probability of pCR, was partially confirmed: only high CD8+ TILs were associated with pCR in the multivariable model (exploratory analysis). Surprisingly, on our sample, a higher probability of pCR was associated with a higher percentage of PD-1+ TILs and FOXP3+ TILs but these associations were not statistically significant in multivariable models nor in univariate analyses. Other independent predictive factors for a higher probability of pCR were the HER2+ non-luminal/TN subtype, high Ki-67 and grade (univariate analyses, adjusted for multiple comparisons).
In this study, we added TIL density evaluation to the routine histopathological analysis of the tumour sample analysis before treatment. TIL density in our cohort was positively associated with grade and Ki-67 (Table S1). Median TIL was 12.5% and 17.5% in G3 and high Ki-67 (Ki-67>40%) tumors, respectively. A similar correlation of high TIL with high Ki-67 and grade was reported in some other studies [
56,
57,
58].
In our sample, patients with HER2+ non-luminal subtype had the highest median percentage of TIL (22.5%) among the four different subtypes but this difference could not be generalised to the population (adjusted p=0.085) (Table S1).
In our sample only 10 patients had high TIL (>60%) and 6 of them (60%) achieved pCR compared to 33% pCR in low TIL group but association between dichotomised TIL (cut-off 60%) and pCR was not statistically significant. The reason could be the small number of patients in the high TIL group, or a too-high TIL cutoff value.
Similar but significant findings, however, were reported in the GeparDuo cohort, where a subgroup of lymphocyte-predominant breast cancer (>60% of either stromal or intratumoral lymphocytes) had a probability of pCR of 41.7% in comparison with 9.3% in the other group [
12,
59]. Additionally, in the GeparDuo and GeparTrio cohort, the odds ratio for pCR increased with the extent of TILs [
60]. Tumours without any TILs had a probability of pCR of 7.2%, while those with lymphocyte predominant breast cancer had a probability of pCR of 40% [
12,
53,
61]. It should be noted that the probability of pCR was 12.8% and 17% in GepardDuo and GepardTrio cohort, respectively, whereas it was 35% in our study. We used a TIL cutoff of 60% because it was the standard used by researchers in large neo- or adjuvant trials at the time our study was designed used cut-off between 50% and 60% [
59,
62,
63,
64,
65,
66,
67,
68,
69]. Numerous other studies, performed in the meantime, showed substantially lower cutoffs for high TIL to be appropriate to distinguish between patients who benefited for pCR prediction. The cutoff, however, depends on a breast cancer subtype [
63]. Recently, the St. Gallen consensus meeting for the TN subtype recommended a cutoff of 50%. This cutoff was based on relapse-free survival (RFS) data, not pCR data, and emphasized a very favorable outcome in small TN tumours without adjuvant chemotherapy treatment [
70]. For HER2+ subtypes, even lower cutoffs (5-40%) are reported to be prognostic for pCR [
62,
63]. Our sample size of 171 patients with 59 achieving pCR (events) was too small to allow for a cut-off determination.
In our study, numerical TIL was significantly associated with pCR in univariate analysis (
Table 1) and revealed as an independent factor for pCR in a multivariable model (
Table 2). Increasing TIL by 10% will increase the odds for pCR by 27% (
Table 2).
Figure 3 indicates that a dichotomised TIL is not suitable for determination of the association with pCR.
Figure 4 suggests that all subtypes of breast cancer have been linked to higher percentages of pCR when the TIL density is high on our sample, but our sample size did not permit an additional inferential subgroup analysis. Similarly, in GeparSixto trial it was shown that increased levels of stromal TILs predict pCR in the whole and also in TN and HER2+ cohort [
65,
71]. Our multivariable model confirmed HER2+/TN subtype and node-negative status as additional independent predictive factors for a higher probability of pCR. These findings are in accordance with reports of others [
12,
62,
63,
71,
72,
73,
74,
75].
It is now increasingly clear that the tumour microenvironment is composed of heterogenous tumour cells and the host's endogenous stroma, which together undergo changes as the disease progresses. Importantly, stromal cells are becoming recognised as key players in the development of the tumour microenvironment, metastasis, immune infiltration and inflammation, and resistance to chemotherapeutic agents [
76,
77,
78].
Our investigation revealed CD8+ TILs as the most prevalent among studied TILs, with a median of 40% of total TIL (
Table 1,
Figure 5). Additionally, all breast cancer subtypes had similarly high CD8+ TILs (median 40-45%). On the contrary, presumably pro-tumour TIL subtypes (PD-1+ TILs and FOXP3+ TILs) tended to be more prevalent in the TN and HER2+ non-luminal subtypes (
Figure 6, Table S1).
In our sample, numerical CD8+ TILs did not show any association with grade, Ki-67, tumour size, nodal stage, or subtypes (Table S1). However, numerical CD8+ TIL was significantly associated with pCR in a multivariable model (exploratory analysis,
Table S2). Odds for pCR increased by 31% with increase of CD8+ TILs by 10%.
From other reports, there is some evidence that the possible role of CD8+ T-cells is greater in hormone receptor-negative breast cancers. In a study evaluating 1854 breast cancer samples, Baker et al. has shown improved disease specific survival in patients with high CD8+ TIL in ER-negative tumors only. On the contrary, in low grade ER-positive tumours a high CD8+ TIL was associated with inferior outcome [
79]. CD8+ infiltrates were seen in 60% of TNBCs. In concordance with our results, CD8+ TILs were found to be an independent predictive factor for pCR [
80,
81].
CXCL13+ TILs represented only a median of 1% of total TIL, it was similarly expressed in all cancer subtypes. We found an association only with the tumour stage. In two other studies, CXCL13 expression was associated with higher tumour grade (grades 2–3), positive nodes, ER-negative status, longer metastasis-free survival, and a stronger prognostic effect in HER2-positive breast cancer [
49,
82]. In the third study, a four-gene signature that includes CXCL13 was predictive for the extent of lymphocytic infiltration after NST in TNBC. The increase of the value of these signatures was associated with a better distant relapse-free survival, adding novel prognostic information for this aggressive breast cancer subtype [
83].
In our patients, PD-1+ TILs expression was associated with high tumor grade and high Ki-67, similar to the findings of others [
34,
35,
84]. Importantly, we observed that a higher percentage of pCR is associated with higher PD-1+ TILs, but this association was not significant when generalized to the population. A significant association of PD-1 expression with higher TIL scores and pCR in early breast cancer, however, was demonstrated by Denkert et al. [
65]. This suggests that PD-1 expression may be a useful biomarker for predicting response to NST in breast cancer patients. Further studies are needed to confirm these findings and determine the potential clinical implications.
FOXP3+ TILs were, surprisingly, positively associated with pCR in our sample (not statistically significant). In our sample TN and HER2+ non-luminal subtypes had a slightly higher median expression of FOXP3+ TILs than other subtypes, but this could not be generalised to the population. Loi et al. reported the highest percentage of FOXP3+ cells in the TN subtype [
61], but the median percentage in that study was 70%, which is substantially higher than in our study (4%). FOXP3+ TILs are known to disrupt anti-tumour immunity by suppressing the effector functions of various immune cells and have been implicated in the escape of cancer cells from immunosurveillance [
85,
86]. Increased levels of FOXP3+ cells were strongly associated with an increased risk of early and late relapse, lower RFS and OS [
12] and lower percentage of pCR [
19]. In our study, on the contrary, FOXP3+ TILs and pCR were positively associated, but the association was not significant. Some other authors, especially in recent years, also found a positive association of FOXP3+ TILs with pCR [
43,
87,
88]. Sun et al. reported subtype-specific improvement of pCR (OR: 1.20, 95% CI 1.02-1.40) and OS for HER2+ subtype of breast cancer [
43]. The possible mechanism underlying this paradoxical observation could be the higher efficacy of chemotherapeutic treatments in a highly immunosuppressive environment. Namely, it has been reported that some chemotherapeutic agents, particularly cyclophosphamide, can inhibit FOXP3+ cells. Chemotherapy might be more effective in tumours with high levels of FOXP3 + TILs, and this in turn could facilitate tumour attack by CD8+ TILs and help to achieve pCR [
43]. Furthermore, tumors having high CD8+/FOXP3+ ratio were more sensitive to anthracycline and taxane chemotherapy [
22] and had higher pCR, especially the TN and HER2+ subtype [
22,
89].
To summarize, our data suggest that an evaluation of TIL subtypes by IHC could add additional predictive information for pCR beyond TIL density alone. More sophisticated and expensive techniques, like software guided immune cell evaluation, could improve the accuracy of TIL subtype evaluation. However, we have to take into consideration that these techniques may suffer from pre analytical and interpretative inconsistencies, which could impair their reproducibility and widespread application in daily practice. It is very important to clearly define each immune cell subset studied. For example, Fillion et al. reported CD4 expression not only in T cells but also in a subpopulation of monocytes, causing a bias and discrepancies in results and interpretation between studies [
90].
This study has some limitations. Firstly, core needle biopsy sample could not be fully representative of the whole tumour's heterogeneity; however, it is used everywhere as a standard test to decide on the type of first treatment. Secondly, the visual method of TIL scoring is prone to intra- and inter-observer variability. In our case, interobserver variability was low at low TIL density, but high at high TIL density (Figures S1 and S2 ). In the case of high dis-concordance, a consensus decision between pathologists should be done. Thirdly, immunophenotyping of TIL includes few types of immune cells only, not taking into consideration other subtypes of lymphocytes, natural killer cells and cytokines or non-immune cells like macrophages, neutrophils, myeloid-derived stem cells, etc.
There are several strengths to our study. First, our study is very valuable, because it was a prospective study that included a substantial fraction of the patients treated with NST in a three-years period. Second, for our pathology department, the study was very important, because, within the study procedures, we initiated systematic scoring of TIL in core biopsies of breast cancer samples, and the method is now ready to be implemented into routine clinical work. TIL density is evaluated using HE stained sections only, which makes it practical for routine use. Also, the preoperative breast cancer tumour board meetings were set up at the beginning of our study, resulting in better decision-making about treatment sequence for all newly diagnosed breast cancer patients. We believe that the incorporation of TIL density measurements can be easily incorporated into routine pathology, especially in low-to-middle-income countries, but some run-in time is needed, as was the case with the Ki-67 method.
Further implication of our work: There is still much work to be done in the field of TIL biomarkers, especially in the course of immunotherapy treatment, which has been approved for the treatment of the TN subtype of breast cancer. The link between pCR, when treated with immunotherapy, and TIL subtypes should be investigated. Such research could pave the way for personalized treatment plans based on TIL biomarkers, leading to improved outcomes for breast cancer patients. Additionally, future studies could explore the potential of TIL biomarkers in predicting responses to other forms of cancer treatment beyond immunotherapy. In the future, machine learning techniques may help us further standardize the TIL evaluation and find important immune phenotypes.
To sum up, TIL scoring is a new biomarker that could be used in clinical practice. It remains open if PD-1+ TILs and FOXP3+ TILs are linked with a higher or lower response to chemotherapeutics used in the neoadjuvant treatment of breast cancer.