Introduction
Despite significant improvements in the early detection and treatment of breast cancers, this disease persists as a global health concern that ranks among the leading causes of mortality in women [
1,
2]. Breast cancers are highly heterogeneous at both the cellular and molecular levels and are classified clinically by immunohistochemistry and/or
in situ hybridization-based detection of estrogen receptor-a (ER-a), progesterone receptor (PR), and HER2/ErbB-2 expression [
3,
4,
5]. Importantly, the presence of these receptors continues to serve as some of the most effective therapeutic targets in all of oncology, particularly for patients harboring breast tumors belonging to the luminal A (ER-a- and PR-positive), luminal B (ER-a and HER2-positive), and HER2 subtypes [
6]. In stark contrast, patients harboring triple-negative breast cancers (TNBCs), which lack notable expression of ER-a, PR, and HER2, exhibit the highest disease grades and proliferation indices that culminate in
(i) the worst progression-free and overall survival rates of all breast cancer subtypes, and
(ii) rapid relapse and early death within 5 years of initial diagnosis and treatment [
6,
7,
8]. Despite the similarities needed to classify breast cancers into subtypes, there remains significant variability amongst individual human breast tumors, even those from the same subtype, which further complicates the choice of therapy and significantly affects patient response to treatment and prognosis.
Transcriptomic profiling of TNBCs using multiple methods such as PAM50 [
9] and TNBCtype-4 [
10] finds that these tumors predominantly fall into the basal-like category, which are highly metastatic and robustly express genes typically found in basal/myoepithelial cells, such as cytokeratins 14, and 16 [
11]. Moreover, TNBCs are also enriched in the expression of transcriptional programs coupled to breast cancer stem cells (
e.g., CD44, ITGA6/Integrinα6, ALDH1A1, and CD133/PROM1 [
12,
13,
14,
15,
16]) and epithelial-mesenchymal transition (EMT) programs [
6]. They are also characterized by
(i) elevated expression of the receptors for epidermal growth factor (EGFR) and stem cell growth factor (KIT) [
17,
18,
19]; and
(ii) mutational inactivation of BRCA and p53, resulting in impaired DNA damage response and high mutational burden [
20,
21]. Finally, recurrent TNBCs frequently acquire resistance to standard-of-care chemotherapeutic agents (
e.g., doxorubicin, cyclophosphamides, and taxanes) through mechanisms that remain incompletely understood. More recently, immunotherapy approaches (
e.g., monoclonal antibodies against programmed death-ligand (PD-L1)) have made inroads into the breast oncology space, particularly for the treatment of TNBCs due to their higher mutational burden, higher infiltration of tumor lymphocytes, and higher expression of PD-L1 [
7]. Unfortunately, this strategy has failed to significantly improve overall patient survival and outcomes in monotherapy settings [
22]. This failure stems from our lack of understanding of the variability and complex nature of the molecular networks that drive the behavior of breast cancer tumors, even those from the same subtype, in different patients. Clearly, a new paradigm in cancer research that takes tumor variability into consideration is urgently needed for identification of new therapeutic strategies that can achieve curative outcomes for TNBC patients.
Long noncoding RNAs (lncRNAs) are a heterogeneous group of cellular RNAs that are transcribed by RNA polymerases I, II or III and thus, may or may not be spliced and polyadenylated [
23,
24,
25]. Although most lncRNAs are less evolutionarily conserved, expressed at lower copy numbers, and less efficiently spliced compared to protein-coding genes [
26,
27], these noncoding transcripts constitute the dominant output of the human genome in terms of the fraction of the genome comprising their loci [
23,
24,
25]. LncRNAs range from around two hundred to tens of thousands of nucleotides in length, with many containing short open reading frames that may even code for short peptides. Nonetheless, they perform an RNA-mediated cellular function [
23,
24,
25,
28]. A large fraction of lncRNAs are predominantly nuclear localized, where they play critical roles in regulation of transcription and epigenetic regulation [
24,
28,
29,
30]. Recent studies have pointed to the involvement of this highly understudied class of RNAs in all aspects of tumor development and metastatic progression that transpires in essentially all human tumors [
31,
32,
33,
34,
35,
36]. However, nearly all mechanistic studies of lncRNA function have been performed using animal models or cultured cells employing knockdown and forced overexpression studies, which may not reflect their function in the context of the complex tumor milieu. Importantly, these studies are inherently unable to address the donor-to-donor variability inherent in human cancers.
We previously characterized and described the role of the murine lncRNA BORG (BMP2/OP1 Responsive Gene [
37]) in breast cancer. Indeed, we found BORG expression to be aberrantly upregulated in human and murine TNBC cell lines and tumors, particularly those exhibiting aggressive metastatic and chemoresistant behaviors [
38]. In the mouse, BORG (annotated as GM45924) is a Pol II-transcribed, nuclear-localized, spliced and polyadenylated intergenic RNA [
39]. Mechanistically, we showed that BORG binds to the E3 SUMO ligase TRIM28/KAP1, and that the resulting complex induces latent breast cancer cells to resume proliferative programs coupled to metastatic outgrowth and recurrence [
38]. Moreover, BORG:TRIM28 complexes also govern the self-renewal and expansion of breast cancer stem cells, doing so by inducing the expression of Nanog, Aldh1a3, and Itgs6/a6 integrins [
40,
41]. Finally, we observed BORG to interact physically with RPA1, an event coupled to the acquisition of resistance to doxorubicin
via activation of the prosurvival NF-B axis [
42]. Despite these intriguing findings, a human gene corresponding to the murine BORG has not been annotated nor identified and thus, the existence of a human ortholog for the mouse BORG gene was uncertain. In addressing this knowledge gap, the overall objectives of the current study were to
(i) determine whether a human ortholog for BORG existed, and if successful,
(ii) characterize the gene and define its genomic architecture, and
(iii) define the function of human BORG in developing and progressing human breast cancers, particularly TNBCs, at molecular level in the context of large numbers of human tumor samples that account for donor-to-donor and tumor subtype variability.
Discussion
We provide the first identification of the human ortholog of the murine BORG lncRNA, and in addition to characterizing its locus, we further define the impact of changes in the expression of human BORG observed in large cohorts of human breast tumors. These analyses show that BORG is aberrantly elevated in primary human breast tumors compared to normal human mammary tissue, and importantly, shows significant variability in expression level in all breast cancer subtypes. As we had described for the murine BORG RNA, higher expression levels of human BORG in tumors are also associated with the induction of invasive, metastatic, and “stemness” gene signatures commonly observed in metastatic tumors [
38,
40,
41,
42]. Accordingly, a significant fraction of genes differentially expressed in BORG-overexpressing murine TNBCs show concordant expression changes when high BORG-expressing human tumors were compared to their low BORG-expressing counterparts. It is noteworthy that BORG, like many other studied lncRNAs, is poorly conserved in its primary sequence [
23,
24,
25], having only two relatively short (300-400 nucleotides long) stretches of evolutionarily conserved sequences in its genic region. However, as observed for many other poorly conserved lncRNAs [
26,
27], the function of BORG in the human and mouse TNBC tumors is strongly conserved.
Our study of the protein interactome of the mouse BORG indicates that it binds TRIM28/KAP1, an E3 Sumo ligase known to be involved in regulation of pluripotency-related pathways [
38,
54,
55]. Interestingly, in both mice and humans, increased expression of BORG is associated with increased breast cancer stem cell characteristics in the tumors. Indeed, aberrant expression of BORG in mouse and human breast cancers was associated with acquisition of breast cancer stem cell-like properties and upregulation of known breast cancer stem cell markers, such as NANOG, ALDH1 and ITGA6/
α6Integrin/CD49f. Collectively, these findings indicate that induction of stemness properties in breast cancers is a conserved function of BORG, thereby explaining the observed association of BORG with metastatic properties in tumors.
Our analysis comparing high versus low BORG-expressing tumors indicate that across all breast cancer subtypes, higher BORG levels were associated with enhanced stem cell/progenitor phenotypes and increased expression of genes that are typically associated with basal tumors. This finding points to the presence of a core set of pathways which are induced in all breast cancer subtypes in association with the elevated level of BORG. On the other hand, several key oncogenic pathways associated with aggressive breast cancer behavior, such as EMT and invasive signatures, were either uniquely or much more prominently induced in high BORG-expressing basal/TNBC tumors compared to other breast cancer subtypes analyzed in this study. Taken together, these findings point to a subtype-specific impact for increased BORG expression, thereby creating a nuanced picture of BORG function in breast cancer with potential clinical and therapeutic relevance.
Another consequence of the strong similarity of the transcriptomic signature of enforced BORG expression in murine TNBC models and the gene expression pattern observed when high and low BORG-expressing TNBC tumors are compared is proving a causative role for BORG in inducing at least a fraction of the gene expression patterns observed in high BORG tumors. Over 70% of genes differentially expressed in mouse TNBC cells engineered to overexpress BORG were concordantly differentially expressed in high BORG-expressing human TNBC tumors. Further, BORG-expressing mouse TNBCs induced an overlapping set of cellular pathways when compared to high BORG-expressing tumors. We showed that a significant fraction of the transcriptomic signature associated with aberrant BORG expression is mediated through induction of expression and/or activation of a handful of transcription factors, including EMT- and pluripotency-regulating factors such as OCT1, MEF2A and MEF2C. It is plausible that BORG, in association with its complement of RNA-binding proteins, may directly affect chromatin modification or transcriptional activity at MEF2C and OCT1 loci, thus setting a pro-metastatic, pro-invasiveness program into motion. Taken together, the above results point to the existence of a human ortholog for BORG RNA, and a key pro-metastatic function for the human BORG. By establishing the overall importance of BORG in activating pro-metastatic cellular pathways and defining the inter-donor and subtype variability of BORG’s effect, these results indicate its unique suitability not only as a target for development of therapeutic agents, but also as a biomarker to guide the choice of therapeutic strategy.