1. Introduction
Abnormal cell growth and series of enabling mutations creates clump of cells that are plastic, genetically malleable and could potentially breach the tissue barrier and spread to secondary sites within the body, a condition known as cancer. Cells multiplies via cell division and cells die due to senescence or when damaged or harbors irreparable DNA damage. The process that intrinsically induces cascade of events of cell death is known as programmed cell death or apoptosis. When cell cycle gets disrupted or the mechanism of programmed cell death fail to initiate death of cells with critical mutations or unrepaired DNA, unregulated, abnormal cell growth can result in formation of lump of tissues in the body known as tumors. Tumors can occur as benign (not malignant) or cancerous. The cancerous or malignant tumors can invade the neighboring tissues [
1]. Cancer is the second most common cause of death worldwide [
2].
Cancer Stem Cells (CSCs) are a group of self-renewing cells, which has the strong tumorigenic potential and unlimited proliferation capacity. CSCs originate from either differentiated cells or adult tissue of resident stem cell at the stage of tumor onset [
3]. Tumor is a heterogeneous mixture of cells with unsynchronized cycling cells that are in different phases of cell cycle. Cells within the tumor mass also differ in terms of mutation, plasticity, level of dedifferentiation and response to stress. Compared to other cells inside a tumor, CSCs are more resistant to therapies and has great adaptability based on their surrounding environment [
4] and these cells are much more dedifferentiated. Because of the significant role in resistance to tumor chemotherapy and radiation therapy, as well as tumor recurrence, CSCs has gained much attention [
5]. These cancer stem cells maintain their self-renewal properties through different signaling pathways like TGFB signaling, Wnt/B catenin signaling, Notch signaling, Hedgehog signaling etc [
6]. Chemotherapy or radiotherapy may not always completely destroy the CSCs, after treatment a small subset of residual CSC can survive, which can cause cancer relapse and ultimately lead to invasiveness and therapy resistance [
7]. CSCs induce relapse, metastasis, radiation resistance and multidrug resistance, due to their capacity to remain arrested in G0 phase of cell cycle, and enter cell cycle at opportune moment. Ever since their discovery in leukemia in 1994, CSCs have been viewed as potential target for cancer treatment. There are many intracellular and extracellular factors that control the activity of CSCs, so these factors can serve as drug target [
8]. Thus, in the past several years CSC targeted therapy gained increasing interest from research community, especially as potential target in aggressive metastatic cancer.
The most frequent type of cancer in women is breast cancer, following skin cancer. In 2020, more than 2 million new cases are registered globally. Based on GLOBOCAN 2020 data, breast cancer ranks fifth globally in terms of cancer-related fatalities [
9]. Majority of the patients get to know about their disease in routine screening, other than that rest of the cases are discovered from breast lump, change in the sizes or discharging of nipple. Genetic mutations and DNA damage, which further activates the secretion of estrogen, are the cause of breast cancer [
10].
Based on GLOBOCON 2018 data, in the category of lung cancer, 2.09 million new cases and 1.76 million deaths are registered globally. In both the genders, lung cancer is most common cancer and also the leading cause of cancer-related deaths. Specially in women, this is the third most frequent cause of cancer and second most lethal cancer [
11]. 90% of the lung cancer cases are caused due to smoking. Men are affected more compared to women [
12]. Ovarian cancer, another frequent type of cancer in women ranks eighth in terms of cancer related deaths, having less than 45% five-year survival rate [
13]. Ovarian cancer can strike anyone at any age; persons over 50 are more likely to have it [
14]. Right now, majority of the clinical studies concentrated on targeted methods, such as latest attempts at immune therapies.
Use of bioinformatics for
in silico studies in translational drug delivery is becoming more and more important in both pharmaceutical industries and academics [
15]. The roots of bioinformatics can be found more than a half a century ago, although DNA sequencing wasn’t started at that time [
16]. The rising amount of data generated through the drug discovery process can now be computationally utilized to address major difficulties. In this present study we focused on identifying CSC linked genes, expression of these genes, role as prognostic biomarker and potential target for drug intervention in breast, lung and ovarian cancer through in depth
in silico analysis.
4. Discussion
CSC are small subpopulation of cancer cells, which have the capability of self-renewal and leads to the resistance to chemotherapy. Different signaling pathways like JAK-STAT, Hedgehog, TGFB, Notch, Wnt-B catenin, VEGF signaling pathways regulate the development of CSC. Different inhibitors, siRNA, monoclonal antibodies, enzyme inhibitors that intervenes with those signaling pathways can target the cancer stem cells. There are clinical trials ongoing targeting CSCs [
31]. Breast, lung and ovarian cancer, originate from the epithelial cells and turn into most aggressive types of cancer by inducing the EMT [
32,
33,
34]. There are different strategies available for targeting the stem cell of these cancers, although, most of these strategies are yet to be clinically approved and plasticity and biology of CSCs need to be explored in more detail for identification of novel targets. In this present study, we first downloaded the CSC gene expression datasets of these three different types of cancer and analyzed genes that are associated with CSCs from different data sets of respective cancer types in order to identify genes that can be important intervention target in these cancers. We tried to identify important genes that are expressed in cancer stem cells of these three types of cancer. Our analysis revealed that there are no common genes between these three types of cancer stem cells, implying that the genes of cancer stem cells are specific for cancer tissue type. Next, we identified common important up and down regulated genes in cancer stem cells of particular type of cancer i.e. breast, lung and ovarian cancer and investigated their relation with different pathological stages of respective type of cancer and also their prognostic value. Finally, we identified several potential candidate drugs that can target those particular genes of cancer stem cells. We validate the identified genes in TCGA data set of respective cancers and also carried out the survival analysis to identify the prognostic significance of these genes. Lastly, we analyzed these selected genes for drug targets, focusing on candidate drugs that can modulate the function of these genes in particular cancers and reduce the CSC mediated cancer proliferation and metastasis.
From our analysis, we identified APOC1(Apolipoprotein C1) gene in breast CSCs, SLC44A5 and CAV2 genes in lung CSCs and MECOM, SLC6A12, PRRG4, ABHD11, DUSP1(MKP-1), ADCY7, AKAP12, TPM2, FLNC in case of ovarian CSCs as some of the most important genes. Out of these twelve genes, eight genes (APOC1, SLC44A5, CAV2, PRRG4, ADCY7, AKAP12, TPM2, FLNC) show significant overall prognostic value in respective cancers.
APOC1 is recently been identified in various types of cancers, such as, colorectal cancer [
35], cervical cancer [
36] and also in breast cancer [
37]. In breast cancer APOC1 induce cell proliferation and metastasis by inducing MAP kinase pathway. In our analysis we have identified APOC1 is upregulated in human breast CSC and also in tumor samples in TCGA database (
Figure 2A). When we analyzed the effect of expression status of this gene on overall survival, we found that higher expression of this gene is linked to worse overall survival (
Figure 3A). We further analyze the relation with pathological stages of breast cancer with expression of this gene, we found that its expression is significantly increased in different stages from the normal condition and also nodal metastatic status increased in N0, N1, N2, N3 stages of breast cancer. Protein expression data from HPA shows that in normal breast tissue it is not detected but breast cancer tissue shows moderate expression (
Figure 6A). TIMER analysis reveals that in breast cancer patients, expression of APOC1 is positively corelated with the infiltration of M2 macrophages, Neutrophils as well as Dendritic cells which contributes in tumor promoting environment in breast cancer (
Figure 7A). We identified drugs that can decrease the expression of APOC1 are Valproic acid, Cyclosporin, Quercetin, Isotretinoin amongst several others.
Differential expression of SLC44A5 (Solute Carrier Family 44 Member 5) and CAV2 (Caveolin 2) are found in lung cancer stem cells, where SLC44A5 shows upregulated expression and CAV2 shows downregulated expression. SLC44A5 is one type of choline transporter which plays important role in inducing cell viability and invasion of hepatocellular cancer cells and also inhibits apoptosis [
38]. In lung cancer, we could not find any study regarding the role of this gene. From the TCGA data we found that its expression is significantly upregulated in lung adenocarcinoma sample compared to normal samples and its higher expression significantly associated with decreased overall survival of lung cancer patients. The expression of SLC44A5 is significantly increased in the various pathological stages of lung adenocarcinoma patients but there are no significant changes within the stages of lung adenocarcinoma (
Figure 4B). Its expression also positively corelated with the nodal metastatic status of patients of lung adenocarcinoma. Several drugs like Cyclosporine, Sunitinib, Bisphenol A and Quercetin can decrease the activity or expression of this gene. Caveolae are small infoldings of the cell membrane and involved in endocytosis, cellular trafficking and also signal transduction. CAV 2 is involved in the inhibition of TGFB signaling pathway and reduce the proliferation of mouse lung endothelial cells [
39]. From the TCGA data, we found that its expression is downregulated in lung adenocarcinoma patients and its higher expression is significantly related to increased overall survival, indicating that it could be an important prognostic marker for the patients of lung adenocarcinoma. Also, the expression of CAV2 is significantly decreased in the pathological stages and various nodal metastatic condition from the normal healthy individuals. Expression of CAV2 protein is found to be highly and moderately expressed in Normal lung tissue and lung adenocarcinoma patients respectively (
Figure 6C). Immune infiltration analysis shows that infiltration of M2 macrophage, neutrophil and dendritic cells positively corelated with the expression of CAV2 but negatively corelated with the expression of SLC44A5 in case of lung adenocarcinoma patients. As this gene is downregulated in lung cancer, therefore, we tried to identify drugs that could increase its expression leading to perturbation in cancer proliferation. Drugs that can increase the expression of CAV2 includes Doxorubicin. Cisplatin, Tretinoin and Valproic acid.
Next, PRRG4 (Proline Rich and Gla Domain 4), ADCY7 (Adenylate cyclase 7), AKAP12 (A Kinase Anchor Protein 12), TPM2 (Tropomyosin 2), FLNC (Filamin C) genes are identified in ovarian cancer stem cells. PRRG4 is upregulated and ADCY7, AKAP12, TPM2 & FLNC are downregulated in ovarian cancer stem cells when compared with the non- stem cells. PRRG family proteins are involved in cell signalling and regulation, although the specific function of PRRG4 is not fully elucidated. Zhang
et al., found that upregulated expression of PRRG4 induces breast cancer metastasis through downregulation of Robo1 via NEDD4 [
40]. In another study it has been found that PRRG4 regulates the mitochondrial function and migratory behaviour of breast cancer cells by Src-STAT3-POLG axis [
41]. In our study this gene is found to be upregulated in ovarian cancer stem cell but higher expression of this gene is significantly linked with increased overall survival which shows a paradoxical significance of this gene in ovarian cancer patients. HPA data shows PRRG4 is highly expressed in ovarian cancer whereas not detected in case of normal ovary tissue (
Figure 6D). PRRG4 induces the M2 macrophage, neutrophil and dendritic cell infiltration in the ovarian cancer microenvironment. Valproic acid, Acetaminophen, Paclitaxel can reduce the activity of the gene in cancer.
ADCY7, AKAP12, TPM2 and FLNC genes show reduced expression in ovarian cancer stem cells and from the TCGA data. ADCY7 is the membrane protein of adenylate cyclase family and induce the signal transduction. There are reports that indicate ADCY7 is associated with poor prognosis of myeloid leukaemia patients [
42]. AKAP12 act as a tumor suppressor gene in different types of cancer and can inhibit the proliferation, migration, invasion and angiogenesis of cancer cells as well as block cell cycle through the activation of protein kinase C [
43]. Tropomyosins are the actin binding proteins that regulates the force generation and sensing stiffness. Downregulated expression of TPM2 has been found to enhance the proliferation and migration of colorectal cancer cell [
44] and also promotes the breast cancer metastasis and chemoresistance [
45]. FLNC is an actin crosslinking cytoskeletal protein that maintains the cellular morphology. In a study of gastric cancer, Filamin C supress the metastasis and angiogenesis by reducing the expression of MMP2 [
46]. Paradoxically, from the Kaplan-Meier plot, we found that high expression of ADCY7, AKAP12, TPM2 & FLNC genes are significantly associated with decrease overall survival, indicating the need for detail study of the function of these genes in CSCs, tumor microenvironment and survival of cancer cells. There is lack of sufficient data for these genes regarding relation with the different pathological stages and nodal metastatic status of ovarian cancer patients. However, when we investigate the immune infiltration status of these genes in ovarian cancer, we found that ADCY 7 expression positively related the infiltration of M2 macrophage, neutrophil and dendritic cells but AKAP12 expression is positively corelated with the infiltration of only neutrophils. TPM2 expression also positively related with the M2 macrophage and neutrophil infiltration but not significantly related with dendritic cell infiltration whereas expression of FLNC is not significantly related with infiltration of M2 macrophage and neutrophil and negatively related with dendritic cell infiltration in ovarian cancer. From immunohistochemical images we have found that ADCY7 expression is not detected in normal ovary and moderately expressed in cancerous tissue. AKAP12 and TPM2 are highly expressed in both normal ovary and ovarian cancer. However, expression of FLNC is not detected in normal condition and in ovarian cancer it shows lower intensity. Drugs like Doxorubicin, Quercetin, Sunitinib, Cyclosporine, Fulvestrant can target the AKAP12 and cyclosporine, valproic acid, Acetaminophen can increase the effect of ADCY7. Also, Doxorubicin, Valproic acid, Bisphenol A can effectively target the TPM 2 and FLNC gene of ovarian cancer stem cells. From our study, Valproic acid, Sunitinib and Quercetin emerged as drug candidates that can target most of the CSC associated genes or signalling pathways in breast, ovarian and lung cancer. It is important to note that Quercetin has been actively explored as potential candidate drug in cancer but so far clinical studies have failed to replicate same level of impact shown in preclinical and in vitro studies [
47] implying that literature and high throughput study based drug data bank may identify many well researched candidate drugs that failed to replicate promising results revealed in vitro/pre-clinical studies in clinical set up and careful screening must be conducted to identify novel promising candidate drugs or small molecules.