1. Introduction
Traditional medicine has significantly contributed to modern pharmaceuticals by providing valuable leads for the creation of effective drugs to eradicate diseases, indicating the potential of natural products in the realm of drug development [
1]. Furthermore, phytoconstituents (PCs) derived from natural sources offer the greater advantage of accessibility and fewer or no side effects when compared to synthetic drugs. The integration of natural medicinal resources into drug discovery holds promise in addressing the eradication of antibiotic resistance [
2].
Potentilla nepalensis belongs to the Rosaceae family, also known as Nepal cinquefoil, and is native to the Himalayan regions of Nepal and Tibet, finding its habitat in alpine and subalpine environments. Despite being admired for its captivating flowers and often cultivated as an ornamental plant,
p.nepalensis is esteemed for its medicinal value. The plant’s therapeutic potential is closely linked to its abundance of secondary metabolite contents, including high amounts of phenols, flavonoids, and terpenoid-derived compounds which were crucial for its therapeutic nature [
3].
P. nepalensis has been traditionally employed for its medicinal properties in various contexts. It has found use in wound healing and addressing skin-related diseased conditions, in addition to aiding digestion and promoting gastrointestinal well-being. Its potential as an anticarcinogenic agent has also been recognized [
4], and its antioxidant activity contributes to safeguarding cells from oxidative stress. Moreover, the plant possesses anti-inflammatory and analgesic properties, rendering it effective in alleviating pain and mitigating inflammation [
5]. Within Tibetan traditional medicine, some species of the Potentilla genus, including
P.nepalensis, have been utilized to treat ailments such as asthma, headache, dysentery, and common cold. Root extracts of
P.nepalensis have exhibited promising anti-cancer [
6] as well as anti-microbial activity [
7].
In our present study, we delve into the exploration of n-hexane and methanolic extracts obtained from both the root and stem parts of the
p.nepalensis using the Gas Chromatography–Mass Spectrophotometry (GC-MS) method determined in our previous research [
3]. Our investigation aims to unveil the extracts’ impact on human genes through an integrated approach of network pharmacology, molecular docking, and dynamic simulation methodologies.
This study methodology encompasses several sequential steps. Firstly, we retrieved the pertinent information regarding the phytoconstituents (PCs) from the PubChem database. These PCs were scrutinized to identify respective inducing genes exhibiting significant pharmacological activity greater than 0.8. Subsequently, we generated protein-protein interaction networks for each extract using string DB. Our focus was then directed towards identifying the most important gene within these networks. Moreover, we performed molecular docking using Autodock Vina software. To facilitate this, we retrieved key proteins identified within the networks from the Protein Data Bank (PDB). Then Molecular dynamics simulations (MDS) and Molecular mechanics with generalized Born and surface area solvation (MMGBSA) free energy calculations of the complexes that exhibited better binding affinity values were calculated. The detailed information on the methodology employed in this research work was presented pictorially (
Figure 1).
2. Results
2.1. Retrieving the Compounds
The n-hexane and methanolic extracts from the root and stem parts of the
p.nepalensis using Gas chromatography–Mass spectrophotometry (GC-MS) method reported earlier in our earlier study outcomes a total of forty compounds, with ten compounds being identified in each extract from both parts of the plant. All these compounds’ information such as their SMILES format, corresponding PubChem IDs, and 2D structures were retrieved from the PubChem database. For convivence, these compounds were classified into four categories, based on the extraction method applied to the plant part. Compounds designated as 1a-1j represent the n-hexane extract from the root. Similarly, compounds labeled as 2a-2j were methanolic extracts from the stem. Compounds 3a-3j were from the n-hexane root extract, while 4a-4j compounds denoted were part of the methanolic shoot extract of
p.nepalensis in
Tables S1–S4.
2.2. Inducing genes of the PCs
A comprehensive assessment of gene induction revealed a total of 764 genes that were induced. The derived root part methanolic (MR) extract led to the induction of 149 genes. Similarly, the shoot methanolic (MS) extract resulted in the induction of 217 genes. Furthermore, the n-hexane extract obtained from the root (NR) triggered the induction of 277 genes. Lastly, the n-hexane extract from the stem (NS) extract induced 121 genes. These findings were comprehensive understandings of gene induction associated with PCs of p.nepalensis.
2.3. Enrichment analysis
To ensure precision and relevance, the top ten descriptive terms were considered from the entire gene ontology study results. This curation was performed because of the high accuracy associated with these terms in predicting efficiently their gene attributes (Excel S1). Among this maximum number of genes involved in enrichment, descriptors were described, providing insights into the genes involved in attribute description.
The biological process (BP) indicates which gene product is involved in biological work [
8]. From the extracts, a maximum number of genes' biological process were predominantly around two BPs; nucleic-acid templated transcription positive, and DNA-templated transcription regulation in
Table 1;
Figures S1A–S4A. The MS extract exhibited a pronounced involvement in these two BPs.
Molecular function (MF) refers to gene product activity [
8]. A converged MF emerged within the induced genes from the MR and NR extracts. Especially, in protein homodimerization and protein serine/threonine activity were common denominators, effectively linked between these extracts. Expanding on this, MS and NS showed similar attributes in their induced genes. In addition, MF has enriched with MR-NR and MS-NS extract-induced genes. DNA binding in NR, DNA binding transcription activator activity, phosphatase activity, and oxidoreductase activity in NS-induced genes were additional MF results in
Table 1;
Figures S1B–S4B. This interconnectedness suggests the potential areas of research focus within the study.
Cellular components (CC) represent the location of the gene in the cell [
8]. The commonality shared between the MR and NR extract-induced genes is reflected in their location within the intracellular membrane-bounded organelle and nucleus. Moreover, the MR extract demonstrates an additional secretory granule lumen. Moving to the MS extract-induced genes are mostly found in the nucleus and azurophil granule lumen. As for NS extract, the induced genes exhibit a more diverse localization in the intracellular organelle, endoplasmic reticulum, and secretory granule lumens in
Table 1;
Figures S1C–S4C.
The genes targeted by the MR extract are involved in several significant pathways, such as Vegfa-Vegfr 2 signaling, leptin signaling, micro-RNAs in cardiomyocyte hypertrophy, and B-cell receptor signaling pathways. Nuclear receptors meta-pathway and Vitamin-D receptor pathways were common in the MS and NR targeted genes, and additionally the Osteoblast differentiation pathway from the MS extract-induced genes. Distinctly, the NS extract-induced genes contribute to the drug addiction pathways and melanoma pyrimidine metabolism pathway in
Table 1;
Figures S1D–S4D.
Upon scrutinizing the diseases associated with the genes induced by all extracts, a notable involvement in cancer-related diseases, especially in the context of neoplasm metastasis. This underscores the significance of these genes in the context of cancer progression and dissemination. Effectively summarized in
Table 1;
Figures S1E–S4E.
2.4. Protein-protein interactions
Protein-protein interaction networks were generated by interlinking the induced genes using String DB resulting in the network containing called nodes (proteins) and edges called (interactions). The interactions have diverse sources from the experimental, text mining, gene fusion, co-expression, neighborhood, and databases. To achieve better insight into the constructed PPI networks, network topological parameters were employed, which include Degree centrality, average shortest path length, clustering coefficient, closeness centrality, and betweenness centrality on each gene’s interaction proteins in the network in
Table 2; Excel S2.
In evaluating the four PPI network topological parameters descriptors, Degree centrality (DC) represents the number of interactions within the network. The average shortest path gauges the minimum number of edges required to connect between the nodes. The clustering coefficient (CC) measures the interconnections required to form a triangular sub-cluster. The shortest path numbers in a network can be measured between centrality (BC) values, which act as a transitional in switching the information. Closeness centrality (C. cen) signifies the closest node to the rest of the nodes in the network. Accounting for these descriptors, five actively interacting genes emerge as significant. These include HSPCB and NFKB1 from MR extract-induced genes in
Figure 2. TP53 is a central figure among MS, NR, and NS extracts-induced genes, illustrated in
Figure 3,
Figure 4 and
Figure 5. These pivotal proteins are demarcated with centrally positioned rectangular boxes, highlighted with a yellow background and blue fonts were the proteins that played prominent roles in the network. Furthermore,
Table 2 indicates the network topological parameter values of these key proteins, further reaffirming their status as potent biomarkers sourced from the
p.nepalensis PCs. This evaluation paves into the critical genes and their connectivity, enhancing our understanding of the molecular level within the network.
Color applied based on the no. of interactions by each gene in the network. Rectangular shape, yellow back colored with blue font exhibiting 15 interactions; Octagonal shape, light blue back colored with green font exhibiting 12 interactions; Diamond shape, light green back colored with pink font exhibiting 11 interactions; Round rectangular shape, purple back colored with pink font exhibiting 10 interactions; Parallelogram shape, orange back colored with blue font exhibiting 9 interactions; Hexagonal shape, light green back colored with red font exhibiting 8 interactions; Ellipse circular shape, light blue back colored with red font exhibiting less than 8 interactions.
Color applied based on the no. of interactions by each gene in the network. Rectangular shape, yellow back colored with blue font exhibiting 25 interactions; Octagonal shape, light blue back colored with green font exhibiting 18 to 14 interactions; Diamond shape, light green back colored with pink font exhibiting 8 to 13 interactions; Ellipse circular shape, light brown back colored with green font exhibiting less than 8 interactions.
Color applied based on the no. of interactions by each gene in the network. Rectangular shape, yellow back colored with blue font exhibiting 20 interactions; Octagonal shape, brown back colored with green font exhibiting 11 to 15 interactions; Hexagonal shape, light pink back colored with pink font exhibiting 9 to 10 interactions; Round rectangular shape, orange back colored with purple font exhibiting 5 to 8 interactions; Ellipse circular shape, light green back colored with red font exhibiting less than 5 interactions.
Color applied based on the no. of interactions by each gene in the network. Rectangular shape, yellow back colored with blue font exhibiting 25 interactions; Octagonal shape, light blue back colored with pink font exhibiting 10 to 16 interactions; Hexagonal shape, light green back colored with red font exhibiting 7 to 9 interactions; Rectangular shape, light pink back colored with blue font exhibiting less than 7 interactions.
2.5. Molecular docking
To explore the interactions between the PCs and potential biomarker proteins, molecular docking analyses were conducted on the methanolic and n-hexane extracts from the root and stem parts of
p.nepalensis. The objective was to determine the binding affinity of the PCs concerning the proteins, namely p53 from the TP53 gene, heat shock protein from the HSPCB gene, and nuclear factor kappa light chain from the NFKB1 gene. In total 40 PCs were subjected, resulting in docking scores that indicate the strength of binding interactions in
Table S5. Among the array of PCs, five of them - 1b, 2a, 3a, 4a, and 4c stood out for resulting higher binding affinity with both p53 and heat shock proteins.
Illustrations,
Figure 6 and
Table 3 indicate the binding affinity, and amino acid involved in interaction types. The binding affinity of 1b with the p53 protein was -8.6 kcal/mol. This interaction involved a single conventional hydrogen bond with the Aser1503 residue. Furthermore, two alkyl bonds formed, connecting the alkyl ends of 1a with the alkyl groups of BMet1584 in the p53 protein. Seventeen π-alkyl bonds emerged, linking the π-alkyl groups of
4ATrp1495, ATyr1502,
2APhe1519, ATyr1523,
3BTrp1495,
3BTyr1502,
2BPhe1519, BTyr1523 with the π-orbitals of 1b. PC 2a demonstrated -8.0 kcal/mol of binding affinity with the p53 protein. AMet1584 participated in an alkyl bond formation with alkyl ends. Additionally, a set of twelve π-alkyl bonds connecting the π-alkyl groups of
2ATrp1495,
2ATyr1502,
2APhe1519,
2BTrp1495, BTyr1502,
2BPhe1519, BTyr1523 with the π-orbitals of 2a.
PCs 3a, 4a, and 4c exhibited impressive docking scores of -9.6, -8.7, and -8.2 kcal/mol with HSP protein. 3a engaged in nine interactions with the HSP residues. AMet98 is involved in three different bond types, including a sulfur bond, a π-Sigma, and an alkyl bond with 3a. Three additional alkyl bonds were formed between the alkyl ends of HSP residues (2ALeu107, AAla111) with 3a. The π-orbitals between APhe138 with 3a are involved to form a π -π Stacked bond in enchaining their stability. Two π-alkyl bonds merged between the π-alkyl groups of APhe138 and AVal150 and the π-orbitals of 3a. For 4a, three π-alkyl bonds and a π-π Stacked bond were formed with the APhe138 residue. Additionally, two π-alkyl and two π-π Shaped bonds were made by 4a with ATrp162. One π-alkyl and two alkyl bonds with AMet98, followed by an alkyl bond with AVal186. Another π-alkyl and one alkyl bond connected 4a and Aleu107. Lastly, 4c shows two CHB interactions with the ATrp162 residue. Six π-alkyl bonds emerged, connecting the π-alkyl groups of APhe22, APhe170, 2ALeu107, AMet98, and AVal150 with the π-orbitals of 4c. Furthermore, an alkyl between the alkyl groups of 4c and AIle26 formed. Two π - π Stacked (2APhe138) and three π - π T Shaped (ATyr139, 2ATrp162) by the 4c. These results provide binding modes and interactions of PCs 3a, 4a, and 4a with the HSP protein, contributing potential roles in therapeutic applications.
2.6. Molecular dynamics simulations and MMGBSA
To study the dynamic nature and protein-ligand stability of the complex in the water molecular dynamics simulations were employed. It provides insights into interaction informatic at an atomic level. The current study, from the results of molecular docking best pose docking score complexes were taken for MD simulations and MMGBSA calculations. A total of five complexes were employed for MDS at 300ns, they are i. p53 with 1b, ii. p53 with 2a, iii. HSPCS with 3a, iv. HSPCB with 4a, and v. HSPCB with 4c. These complexes were pre-processed into three stages along with energy minimization, NPT, and NVT equilibrium. Root-Mean square deviations (RMSD), Root-Mean square fluctuations (RSMF), and MMGBSA values were analyzed by the trajectories.
The Computed trajectories RMSDs of the complexes are represented in
Figure 7. For p53 protein bound with 1b (p53+2b) and 2a (p53+2a) in
Figure 7A, initially to the complex p53+2a, deviations were higher compared to the p53+1b. p53+1b complex maintained constant deviation not exceeding greater than 5Ả. Both complexes reached equilibrium at 6000 frames (~130 ns) at 2-3 Ả. Both ligands exhibit stable complexes and did not leave the binding site during the whole simulation.
For the complexes, HSPCB protein with 3a (HSPCB+3a), with 4a (HSPCB+4a), and with 4c (HSPCB+4c) in
Figure 7B. Their MDS analysis represents the convergence achieved at the end of 100ns. At the beginning of the simulation, the deviations were increasing in three complexes but not exceeding greater than 3.5 Ả in HSPCB+4c. All the three complexes protein-ligand (s) simulations were stable between 2-3 Ả. This indicates the stability of the complexes and without leaving of ligands are exhibited during simulation from their binding regions.
RMSF by bfactor was employed to determine residue displacement in
Figure 8. The RMSF of complexes, p53+1b, and p53+2a exhibited similar RMSF without major fluctuations in
Figure 8A. Suggesting that the binding site was not affected differently by any of both ligands. The three complexes, are HSPCB+3a, HSPCB+4a, and HSPCB+4c. Their RMSF in
Figure 8B, shows that in the HSPCB+4c, the residue 97 suffered considerable fluctuation, nonetheless not involved in the binding site cavity.
Free energy calculations like MMGBSA runs performed by the cpptraj module are employed in analyzing the trajectories of the five complexes. p53+1b and p53+2a, frame 6000 was declared as starting with 5 frames intervals. For HSPBC+3a, HSPBC+4a, and HSPBC+4c, frame 4000 was also started with 5 frames intervals. Frames taken for analysis relied on autocorrelation for every complex. Energy fluctuation calculated by MMGBSA demonstrated the difference between p53+1b and p53+2a complexes since p53+1b revealed better binding energies. Nonetheless, at the end of the simulation, these energies began to fall somehow equal to the p53+2a ligand in
Figure 9A. Both ligands are considered sufficiently thermodynamically favorable to the target receptor, the t-test for both complexes demonstrates a significant difference between energies in
Table S6. MMGBSA calculations demonstrated the ability to target HSPCB protein while maintaining thermodynamically favorable energies during the whole simulation. One-way ANOVA analysis demonstrates that ligand 3a is significantly different from 4a and 4c in
Figure 9B, nevertheless, 4a and 4c are not different from each other in
Table S7. The summary of the MMGBSA free energies results of the complexes were presented as boxer-whisper plots in
Figure 10.
3. Discussion
Considering the root and stem PCs by the n-hexane (NR and NS) and methanolic (MR and MS) extracts through Gas Chromatography-Mass spectrometry (GC-MS) on P. nepalensis, the present study has proceeded to identify the inducing genes/protein as biomarkers by enrichment analysis, protein-protein interaction networks, and molecular docking followed by MDS and MMGBSA studies.
Of the total 764 inducing genes in Text S1-S4, prediction from the DIGEP-Pred at Pa>0.8. Enrichment analysis was performed to find the functional annotations of the genes identified by descriptors of gene ontology, pathways, diseases, and drugs. Subjecting each extract to StringDB to build PPI networks.
Through the protein-protein interactions (PPI) network the gene-encoded proteins and their functions and interactions can be defined. In the network of PPI, proteins are represented by nodes and connecting lines referred to as edges. The current study intended to comprehend the interplay among the genes. The network topological analysis of the more interacting genes in the network (in
Table 2). DC of HSPCB and NFKB1 is 15 of MR extract-induced genes, lesser than the TP53 from the rest of the extracted-induced genes. Three proteins have average shortest path values in the range of 1.75 to 3.02 means the connected nodes are interconnected nearer compared to the other genes. The CC of TP53 is relatively lesser than HSPCB and NFKB1. The C. cen values of TP53 (0.53, 0.42 and 0.57) are little far with GSPCB and NFKB1 (0.33 and 0.34). TP53 has a greater BC value of 0.56, 0.58, and 0.59, compared to HSPCB and NFKB1 of 0.18 and 0.33. Targeting these three genes will have a greater significance in the network. Hence, three genes have been identified as potential biomarkers for the PCs of
P. nepalensis.
HSPCB belongs to the heat shock protein 90 family, considered a pseudogene like a heat shock 90kD protein 1 beta, that has a prominent role in signaling, gastric apoptosis, protein folding, and inflammation. Previous studies have reported its involvement in various cancer cell lines as potential targets such as breast cancer [
9] and Ovarian cancer tissues [
10].
Nuclear factor kappa B subunit 1 (Nf-kB1) known as the transcription regulator activates in the nucleus by translocation into it by stimuli substances (cytokines, and oxidant free radicals) followed by the transcription [
11]. Problems in Nf-kB1 activation have been linked to numerous inflammatory diseases. While continuous inhibition affects immune cell development or delayed cell growth [
12]. Widely used as a biomarker in diabetic cardiomyopathy [
13], and cancer [
14]. Thus, inhibition of the Nf-kB1 helps to reduce the anti-inflammatory activity of targeting Nf-kB1 in the associated pathways [
15]. Designing the antagonist to the Nf-kB1 changes the central gene expression in the leukemic process [
16].
TP53 gene on transcription gives p53 which is a tumor suppressor protein that is involved in the regulation of cell division in an uncontrolled way [
17]. p53 protein is widely known for its action in cancer regulation, well renowned as a target for various cancer types because of its involvement in earlier causes of cancer [
18]. At present few drugs like piperdinone analogs, spirooxindole, nutlin, and isoquilinone are actively using inhibitors against the p53-Mdm2 complex [
19]. Wide research studies are being conducted in designing the small molecules (ligands) targeting mutated p53 protein to restore tumor-suppressing activity [
20].
Considering the above-mentioned research study statement and our study purpose correlated in PCs of
P. nepalensis found biomarkers can effectively exhibit their activity upon binding. Thus, finding the interaction between the PCs and identified biomarker proteins subjected to the docking studies resulted in PCs 1b and 2a, exhibiting better binding scores with p53 protein and PCs 3a, 4a, and 4c with HSPCB in
Table 3. Most of the interactions made by the PCs with the p53 and HSPCB are hydrophobic interactions. Thus, designing these PCs with suitable chemical functional groups that could make hydrogen bonds might increase the binding stronger. MDS of these complexes, inferring RMSD in
Figure 7 and RSMF in
Figure 8 have better stability between them without higher deviations that are between 2-3 Ả. MMGBSA calculations in
Figure 9, demonstrated the ability to target HSPCB protein while maintaining thermodynamically favorable energies during the whole simulation.
4. Materials and Methods
4.1. Retrieving the Compounds
PubChem IDs, 2D structures, and Simplified Molecular Input Line Entry System (SMILES) formats of the PCs found in
P. nepalensis were retrieved from the PubChem database (
https://pubchem.ncbi.nlm.nih.gov/) [
21]. These compounds were earlier identified from the root and stem parts using methanolic and n-hexane extracts GC-MS method [
3]. PubChem is an open-source chemical compounds library with a description.
4.2. Inducing genes of the PCs
The SMILES format of the PCs was utilized to identify induced genes, meeting the criteria of having pharmacological activity greater than 0.8 (Pa>0.8). This analysis was performed in the DIGEP-Pred web server (
http://www.way2drug.com/GE) [
22], which is built based on the Prediction of Activity Spectra for Substances (PASS), calculated by using leave-one-out cross-validation and depended on both mRNA data and protein data.
4.3. Enrichment analysis
Enrichment analysis for the human genes induced by the PCs was conducted using Enrichr (
https://maayanlab.cloud/Enrichr/), by applying a false discovery rate (FDR), and a significant (P) value, both set to be less than 0.05 (FDR<0.05; P-value <0.05) [
23]. Duplicate genes were removed, resulting in a refined gene set that was used as input to ascertain the respective gene's biological process, molecular function, cellular components, pathways, diseases, and available drugs in the market.
4.4. Protein-protein interactions
The interactions among the predicted induced gene proteins were exploited using the STRING DB (
https://string-db.org/) platform. A high confidence threshold of 0.7 was employed to construct the networks. These networks are formed from various sources, including text mining, experimental data, databases, co-expression patterns, neighborhood analysis, gene fusion events, and co-occurrence patterns [
24]. Subsequently, the resulting network was analysed by sending it into Cytoscape V3.10.0 [
25]. With the use of the Analyzer plugin, a protein-protein interaction network was established based on the gene set. These networks underwent further analysis to identify key gene’s network topological parameters including Degree centrality, average shortest path length, clustering coefficient, closeness centrality, and betweenness centrality. These parameters offer valuable insights into the interactions and relationships among the genes within the network.
3.5. Molecular docking
We employed our previously performed molecular docking methods in defining the binding affinities between the protein-ligand complexes [
26]. The tertiary structures of the TP53-binding protein (PDB ID: 6MXY) (
https://www.rcsb.org/structure/6MXY) with K6M ligand (
N-[3-(tert-butylamino)propyl]-3-(trifluoromethyl)benzamide), Nuclear factor NF-kappa-BP (PDB ID: 3GUT) [
27]and Heat shock protein (PDB ID: 1UYM) with PU3 ligand
(9-butyl-8-(3,4,5-trimethoxybenzyl)-9h-purin-6-amine) [
28], were retrieved from the RCSB PDB (
https://www.rcsb.org/). All these structures were determined experimentally through the X-ray diffraction method with resolutions of 1.62, 3.59, and 2.45.
The PCs' SMILES format obtained from PubChem and protein structures sourced from the PDB were not suitable for docking. To address this, the ligands were prepared by adding hydrogens at pH 7.4, generating the 3D geometries, and the application of the MMFF94 forcefield using the Open Babel software [
29], a tool that converts the file to various chemical formats. For the proteins, the extra bound ligands and water molecules were removed in the Drug Discovery studio. Subsequently, the protein structures were repaired by adding missing atoms, polar hydrogens, and Gasteiger charges using the Autodock software [
30].
Grid parameters play a crucial role in defining the precise interacting positions and binding affinity of the ligands. For instance, the bound chemical compounds, such as K6M ligand
of TP53 protein (X=-10.80, Y=26.77, and Z=-3.52), Heat shock protein with PU3 ligand (X=3.60, Y=11.13, and Z=24.75) and the NFKB whole structure (X=28.80, Y=-23.60, and Z=58.23), were considered as grid centers with gird sizes considered as grid parameters in the present study to dock through the AutoDock Vina software [
31]. This strategic utilization facilitated the accurate protein-ligand interactions and binding affinities in the context of this study.
3.6. Molecular dynamics simulations and MMGBSA
Computational molecular simulations for MD were performed on an Ubuntu 22.04 LTS workstation equipped with an Intel core i7-13700k processor and an NVIDIA RTX 4080 graphics card. Amber 22 and AmberTools23 [
32] packages were used to carry out all simulations and trajectory analyses. The ligands bound to p53 and HSPCB from docking complexes were separated, and any missing hydrogens were added using UCSF ChimeraX [
33]. The ligand parameters were assigned using General Amber Force Field (GAFF) [
32], and these were calculated through Antechamber under the AM1-BCC charge method. The Amber force field FF19SB [
34] was employed, along with the TIP3P water model. Neutralization was achieved by adding sodium and chlorine ions, and these parameters were assigned to the respective files for ligands, proteins, and solvents. To balance the system, a total of 30,000 steps of minimization were performed, followed by an increase in temperature to 300 K and 1 atm pressure equilibrium for 200 ps of simulation. A production simulation of 300 ns was then conducted by PMEMD software, accelerated by CUDA. Molecular Mechanics–Generalized Born Surface Area (MMGBSA) [
35] analysis of trajectories was employed using module MMPBSA.py for binding energies calculation. Data visualization and figures were carried out by xmgrace [
36] and R [
37].
5. Conclusions
P. nepalensis renowned for its therapeutic applications, with its composition of PCs identified in our earlier studies. In this study, integrative computational methodologies were employed, leading to the identification of 764 inducing genes. Through the PPI networks analysis, three biomarkers were revealed: TP53 from MS, NR, and NS-induced genes; HSPCB and Nf-kB1 from MR-induced genes. Among the 40 PCs studied, two PCs 1b (MR) and 2a (MS), exhibited superior binding scores with the p53 protein demonstrating values of -8.6, and -8.0 respectively. Additionally, three PCs, 3a (NR) 4a, and 4c (NS) notable binding scores with HSP protein of -9.6, -8.7, and -8.2. MDS and MMGBSA revealed these complexes remain stable without significant deviations, featuring favorable free energy values. The biomarkers identified in this study play significant roles in numerous cancer types.
Supplementary Materials
The following supporting information can be downloaded at the website of this paper posted on Preprints.org. Figure S1: Root methanolic extract PCs enrichment analysis, A. Biological process, B. Molecular function, C. Cellular components, D. Pathways, E. Diseases, F. Drugs; Figure S2: Stem methanolic extract PCs enrichment analysis, A. Biological process, B. Molecular function, C. Cellular components, D. Pathways, E. Diseases, F. Drugs; Figure S3: Root n-hexane extract PCs enrichment analysis, A. Biological process, B. Molecular function, C. Cellular components, D. Pathways, E. Diseases, F. Drugs; Figure S4: Stem n-hexane extract PCs enrichment analysis, A. Biological process, B. Molecular function, C. Cellular components, D. Pathways, E. Diseases, F. Drugs; Table S1: Major phytocompounds identified in GC-MS profiling of methanolic extracts of roots (MR) of P. nepalensis; Table S2: Major phytocompounds identified in GC-MS profiling of methanolic extracts of shoots (MS) of P. nepalensis; Table S 3: Major phytocompounds identified in GC-MS profiling of n-hexane extracts of roots (NR) of P. nepalensis; Table S 4: Major phytocompounds identified in GC-MS profiling of n-hexane extracts of shoots (NS) of P. nepalensis; Table S5: The binding affinities (B.A.) kcal/mol values of TP53, NFKB1, and HSPCB with the PCs of of p.nepalensis (1a – 4j); Table S6: Results from t-test analysis for p53 complexes; Table S7: Results from One-way ANOVA test for HSPCB complexes; Excel S1: Enrichment analysis; Excel S2: Protein-protein interaction analysis; Text S1: MR extract inducing genes; Text S2: MS extract inducing genes; Text S3: NR extract inducing genes; Text S4: NS extract inducing genes.
Author Contributions
Conceptualization, M.P. and M.Y.; methodology, M.P.; software, M.P. and R.D.; validation, M.P., R.R. and Z.Z.; formal analysis, M.P, I.M, and R.D.; investigation, M.P.; data curation, M.P. and R.D.; writing—original draft preparation, M.P. and R.D.; writing—review and editing, M.Y and M.G.S.; visualization, M.P. and R.D.; supervision, M.Y.; project administration, M.Y, I.H, M.A.B, M.G.S.; N.I.Z.; funding acquisition, M.A.B. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the King Saud University, Riyadh, Saudi Arabia, grant number “RSPD2023R740”.
Informed Consent Statement
Not applicable.
Acknowledgments
We are thankful for the Research supporting project number (RSPD2023R740), King Saud University, Riyadh, Saudi Arabia.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Harvey, A.L. Natural products in drug discovery. Drug Discovery Today, 2008, 13, 894–901. [Google Scholar] [CrossRef] [PubMed]
- Mathur, S., Hoskins, C. Drug development: Lessons from nature. Biomedical Reports, 2017, 6, 612–614. [CrossRef] [PubMed]
- Kumari, S., Seth, A., Sharma, S., Attri, C. A holistic overview of different species of Potentilla a medicinally important plant along with their pharmaceutical significance: A review. Journal of Herbal Medicine, 2021, 29, 100460. [CrossRef]
- Tomczyk, M., Pleszczyńska, M., Wiater, A. Variation in total polyphenolics contents of aerial parts of Potentilla species and their anticariogenic activity. Molecules (Basel, Switzerland), 2010, 15, 4639–4651. [CrossRef] [PubMed]
- Kumari, S., Seth, A., Sharma, S., Attri, C. A holistic overview of different species of Potentilla a medicinally important plant along with their pharmaceutical significance: A review. Journal of Herbal Medicine, 2021, 29, 100460. [CrossRef]
- Tomczyk, M., Paduch, R., Wiater, A., Pleszczyńska, M., Kandefer-Szerszeń, M., Szczodrak, J. The influence of aqueous extracts of selected Potentilla species on normal human colon cells. Acta Poloniae Pharmaceutica, 2013, 70, 523–531.
- Tomczyk, M., Leszczyńska, K., Jakoniuk, P. Antimicrobial activity of Potentilla species. Fitoterapia, 2008, 79, 592–594. [CrossRef]
- Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., Dwight, S.S., Eppig, J. T., Harris, M.A., Hill, D.P., Issel-Tarver, L., Kasarskis, A., Lewis, S., Matese, J.C., Richardson, J.E., Ringwald, M., Rubin, G. M., Sherlock, G. Gene Ontology: Tool for the unification of biology. Nature Genetics, 2000, 25, 25–29. [CrossRef]
- Liu, L.-L.; Zhao, H.; Ma, T.-F.; Ge, F.; Chen, C.-S.; Zhang, Y.-P. Identification of valid reference genes for the normalization of RT-qPCR expression studies in human breast cancer cell lines treated with and without transient transfection. PloS One 2015, 10, e0117058. [Google Scholar] [CrossRef]
- Fu, J., Bian, L., Zhao, L., Dong, Z., Gao, X., Luan, H., Sun, Y., Song, H. Identification of genes for normalization of quantitative real-time PCR data in ovarian tissues. Acta Biochimica Et Biophysica Sinica, 2010, 42, 568–574. [CrossRef]
- Cartwright, T., Perkins, N.D., Wilson, C. NFKB1: A suppressor of inflammation, ageing and cancer. The FEBS Journal, 2016, 283, 1812–1822. [CrossRef] [PubMed]
- Li, J., Lei, W.-T., Zhang, P., Rapaport, F., Seeleuthner, Y., Lyu, B., Asano, T., Rosain, J., Hammadi, B., Zhang, Y., Pelham, S. J., Spaan, A.N., Migaud, M., Hum, D., Bigio, B., Chrabieh, M., Béziat, V., Bustamante, J., Zhang, S.-Y., Boisson, B. Biochemically deleterious human NFKB1 variants underlie an autosomal dominant form of common variable immunodeficiency. Journal of Experimental Medicine, 2016, 218, e20210566. [CrossRef]
- Guo, Q., Zhu, Q., Zhang, T., Qu, Q., Cheang, I., Liao, S., Chen, M., Zhu, X., Shi, M., Li, X. Integrated bioinformatic analysis reveals immune molecular markers and potential drugs for diabetic cardiomyopathy. Frontiers in Endocrinology, 2022, 13, 933635. [CrossRef] [PubMed]
- Gaptulbarova, K.A., Tsyganov, M.M., Pevzner, A.M., Ibragimova, M.K., Litviakov, N.V. NF-kB as a potential prognostic marker and a candidate for targeted therapy of cancer. Experimental oncology, 2020, 42, 263–269. [CrossRef]
- Weinmann, D., Mueller, M., Walzer, S.M., Hobusch, G.M., Lass, R., Gahleitner, C., Viernstein, H., Windhager, R., Toegel, S. Brazilin blocks catabolic processes in human osteoarthritic chondrocytes via inhibition of NFKB1/p50: BRAZILIN BLOCKS CATABOLIC PROCESSES VIA INHIBITION OF NFKB1/p50. Journal of Orthopaedic Research, 2018, 36, 2431–2438. [CrossRef]
- Reikvam, H. Inhibition of NF-κB Signaling Alters Acute Myelogenous Leukemia Cell Transcriptomics. Cells, 2020, 9, 1677. [Google Scholar] [CrossRef]
- Damineni, S., Rao, V.R., Kumar, S., Ravuri, R.R., Kagitha, S., Dunna, N.R., Digumarthi, R., Satti, V. Germline mutations of TP53 gene in breast cancer. Tumor Biology, 2014, 35, 9219–9227. [CrossRef]
- Hassin, O., Oren, M. Drugging p53 in cancer: One protein, many targets. Nature Reviews Drug Discovery, 2023, 22, 127–144. [CrossRef]
- Nayak, S.K., Khatik, G.L., Narang, R., Monga, V., Chopra, H.K. P53-Mdm2 Interaction Inhibitors as Novel Nongenotoxic Anticancer Agents. Current Cancer Drug Targets, 2018, 18, 749–772. [CrossRef]
- Zhao, D., Tahaney, W. M., Mazumdar, A., Savage, M. I., Brown, P.H. Molecularly targeted therapies for p53-mutant cancers. Cellular and Molecular Life Sciences, 2017, 74, 4171–4187. [CrossRef]
- Kim, S., Chen, J., Cheng, T., Gindulyte, A., He, J., He, S., Li, Q., Shoemaker, B.A., Thiessen, P.A., Yu, B., Zaslavsky, L., Zhang, J., Bolton, E.E. PubChem 2023 update. Nucleic Acids Research, 2023, 51, 1373–1380. [CrossRef]
- Lagunin, A., Ivanov, S., Rudik, A., Filimonov, D., Poroikov, V. DIGEP-Pred: Web service for in silico prediction of drug-induced gene expression profiles based on structural formula. Bioinformatics, 2013, 29, 2062–2063. [CrossRef] [PubMed]
- Kuleshov, M.V., Jones, M.R., Rouillard, A.D., Fernandez, N.F., Duan, Q., Wang, Z., Koplev, S., Jenkins, S.L., Jagodnik, K.M., Lachmann, A., McDermott, M.G., Monteiro, C.D., Gundersen, G.W., Ma’ayan, A. Enrichr: A comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Research, 2016, 44, 90–97. [CrossRef]
- Szklarczyk, D., Kirsch, R., Koutrouli, M., Nastou, K., Mehryary, F., Hachilif, R., Gable, A.L., Fang, T., Doncheva, N.T., Pyysalo, S., Bork, P., Jensen, L.J., von Mering, C. The STRING database in 2023: Protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Research, 2023, 51, 638–646. [CrossRef]
- Shannon, P., Markiel, A., Ozier, O., Baliga, N.S., Wang, J.T., Ramage, D., Amin, N., Schwikowski, B., Ideker, T. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Research, 2003, 13, 2498–2504. [CrossRef] [PubMed]
- Praveen, M., Morales-Bayuelo, A. Drug Designing against VP4, VP7 and NSP4 of Rotavirus Proteins – Insilico studies. Moroccan Journal of Chemistry, 2023, 11, 729–741. [CrossRef]
- Stroud, J.C., Oltman, A., Han, A., Bates, D.L., Chen, L. Structural Basis of HIV-1 Activation by NF-κB—A Higher-Order Complex of p50:RelA Bound to the HIV-1 LTR. Journal of Molecular Biology, 2009, 393, 98–112. [CrossRef]
- Wright, L., Barril, X., Dymock, B., Sheridan, L., Surgenor, A., Beswick, M., Drysdale, M., Collier, A., Massey, A., Davies, N., Fink, A., Fromont, C., Aherne, W., Boxall, K., Sharp, S., Workman, P., Hubbard, R.E. Structure-Activity Relationships in Purine-Based Inhibitor Binding to HSP90 Isoforms. Chemistry & Biology, 2004, 11, 775–785. [CrossRef]
- O’Boyle, N.M., Banck, M., James, C.A., Morley, C., Vandermeersch, T., Hutchison, G.R. Open Babel: An open chemical toolbox. Journal of Cheminformatics, 2011, 3, 33. [CrossRef]
- Morris, G.M., Huey, R., Lindstrom, W., Sanner, M.F., Belew, R.K., Goodsell, D.S., Olson, A.J. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. Journal of Computational Chemistry, 2009, 30, 2785–2791. [CrossRef]
- Trott, O., Olson, A.J. AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of Computational Chemistry 2009. [CrossRef]
- Wang, J., Wolf, R.M., Caldwell, J.W., Kollman, P.A., Case, D.A. Development and testing of a general amber force field. Journal of Computational Chemistry, 2004, 25, 1157–1174. [CrossRef] [PubMed]
- Pettersen, E.F., Goddard, T.D., Huang, C.C., Meng, E.C., Couch, G.S., Croll, T.., Morris, J.H., Ferrin, T.E. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Science: A Publication of the Protein Society, 2021, 30, 70–82. [CrossRef]
- Tian, C., Kasavajhala, K., Belfon, K.A.A., Raguette, L., Huang, H., Migues, A.N., Bickel, J., Wang, Y., Pincay, J., Wu, Q., Simmerling, C. ff19SB: Amino-Acid-Specific Protein Backbone Parameters Trained against Quantum Mechanics Energy Surfaces in Solution. Journal of Chemical Theory and Computation, 2020, 16, 528–552. [CrossRef] [PubMed]
- Genheden, S., Ryde, U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opinion on Drug Discovery, 2015, 10, 449–461. [CrossRef]
- Turner, PJ. XMGRACE, Version 5.1.19; Center for Coastal and Land-Margin Research, Oregon Graduate Institute of Science and Technology: Beaverton, OR, 2005. [Google Scholar]
- R Core Team R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2018. Available online at https://www.R-project.org/.
Figure 1.
Flow chart of the methodologies used in this study.
Figure 1.
Flow chart of the methodologies used in this study.
Figure 2.
Protein-protein interaction network of the Methanolic root extract-induced genes.
Figure 2.
Protein-protein interaction network of the Methanolic root extract-induced genes.
Figure 3.
Protein-protein interaction network of the Methanolic shoot extract-induced genes.
Figure 3.
Protein-protein interaction network of the Methanolic shoot extract-induced genes.
Figure 4.
Protein-protein interaction network of the n-hexane root extract-induced genes.
Figure 4.
Protein-protein interaction network of the n-hexane root extract-induced genes.
Figure 5.
Protein-protein interaction network of the n-hexane shoot extract-induced genes.
Figure 5.
Protein-protein interaction network of the n-hexane shoot extract-induced genes.
Figure 6.
Molecular interactions of the complexes. i. p53 with PCs 1a and 2a and ii. HSP with PCs 3a, 4a, and 4c.
Figure 6.
Molecular interactions of the complexes. i. p53 with PCs 1a and 2a and ii. HSP with PCs 3a, 4a, and 4c.
Figure 7.
RMSD values from simulated p53 (A) with 1b (red) and 2a (black) complexes; HSPCB (B) with 3a (pink), 4a (brown), and 4c (green) complexes at 300 ns production runs divided into 15000 frames.
Figure 7.
RMSD values from simulated p53 (A) with 1b (red) and 2a (black) complexes; HSPCB (B) with 3a (pink), 4a (brown), and 4c (green) complexes at 300 ns production runs divided into 15000 frames.
Figure 8.
RMSF values by bfactor of every residue in p53 (A) with 1b (red) and 2a (black) complexes; HSPCB (B) with 3a (pink), 4a (brown), and 4c (green) complexes.
Figure 8.
RMSF values by bfactor of every residue in p53 (A) with 1b (red) and 2a (black) complexes; HSPCB (B) with 3a (pink), 4a (brown), and 4c (green) complexes.
Figure 9.
MMGBSA energy values of p53 (A) with 1b (red) and 2a (black) complexes; HSPCB (B) with 3a (pink), 4a (brown), and 4c (green) complexes.
Figure 9.
MMGBSA energy values of p53 (A) with 1b (red) and 2a (black) complexes; HSPCB (B) with 3a (pink), 4a (brown), and 4c (green) complexes.
Figure 10.
Summary of MMGBSA free energy values of both complexes, P53 (pink) and HSPCB (yellow) Compound 1 – 1b, Compound 2 – 2a, Compound 3 – 3a, Compound 4 –4a, Compound 5 – 4c.
Figure 10.
Summary of MMGBSA free energy values of both complexes, P53 (pink) and HSPCB (yellow) Compound 1 – 1b, Compound 2 – 2a, Compound 3 – 3a, Compound 4 –4a, Compound 5 – 4c.
Table 1.
Enrichment analysis of methanolic and n-hexane extracts from root and stem plant parts of p.nepalensis.
Table 1.
Enrichment analysis of methanolic and n-hexane extracts from root and stem plant parts of p.nepalensis.
Descriptors |
MR |
MS |
NR |
NS |
BP |
nucleic acid-templated transcription positive regulation, |
DNA-templated transcription regulation, Nucleic acid-templated transcription positive regulation |
Nucleic acid-templated transcription positive regulation, |
Nucleic acid-templated transcription positive regulation |
MF |
Protein homodimerization activity, |
Protein Serine/Threonine Phosphatase activity |
DNA binding, Protein Homodimerization Activity, |
DNA-Binding transcription activator activity, Protein Serine/Threonine Phosphatase activity, Oxidoreductase activity |
CC |
Intracellular membrane-bounded organelle, Nucleus, secretory granule lumen |
Nucleus, Azurophil granule lumen |
Intracellular membrane-bounded organelle, Nucleus |
Intracellular organelle lumen, Endoplasmic reticulum lumen, Secretory granule lumen |
Pathways |
Vegfa-Vegfr2 signalling pathway, Leptin signalling pathway, Micro-RNAs in cardiomyocyte hypertrophy, B-cell receptor signalling pathway |
Nuclear receptors meta-pathway, Osteoblast differentiation, Vitamin-D receptor pathway |
Nuclear receptors meta-pathway, Vitamin-D receptor pathway |
Common pathways underlying drug addiction, Melanoma, pyrimidine metabolism |
Diseases |
Neoplasm metastasis, Liver carcinoma, Mammary neoplasms, Melanoma |
Neoplasm metastasis, Breast carcinoma, Prostate malignant neoplasm |
Neoplasm metastasis, Liver carcinoma |
Breast carcinoma, Breast neoplasm malignant, Neoplasm metastasis |
Drugs |
Aprindine, Domperidone Trifluoperazine, Pitolisant, Cyproheptadine, Pimozide, Brompheniramine Buprenorphine, Lidoflazine, Chlorambucil |
Mefenamic Acid, Diclofenac, Flufenamic Acid, Quercetin Mezlocillin,,Hydrochlorothiazide Hydroxycarbamide Bendroflumethiazide Benzthiazide, Chlorambucil |
Bezafibrate, Rosiglitazone Stearic Acid, Dodecanoic Acid, Gamolenic Acid, Aprindine, Caffeine, Eicosapentaenoic Acid, Linolenic Acid, Mefenamic Acid |
Stearic Acid, Epalrestat Dodecanoic Acid, Gamolenic Acid, Vemurafenib, Bezafibrate, Gemfibrozil, Linolenic Acid, Aprindine, Eicosapentaenoic Acid |
Table 2.
Network topological parameters of the complete Protein-protein interaction network.
Table 2.
Network topological parameters of the complete Protein-protein interaction network.
Induced genes used |
Proteins |
Network topological parameters |
DC |
Avg short. path leng. |
CC |
C. cen |
BC |
MR |
HSPCB |
15 |
3.02 |
0.18 |
0.33 |
0.18 |
NFKB1 |
15 |
2.85 |
0.25 |
0.34 |
0.33 |
MS |
TP53 |
25 |
1.87 |
0.15 |
0.53 |
0.56 |
NR |
TP53 |
20 |
2.32 |
0.16 |
0.42 |
0.58 |
NS |
TP53 |
25 |
1.75 |
0.17 |
0.57 |
0.59 |
Table 3.
Binding affinity (B.A.) and interactions of the Heat shock protein (HSP), Nuclear factor NF-kappa-BP (NFKB1), and TP53-binding protein (p53) with the extracted PCs of p.nepalensis.
Table 3.
Binding affinity (B.A.) and interactions of the Heat shock protein (HSP), Nuclear factor NF-kappa-BP (NFKB1), and TP53-binding protein (p53) with the extracted PCs of p.nepalensis.
Complex |
B.A. (kcal /mol)
|
Hydrogen Bonds |
Hydrophobic bonds |
Other bonds |
Proteins |
PCs |
CHB |
π-Alkyl |
Alkyl |
π - π Stacked |
π - π T Shaped |
π -Sigma |
p53 |
1b |
-8.6 |
ASer1503 |
4ATrp1495, ATyr1502, 2APhe1519, ATyr1523, 3BTrp1495, 3BTyr1502, 2BPhe1519, BTyr1523 |
2BMet1584 |
- |
- |
- |
- |
2a |
-8.0 |
- |
2ATrp1495, 2ATyr1502, 2APhe1519, 2BTrp1495, BTyr1502, 2BPhe1519, BTyr1523 |
AMet1584 |
- |
- |
- |
- |
HSP |
3a |
-9.6 |
- |
APhe138, Aval150 |
AMet98, 2ALeu107, AAla111 |
APhe138 |
- |
AMet98 |
AMet98 |
4a |
-8.7 |
- |
3APhe138, 2ATrp162, AMet98, ALeu107 |
2AVal186, 2AMet98, AVal150, ALeu107 |
APhe138 |
- |
2Trp162 |
- |
4c |
-8.2 |
2ATrp162 |
APhe22, APhe170, 2ALeu107, AMet98, AVal150 |
AIle26 |
2APhe138 |
ATyr139, 2ATrp162 |
- |
- |
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).