Variations of VEGFR2 Chemical Space: Stimulator and Inhibitory Peptides

Claudiu N. Lungu; Ionel Mangalagiu; Gabriela Gurau; Mihaela Cezarina Hîncu

doi:10.20944/preprints202406.0461.v1

Submitted:

06 June 2024

Posted:

07 June 2024

You are already at the latest version

Abstract

The kinase pathway plays a crucial role in blood vessel function. Particular attention is paid to VEGFR-type 2 angiogenesis and vascular morphogenesis as the tyrosin kinase pathway is preferentially activated. In silico studies were performed on several peptides that affect VEGFR2 in both stimulating and inhibitory ways. This investigation aims to examine the molecular properties of VEGFR2, a molecule primarily involved in the processes of vasculogenesis and angiogenesis. These relationships were defined by the interactions between vascular endothelial growth factor receptor 2 (VEGFR2) and the structural features of the systems. The chemical space of the inhibitory peptides and stimulators was described using topological and energetic properties. Furthermore, chimeric models of stimulating and inhibitory proteins (for VEGFR2) were computed using the protein system structures. The interaction between the chimeric proteins and VEGFR was computed. The chemical space was further characterized using complex manifolds and high-dimensional data visualization. The results show that a slightly similar chemical area is shared by VEGFR2 and is shared by stimulating and inhibitory proteins. On the other hand, the stimulator peptides and the inhibitors have distinct chemical spaces.

Keywords:

peripheral artery disease

;

molecular modeling

;

docking

;

angiogenesis

;

vascular morphogenesis

;

chimeric protein

Subject:

Medicine and Pharmacology - Cardiac and Cardiovascular Systems

1. Introduction

A well-coordinated response from cells is crucial for forming new blood vessels. This response involves specific receptors on the cell surface. Although the main molecular signals are known, their interaction mechanism is still not fully understood[1].

VEGF-VEGFR, Notch-DSL, Tie-Angiopoietin, VE-cadherin, and Ephrin-Eph are the main pathways in vascular morphogenesis. Among these, the VEGF pathway is essential for regulating angiogenesis, which consists of the growth of new blood vessels. It works by activating processes in endothelial cells that promote their growth, movement, and survival, as well as controlling vessel permeability[2].

VEGF mainly affects endothelial cells and influences other cell types like monocytes and macrophages. It promotes the growth and movement of endothelial cells in laboratory settings. The VEGF family includes five molecules: VEGF A to E. Each one binds to specific receptors on the cell surface, triggering a process that activates them through phosphorylation[3].

VEGF plays a crucial role in forming blood vessels by interacting with specific receptors on cell surfaces, leading to various cellular responses that promote angiogenesis[4,5].

VEGF, as stated, interacts with specific receptors on endothelial cells, mainly VEGFR2, to trigger cellular responses involved in angiogenesis. Although VEGFR1 signaling is less potent, it contributes to endothelial cell proliferation by merging with the VEGFR2 pathway. Activation of VEGFR2 leads to various downstream pathways that regulate cell survival, proliferation, and permeability. One pathway involves PI3K-AKT-mTOR signaling, while another crucial pathway involves PLCI-mediated activation of PKC, leading to the induction of the ERK pathway. Endothelial cell migration is influenced by VEGFA/VEGFR2 signaling through p38MAPK activation. This signaling network plays a critical role in angiogenesis by regulating various enzymes, receptors, and transcription factors. Despite efforts, clinical success in promoting angiogenesis in peripheral artery disease patients remains challenging[6,7,8].

VEGF-A is vital for endothelial cell functions related to angiogenesis, primarily through VEGFA/VEGFR2 signaling, which drives endothelial cell proliferation, migration, survival, and new vessel formation. Cell signaling is tightly regulated spatially and temporally, with specialized membranes and vesicles containing specific lipids and proteins modulating signaling output. Phosphatidylinositol 4,5-bisphosphate (P.I. (4,5)P2) is crucial for multiple cellular processes. Furthermore, the tiny G proteins Rap1a and Rap1b offer insights into VEGF signaling in endothelial cells by playing critical roles in angiogenesis and endothelial cell responses to VEGF[9,10].

Furthermore, vascular endothelial cells have the glycoprotein VEGFR-2, which binds to VEGF-A. It is essential to angiogenesis and has particular autophosphorylation sites upon binding to VEGF. Compared to VEGFR1, an impaired tyrosine kinase receptor, VEGFR-2 is much more active. VEGF activates VEGFR2 on the membrane of endothelial cells, which starts a chain reaction of signaling molecules that includes VRASP, PLCγ, ScK, Cdc42, Src, and PI3K. These molecules regulate cell migration, proliferation, survival, and permeability through interactions with downstream pathways such as ERK, p38MAPK, and AktPKE. Essential for both healthy and pathological angiogenesis, VEGFR-2 mediates VEGF-driven responses in endothelial cells[11,12].

In addition to VEGFR-2, the Notch signaling pathway plays a significant role in embryonic development. Delta notch or Seratt-like ligands stimulate the Notch receptor, leading to downstream effects on DNA transcription factors like Hes1/5 and Hey. This pathway is essential for proper embryonic development.[13]

VEGF-A, acting through VEGFR2, leads to endothelial cell proliferation, migration, survival, and new vessel formation crucial for angiogenesis. Cell signaling in angiogenesis is tightly regulated and involves various molecules, including phosphatidylinositol 4,5-bisphosphate (P.I. (4,5)P2) and small G proteins like Rap1a and Rap1b. Hypoxia and downstream signaling pathways influence angiogenesis, including SOX17- and VEGF-R2-mediated pathways[14].

Furthermore, focal adhesion kinase (FAK) plays a crucial role in embryonic angiogenesis, regulating endothelial cell survival and barrier functions. Loss of FAK or its kinase activity decreases endothelial cell proliferation and migration, indicating FAK's role as a kinase in regulating adult angiogenesis[15].

VEGFR-2 and other signaling pathways are essential targets for therapeutic strategies that promote angiogenesis and treat vascular diseases.

In this respect, ischemic diseases like heart failure, strokes, and peripheral artery disease result from poor blood supply. Treating these conditions with pro-angiogenic molecules is appealing. VEGFA plays a crucial role in vessel formation, growth, and branching, making it a critical pro-angiogenic molecule. It primarily acts on VEGFR2 but also stimulates VEGFR1. Targeting both receptors could be a promising therapy for promoting angiogenesis. Despite promising experimental results, there are currently no FDA-approved pro-angiogenic molecules.

Extensive research has categorized various pro-angiogenic molecules, including angiogenic proteins, gene therapy, peptide drugs, and organic molecules[16,17].

Peptides, smaller molecules than proteins, do not require complex structures to be biologically active. They can be manipulated easily and optimized to mimic angiogenesis-stimulating molecules. Peptides can also be modified or conjugated with other molecules to enhance their properties. Due to their simplicity and smaller size, pro-angiogenic peptides can be rapidly synthesized to stimulate angiogenesis effectively[18].

This computational study aims to characterize the chemical space of stimulant and inhibitory VEGFR2 proteins to further design a potent peptide or organic molecule that can shape the angiogenesis and vascular-morphogenesis processes.

2. Results

As presented in the methods section, a complete series of 3D protein molecule structures (PDB) that act on VEGF2 and consecutively inhibit or stimulate vascular morphogenesis have been used. Homology modeling was used for some structures to generate 3D molecules using their Uniprot ID( where the PDB structure was unavailable). As stated, homology modeling for the angiogenesis inhibitor Vasstatin and angiogenesis stimulators -PDGFC, PIGF, and PDGF D were performed using their uniprot I.D. The resulting structures are shown below(Figure 1):

To explore the VEGFR2 interaction with organic molecules, the VEGFR2 binding site was determined computationally to perform docking studies. Furthermore, the binding site characterization (VEGFR2 as the target molecule) retrieved the following binding sites for the VEGFR2 PDB model 3VNT[19]:(a) a major cavity 1 with a volume =435.712 and with the following coordinates x=29.14, y=-36.14, z=-18.65; (b)cavity 2 with volume = 23.04, x = 23.25.z=-44.35, y=-14.55;(c)cavity 3 with volume=16.384 , x=11.72, y=-32.57, z=-21.62;(d) cavity 4 with a volume =5.36, x=13.22, y=--12.71, z=-18.12;(e) cavity 5 with a volume =10.24, x=12.46, y=-7.67, z=-29.09;(f) cavity 6 with a volume=12.80 with x=26.74, y=-47.08, z=-27.29. Cavity 1 was chosen for docking studies, taking into account its volume(Figure 2).

Docking results of 27/ 520( see the rest of docking results as supplementary material S₁) structures selected randomly using the Chembl_1 database are shown in Table 1 below:

The protein-protein docking results are shown in Table 2 and Table 3. Also, in Figure 3, a protein-protein complex is displayed as an example of the docking of inhibitory and stimulant proteins.

Protein-protein docking for the inhibitory protein structures against VEGFR2 retrieved the following complexes with the binding energies represented in Table 2:

Protein-protein docking for the inhibitory protein structures against VEGFR2 retrieved the following complexes with the binding energies represented in Table 3:

Furthermore the chimeric homology model computed for the inhibitory protein structures using their Aa sequences has a sequence similarity of 99.99%, a molecular probability score of 2.03, and a general clash score of 3.99, Ramachandran plot favored of 93.76%, rotamer outlier of 3.75%, aC beta deviation of 2, a ratio of bad bounds versus favorable bounds of 2/3449 and ratio of bad angles versus favorable angles of 20/4662 ( results obtained using mol Probability version 4.1 as stated in the Methods section). As stated in the methods section, the chimeric model was further optimized using the Swiss online preparation server. The structure is represented in Figure 4(a.).

The chimeric homology model design for the stimulatory protein structure using their Aa sequences has a sequence similarity of 99.99%, a mol probability score of 1.84, and a General clash score of 1.77, Ramachandran favored of 94.42%, rotamer outlier of 1.40%, aC beta deviation of 5, the ratio of bad bounds versus favorable bounds of 0/1810 And ratio of bad angles versus favorable angles of 11/2411 ( results obtained using mol Probability version 4.1). The chimeric model was further optimized using the online preparation server Swiss Model. The structure is represented in Figure 4(b.).

Furthermore, Table 4 shows the Aa sequences to the inhibitory and stimulate chimeric models. Figure 5 and Table 5 show the Aa composition and properties of the chimeric models.

Docking results of the inhibitory and stimulatory model show the following: the complex between the inhibitory model and VEGFr2 has a total energy of -71.62 kcal/mol, and the complex of the stimulatory chimeric model and VEGFR2 has a total energy of -58.81 kcal/mol.

Furthermore, the chemical space characterized by molecular descriptors for angiogenesis inhibitor molecules and angiogenesis stimulator molecules, respectively, is represented in Figure 6.

Figure 7 represents the chemical space of the chimeric inhibitory and stimulant models.

Furthermore, The C -alpha-based distance plot computed for the chimeric inhibitory and stimulant models and plots for PEDF and Angiopoietin 1 are represented in Figure 8.

In Figure 9, the chimeric models multidimensional data are represented.

The polynomial equations resulting from the inhibitory, stimulant, and combined multidimensional spaces are shown below:

Inhibitory space y = -23.758x6 - 3.4701x5 + 12.001x4 - 0.9262x3 - 2.8557x2 + 0.4032x + 0.1676

(1)

Stimulant space y = -1.1017x6 - 3.6244x5 + 2.7119x4 + 0.7384x3 - 1.2141x2 - 0.269x + 0.0601

(2)

Combine space y = -7.9346x6 - 9.1068x5 + 6.8296x4 + 1.3786x3 - 2.1349x2 + 0.0735x + 0.1191

(3)

Also, a map of the 2D complex space is shown in the Figure 10 below:

3. Discussion

In structural biology, homology modeling, sometimes called comparative modeling, is a computational technique that predicts a protein's three-dimensional structure using its amino acid sequence and the structure of a comparable protein known to exist (template)[20]. The fundamental premise is that proteins with similar sequences frequently exhibit structural and functional similarities. The following steps are usually involved in the homology modeling process. Finding a template entails finding an appropriate homologous template—comparable in sequence and structure—to the target protein and has a known three-dimensional structure. Numerous databases and sequence alignment techniques, such as BLAST (Basic Local Alignment Search Tool) and HHpred (Homology Detection and Structure Prediction by HMM-HMM Comparison), can be used. When the target protein's amino acid sequence matches the template protein, this is known as sequence alignment. This alignment is essential to map the template structure onto the target protein. Model building based on the sequence alignment, a three-dimensional model of the target protein is constructed using computational techniques such as comparative modeling algorithms. These algorithms use the known structure of the template protein to generate a model of the target protein by aligning corresponding residues and building missing regions. Model refinement is where the initial model may undergo refinement to improve its quality and accuracy. This can involve energy minimization, molecular dynamics simulations, and other optimization techniques to optimize the geometry and remove steric clashes. Lastly, The quality of the homology model is assessed using various validation criteria such as Ramachandran plot analysis, MolProbity scores, and QMEAN scores. These measures help evaluate the stereochemical quality and overall reliability of the model. The validation of homology modeling involves assessing the quality and reliability of the predicted protein structure. Several techniques and criteria can be used: (a) Ramachandran plot analysis evaluates the amino acid residues' backbone dihedral angles (φ and ψ) in the modeled structure. The Ramachandran plot shows allowed and disallowed regions based on stereochemical constraints. A high percentage of residues in the favored areas indicates a good-quality model; (b) MolProbity assesses the overall quality of protein structures, including homology models, by evaluating steric clashes, bond lengths, bond angles, and other geometric parameters.

Lower MolProbity scores indicate better model quality; (c) QMEAN (Qualitative Model Energy ANalysis) is a composite scoring function that evaluates the overall model quality based on various structural features, including energy terms, solvation, and torsion angles. Higher QMEAN scores correspond to better-quality models; (d) ProSA-web: ProSA-web calculates the Z-score of the modeled structure, which measures its overall energy deviation from experimental structures of similar size. Lower Z-scores indicate better agreement with experimental structures. This study used Ramachandran plots to validate the homology models[21]. The Ramachandran Plots are represented in Figure 10.

As observed in Figure 11, a high percentage of residues in the favored regions indicates a good-quality model. Also, the homology models obtained are stable and have an energetically favorable profile.

Binding cavities often have unique structural features, allowing them to interact with specific molecules. These features include pockets, grooves, and specific amino acid residues that form hydrogen bonds, hydrophobic interactions, or electrostatic interactions with the ligand. Binding cavities exhibit specificity towards particular ligands. This specificity arises from complementary shapes and chemical properties between the cavity and the ligand. The binding of ligands to these cavities often triggers conformational changes in the protein, leading to its activation or inhibition. This functional modulation is crucial for various biological processes, including enzymatic reactions, signal transduction, and molecular transport. Binding cavities are frequently targeted by drugs and therapeutics to modulate protein function. Small molecules or medicines can be designed to bind to these cavities, either activating or inhibiting the protein's activity. Binding cavities may exhibit flexibility or adaptability to accommodate different ligands or undergo conformational changes upon ligand binding. This flexibility is essential for the protein to perform its biological functions effectively. In addition to the primary binding site, proteins may possess allosteric sites distinct from the active site. However, they can regulate the protein's activity through conformational changes induced by ligand binding at these sites[22].

Furthermore, protein-protein interactions (PPIs) are fundamental in virtually all biological processes, including cell signaling, gene regulation, enzymatic activity, and structural support. These interactions occur when two or more proteins bind together transiently or stably to form complexes, enabling them to carry out specific functions within the cell. Understanding protein-protein interactions is crucial for elucidating cellular processes and designing therapeutics to modulate these interactions for various purposes. PPIs can be classified into several types based on duration, strength, and functional consequences. These include transient interactions, such as signaling interactions, and stable interactions, such as those involved in forming structural complexes. Protein-protein interactions typically occur through specific binding interfaces, where complementary surfaces of the interacting proteins come into contact. These interfaces often involve amino acid residues that form hydrogen bonds, hydrophobic interactions, electrostatic interactions, or van der Waals forces. PPIs exhibit specificity, meaning that proteins selectively interact with their binding partners. This specificity arises from complementary shapes, charges, and chemical properties between the interacting proteins. The interactions between proteins can be regulated dynamically in response to various cellular signals, environmental cues, or post-translational modifications. This regulation allows cells to fine-tune their signaling pathways and responses to internal and external stimuli. Protein-protein interactions mediate various biological processes, including enzyme activation/inhibition, signal transduction, protein trafficking, DNA replication and repair, and cytoskeletal organization. Disruption or dysregulation of these interactions can lead to diseases such as cancer, neurodegenerative disorders, and autoimmune diseases[23].

Several residues on VEGFR2 have been identified as involved in protein-protein interactions (PPIs), particularly with its ligands (vascular endothelial growth factors, VEGFs) and other signaling molecules. While the specific residues involved may vary depending on the interaction partner and context, here are some general insights into the regions and residues of VEGFR2 involved in PPIs. The extracellular domain of VEGFR2 interacts with VEGF ligands, typically homodimers or heterodimers. Specific residues within the extracellular domain of VEGFR2 bind to VEGF. For example, residues in the ligand-binding domain (LBD), including those in Ig-like domains, have been implicated in VEGF binding. The intracellular tyrosine kinase domain of VEGFR2 is involved in downstream signaling cascades following ligand binding. This domain can interact with various signaling proteins through phosphorylation-dependent or independent interactions, including adaptor molecules and other kinases. Specific residues within the TKD may participate in these interactions, particularly those in substrate recognition and catalysis. VEGFR2 undergoes autophosphorylation on specific tyrosine residues within its intracellular domain upon ligand binding. These phosphorylated tyrosine residues serve as docking sites for downstream signaling proteins containing SH2 (Src homology 2) or PTB (phosphotyrosine binding) domains, mediating protein-protein interactions critical for signal transduction. [Through direct or indirect interactions, VEGFR2 can form complexes with other receptors or co-receptors, such as neuropilins, integrins, and other RTKs. Adaptor proteins or scaffolding molecules often mediate these interactions, and specific residues within VEGFR2 may contribute to the stability or specificity of these complexes. VEGFR2 contains regulatory domains, such as the juxtamembrane and kinase insert domains, which may participate in protein-protein interactions that modulate the receptor's activity, localization, or stability. In his study, 12 protein-protein docking studies were performed on the inhibitory protein complexes and 14 on the stimulant protein complexes. All the protein docking studies retrieve stable VEFFR2 -protein complexes.

In the Table 2 and Table 3, the protein-protein docking results are displayed. In Table 2, VEGFR2 docked with the inhibitory proteins are shown. The best docking energies (kcal/mol) are observed when VEGFR2 is docked with 1AU1, and the most considerable complex energy is observed at the VEGFR2 -1BBN complex. However, all complexes display favorable energies with presumably notable biological activity. In Table 3, VEGFR2 is docked with the stimulant proteins. 2X1W forms the most favorable complex with VEGFR2. In this case. 2TGP forms the lowes in the energy complex(kcal/mol). Like in the case of the inhibitory proteins, all complexes are energetically favorable. If a complex has a negative total energy, it generally indicates that the interactions within the complex are favorable and that the complex is stable. Negative total energy suggests that the attractive forces (such as electrostatic interactions, hydrogen bonding, and van der Waals interactions) between the molecules in the complex outweigh the repulsive forces (such as steric hindrance or electrostatic repulsion). These favorable interactions contribute to the stability of the complex. A negative total energy often correlates with a strong binding affinity between the molecules in the complex. The stronger the binding affinity, the more negative the total energy tends to be. This indicates that the complex will likely form and persist under given conditions. In thermodynamic terms, a negative total energy corresponds to a decrease in the overall free energy of the system upon complex formation. This suggests that the complex is stable under the prevailing conditions and that the formation of the complex is thermodynamically favorable. It's important to note that the accuracy of energy calculations depends on the methods used for computation (e.g., quantum mechanical calculations, molecular mechanics simulations). Different computational methods may yield different absolute energy values, but the relative energy values (such as the change in energy upon complex formation) are generally more meaningful. While a negative total energy indicates stability, it does not necessarily guarantee biological activity or function. Experimental validation is often required to confirm the biological relevance of a predicted complex. Additionally, factors such as entropy and solvent effects, which are not always fully accounted for in energy calculations, can influence the stability of complexes in biological systems. Solvation energy refers to the energy change associated with the process of solvation, where solvent molecules surround and interact with solute molecules to form a solution. It plays a crucial role in various chemical and biochemical processes, influencing the stability, solubility, and reactivity of solutes in solution. Solvation energy can be either favorable (exothermic) or unfavorable (endothermic) depending on the nature of the solute-solvent interactions. Solvation energy is the difference in energy between the solvated and separated states of solute and solvent molecules. It represents the overall effect of solvent molecules stabilizing or destabilizing the solute. When solvent molecules interact favorably with the solute, solvation energy is negative (exothermic), indicating that the solvated state is more stable than the separated state. This typically occurs when solute-solvent interactions are strong, such as in the case of polar solutes dissolving in polar solvents or nonpolar solutes dissolving in nonpolar solvents. Conversely, when solvent-solute interactions are weak or repulsive, solvation energy is positive (endothermic), indicating that the solvated state is less stable than the separated state. This may occur when dissolving nonpolar solutes in polar solvents or polar solutes in nonpolar solvents, where the interactions between unlike molecules are less favorable. The magnitude of solvation energy depends on various factors, including the nature of solute and solvent molecules, their polarity, size, shape, and temperature and pressure conditions. Solvation energy influences the rates and equilibrium of chemical reactions occurring in solution. Solvation of reactant molecules can either enhance or hinder their reactivity by stabilizing or destabilizing their transition states and intermediate species. In summary, angle energy is the potential energy associated with deviations of bond angles from their equilibrium values within a molecule. It is an important component of the total potential energy in molecular mechanics simulations and is crucial in determining molecules' conformational stability and behavior. The specific form of the angle energy term varies depending on the force field being used. However, in general, it represents the energy associated with the bending or stretching of bonds and contributes to the overall potential energy of the molecular system. In a molecular system, chemical bonds connect atoms, and these bonds have characteristic bond angles. The angle energy arises from the deviation of these bond angles from their preferred or equilibrium values. When the bond angles deviate, the system's potential energy increases, contributing to the overall energy of the molecule. Here, both angular and solubility energies show favorable values that correlate with the total complex energies. Overall, docking results show that the docking procedure was performed properly. Finally, VEGFR2 forms stable active complexes with the inhibitory and stimulant peptides retrieved from the literature[24,25,26]. However, all complexes of the of the inhibitory and stimulatory proteins display favorable energies with presumably notable biological activity.Regarding inhibitory molecules docking energies, the most favorable energy is observed at 4EB1 with a total complex energy of -92.87 kcal/mol. The highest docking energy observed at stimulants molecule is observed at 2X1W with a docking energy of -99.99 kcal/mol. Also, in the case of inhibitors, the most favorable solvation energy is observed at 4EB1 with 15734.68 kcal/mol. The same is true in the case of the stimulants; the most favorable docking energy is observed at 2XIW with -14554.78 kcal/mol.

In a chimeric model, structural elements from different molecules are combined to create a new molecule with desired characteristics. This could involve combining functional groups, binding pockets, or other molecular features from existing molecules to generate a hybrid structure. Chimeric models are often designed based on a rational understanding of molecular interactions and structure-activity relationships. Researchers may select specific elements from different molecules known to interact with a target protein or exhibit certain biological activities. Chimeric models can be subjected to virtual screening techniques to assess their potential for binding to a target protein or modulating a biological pathway. Computational methods such as molecular docking or molecular dynamics simulations can be employed to predict the binding affinity and mode of interaction of the chimeric molecule with its target. Chimeric models are valuable tools in drug design and discovery. By combining elements from different molecules, researchers can create novel compounds with improved potency, selectivity, or pharmacokinetic properties compared to existing drugs. Chimeric models can be used in lead optimization, where initial hits identified through high-throughput screening are modified to enhance their drug-like properties. Chimeric molecules may undergo iterative rounds of computational design, synthesis, and biological testing to optimize their activity and pharmacological profile[27,28,29,30].

Comparing two amino acid (Aa) sequences is fundamental in bioinformatics and molecular biology. Sequence comparison allows researchers to identify similarities, differences, and patterns between proteins, which can provide insights into their structure, function, and evolutionary relationships. Here's how you can compare two A.A. sequences. Perform a pairwise alignment of the two Aa sequences using algorithms such as Needleman-Wunsch, Smith-Waterman, or FASTA. These algorithms identify the optimal alignment between the sequences by maximizing the number of matched residues and minimizing gaps and mismatches.

Use scoring matrices such as BLOSUM or PAM to assign scores to matches, mismatches, and gap penalties during sequence alignment. These matrices are based on empirical observations of amino acid substitutions in related proteins and help quantify the similarity between sequences. Calculate sequence similarity and identity scores based on the alignment results. Sequence similarity is the percentage of identical residues and conservative substitutions between the sequences, while sequence identity is the percentage of identical residues only. Similarity and identity scores provide quantitative measures of the degree of similarity between sequences and can help compare proteins with different evolutionary distances. Identify functional domains, motifs, and conserved regions within the aligned sequences. Conserved areas often correspond to functional domains or motifs essential for protein structure and function. Use tools like InterPro, Pfam, or SMART to annotate domains and motifs based on the alignment results. Perform phylogenetic analysis using the aligned sequences to infer evolutionary relationships between proteins. Phylogenetic trees can help elucidate protein sequences' evolutionary history and divergence. Phylogenetic analysis can be conducted using software packages such as MEGA, PHYLIP, or RaxML[31,32].

The domain analysis of the Aa inhibitory chimeric model reveals that the representative domain is the serpin Ci1 domain. The Serpin (serine protease inhibitor) family is a protein group that plays a crucial role in regulating proteolytic processes in various biological systems. Serpins are characterized by their ability to inhibit serine proteases, a class of enzymes involved in a wide range of physiological processes, including blood coagulation, immune response, inflammation, and tissue remodeling. Here's an overview of the Serpin family. Serpins typically share a conserved structure of around 350-400 amino acids. They fold into a compact, globular conformation with three β-sheets (A, B, C) and nine α-helices (A-I). The serpin fold contains a reactive center loop (RCL), which acts as a bait for serine proteases. The RCL undergoes a conformational change upon protease binding, forming a covalent complex between the serpin and protease. Serpins inhibit serine proteases by a suicide substrate-like mechanism. Upon binding to the protease, the RCL of the serpin is cleaved by the protease, leading to the formation of an acyl-enzyme intermediate. This intermediate is then inserted into the central β-sheet of the serpin, irreversibly trapping and inactivating the protease. The Serpin family is highly diverse and includes members with many functions beyond protease inhibition. Some serpins act as inhibitors of blood coagulation factors (e.g., antithrombin), while others regulate immune responses (e.g., α1-antitrypsin), inflammation, and tissue remodeling. Additionally, certain serpins have non-inhibitory functions, such as hormone transport (e.g., thyroxine-binding globulin) and chaperone-like activity. Mutations in serpin genes can lead to various diseases and disorders. For example, mutations in SERPINA1, encoding α1-antitrypsin, are associated with liver and lung diseases, including alpha-1 antitrypsin deficiency. Similarly, hereditary angioedema, a rare illness characterized by recurrent episodes of swelling in diverse body areas, can be brought on by mutations in SERPING1, the gene that codes for the C1 inhibitor. The Serpin family has a long evolutionary history, and members can be found in various animals, including humans and microbes. Throughout their evolutionary history, serpins have undergone significant gene duplication, diversification, and specialization, giving rise to functionally unique subfamilies[33,34]. The antithrombin three domain is the domain of the serine protease inhibitor family. Thrombin, a crucial protease in the coagulation cascade, is inhibited by antithrombin III. Thrombin possesses non-hemostatic properties, such as regulating the behavior of endothelial cells, and is involved in the creation of blood clots. ATIII indirectly influences angiogenesis and endothelial cell function by blocking thrombin. It has been demonstrated that antithrombin III interacts with endothelial cells and modifies their activities. It can lessen endothelial cell proliferation, prevent leukocyte adherence to endothelial cells, and lessen endothelial cell production of growth factors and pro-inflammatory cytokines. These factors may impact vascular remodeling and angiogenesis. Because of its anti-inflammatory qualities, antithrombin III may indirectly affect angiogenesis. Angiogenesis and inflammation are intimately related, and vascular morphogenesis may be influenced by substances that reduce inflammation. The regulating function of ATIII in angiogenesis may be facilitated by its capacity to suppress inflammation. The significance of antithrombin III in preserving vascular homeostasis is underscored by the fact that dysregulation of its levels or function can result in thrombotic diseases or excessive bleeding. A higher risk of venous thromboembolism and other thrombotic problems is linked to antithrombin III deficiency. While antithrombin III's role in vasculogenesis and angiogenesis is not as well-studied compared to other angiogenic factors, emerging evidence suggests its involvement in modulating endothelial cell function and vascular remodeling processes. Further research is needed to elucidate the precise mechanisms ATIII influences vascular morphogenesis and its potential therapeutic implications for angiogenesis-related disorders. The domain analysis of the stimulant chimeric model suggests that the representative domain is Fibrionogen C2, the domain is fibrinogen c, and the conserved sites are Fibrinogen. Fibrinogen, a glycoprotein found in blood plasma, plays a pivotal role in blood clotting (coagulation) by converting into fibrin during coagulation. Fibrinogen's involvement in vascular morphogenesis, specifically in angiogenesis (forming new blood vessels from pre-existing ones), is less direct than its role in coagulation. However, emerging research suggests that Fibrinogen and its degradation products can influence angiogenesis through various mechanisms: Fibrinogen has been shown to exhibit pro-angiogenic properties. Studies have demonstrated that fibrinogen-derived peptides can promote endothelial cell proliferation, migration, and tube formation, which are essential steps in angiogenesis. These peptides may act through specific receptors or signaling pathways on endothelial cells to stimulate angiogenesis[35,36,37]. During coagulation, Fibrinogen is converted into fibrin by the action of thrombin. The resulting fibrin forms a matrix, providing a scaffold for platelets and other blood components to adhere to and form a stable blood clot. This fibrin matrix provides a provisional matrix for endothelial cells to migrate and proliferate during angiogenesis. Fibrin degradation products, generated by the action of fibrinolytic enzymes such as plasmin, can modulate angiogenesis. These degradation products, including fibrin degradation products (FDPs) and fibrin-derived peptides, possess bioactive properties and can influence endothelial cell behavior, vascular permeability, and angiogenic signaling pathways. Fibrinogen and fibrin can interact with various growth factors, cytokines, and extracellular matrix components that regulate angiogenesis. Fibrinogen, for instance, can bind and alter the bioavailability of angiogenic molecules, including fibroblast growth factor (FGF) and vascular endothelial growth factor (VEGF), which in turn affects angiogenic processes. Angiogenesis is necessary to provide oxygen and nutrients to the healing tissues, while fibrin and Fibrinogen play important roles in wound healing and tissue repair. To aid in tissue regeneration, the fibrin matrix that forms at the site of damage serves as a temporary scaffold for angiogenesis and encourages endothelial cell migration and proliferation. While Fibrinogen's primary role is in blood clotting, its involvement in angiogenesis and vascular morphogenesis is increasingly recognized. Further research is needed to elucidate the precise mechanisms by which Fibrinogen and its degradation products influence angiogenesis and their potential therapeutic implications for angiogenesis-related disorders such as wound healing, cancer, and cardiovascular diseases.The resulting inhibitory chimeric model is larger than the stimulant chimeric model.

In Figure 5, the Aa composition of both chimeric models is represented, and observed that the inhibitory ceramic model has more Ala, Arg, Gly, Leu, Tyr, and Val than the stimulant chimeric model. For example, arginine and tyrosine residues are often involved in protein-protein interactions and molecular recognition processes, so a protein with more of these residues may have altered binding capabilities compared to a protein with fewer of these—amino acids such as glycine, alanine, and leucine influence protein structure. Glycine is highly flexible due to its small size, alanine is commonly found in protein helices, and leucine is frequently found in protein hydrophobic cores. Therefore, differences in the abundance of these amino acids could affect the structural characteristics of the proteins.

The stimulant chimeric model has more Cys, Glu, Lys, Pro, Serr, Thr, and Trp. Cysteine residues are crucial for forming disulfide bonds in proteins, contributing to their structural stability and function. Proteins containing disulfide bonds play roles in angiogenesis by modulating growth factor signaling, extracellular matrix (ECM) assembly, and cell-matrix interactions[38]. Glutamate participates in various signaling pathways involved in cell proliferation, migration, and survival. Glutamate receptors and transporters expressed in endothelial cells regulate angiogenic responses by modulating intracellular calcium levels, nitric oxide (NO) production, and vascular permeability.[39].Lysine residues are abundant in extracellular matrix (ECM) proteins such as collagens, Fibrinogen, and fibronectin, which provide structural support for blood vessels. During angiogenesis, ECM proteins containing lysine residues regulate endothelial cell adhesion, migration, and tube formation.[40]. Proline-rich motifs are found in angiogenic factors, cytokines, and extracellular matrix (ECM) proteins involved in vascular remodeling. Proline-rich proteins contribute to proteins' structural stability and flexibility, including those involved in angiogenesis[41]. Serine and threonine residues are protein phosphorylation sites regulating angiogenic signaling pathways. Protein kinases and phosphatases that target serine/threonine residues modulate endothelial cell behavior, proliferation, and migration during angiogenesis[42].Tryptophan metabolism and signaling pathways have been implicated in angiogenesis, inflammation, and immune responses. Tryptophan metabolites such as kynurenine and serotonin can regulate endothelial cell function, vascular permeability, and angiogenic responses[43].

Protein isoelectric point (pI) is crucial in drug design and formulation. For instance, in a study by Böttcher et al. (2010), the authors designed peptides targeting the cell-penetrating peptide transporter, PepT1, by considering the pI of both the peptide and the transporter. By ensuring that the peptide had a different charge from PepT1 at physiological pH, they aimed to enhance peptide transport across cell membranes. This demonstrates how knowledge of pI can guide the design of molecules for improved drug delivery and efficacy[44]. So, proteins' isoelectric point (pI) is critical in various biological processes, including protein-protein interactions, enzyme-substrate interactions, and protein localization within cells.

For example, in a study by Kyte and Doolittle (1982), the authors investigated the role of pI in predicting transmembrane segments in proteins. They found that the distribution of charged residues relative to the pI could provide insights into the topology of membrane proteins, aiding in their prediction and understanding of membrane protein function[45]. A protein's isoelectric point (pI) is the pH at which it carries no net electrical charge. Proteins with different pI values have different charge distributions at a given pH. If one protein has a pI of 7.0 and another has a pI of 8.3 – presumably, the inhibitory chimeric model with a pI of 7.0 will have a zero net charge when the surrounding pH is adjusted to 7.0. At pH values below 7.0, the protein will carry a net positive charge due to more positively charged amino acids (e.g., lysine, arginine) than negatively charged ones (e.g., aspartic acid, glutamic acid).

Conversely, at pH values above 7.0, the protein will carry a net negative charge due to the dominance of negatively charged amino acids. Thus, at pH 7.0, the protein will be least soluble in water and may precipitate out of the solution. The stimulant chimeric model with a pI of 8.3 will carry no net charge at pH 8.3. at pH values below 8.3, the protein will take a net positive charge, while at pH values above 8.3, it will carry a net negative charge. Similarly to the protein with a pI of 7.0, at its pI (pH 8.3), the protein will be least soluble in water. In comparing these two proteins, The protein with a pI of 7.0 will have a net positive charge at physiological pH (around 7.4) and tend to interact more strongly with negatively charged molecules or surfaces. The protein with a pI of 8.3 will have a net negative charge at physiological pH and tend to interact more strongly with positively charged molecules or surfaces. Understanding the pI values of proteins is crucial for various applications, including protein purification, characterization, and predicting their behavior in different biological environments. It allows researchers to manipulate pH conditions to control proteins' solubility, stability, and interactions in biochemical experiments and biotechnological applications[46].

The term "Total number of negatively charged residues (Asp + Glu)" refers to the sum of two specific amino acids: aspartic acid (Asp) and glutamic acid (Glu). These amino acids are considered negatively charged because they contain carboxyl groups that can ionize, releasing a hydrogen ion (H+) and resulting in a negatively charged carboxylate group (COO-). In proteins, aspartic acid and glutamic acid contribute to the protein molecule's overall charge depending on the surrounding environment's pH. These residues tend to be deprotonated at a pH above their respective pKa values (at which 50% of the molecules are deprotonated), carrying a negative charge. They tend to be protonated at a pH below their pKa values, carrying no net charge. A protein's total number of negatively charged residues (Asp + Glu) is essential for understanding its overall charge distribution. It can influence various biological functions, interactions with other molecules, and the protein's behavior under different pH conditions. Proteins with many negatively charged residues may interact preferentially with positively charged molecules or surfaces. In contrast, proteins with many positively charged residues may interact preferentially with negatively charged molecules or surfaces. In summary, the total number of negatively charged residues (Asp + Glu) provides valuable information about the charge distribution of a protein and its potential interactions with other molecules or environments[47].

The placement and type of the negatively charged residues throughout the protein sequence determine how two proteins with 55 and 21 negatively charged residues differ from one another. To be more precise, these charged residues can be negatively charged (like glutamic acid, aspartic acid) or positively charged (like lysine, arginine). It is possible that the protein with 55 charged residues has a greater net charge than the protein with 21 charged residues. Assume that most of these residues have a positive charge. If the protein is primarily negatively charged, the net charge will be negative; otherwise, the protein will have an overall positive net charge. The balance between positively and negatively charged residues affects the net charge of a protein at a specific pH. A higher positive net charge would arise from a greater quantity of positively charged residues. A greater negative net charge would arise from a greater quantity of negatively charged residues.

As discussed, the pI of a protein is the pH at which it carries no net electrical charge. The distribution of charged residues affects the pI value. Proteins with more positively charged residues typically have a higher pI, whereas proteins with more negatively charged residues tend to have a lower pI. Therefore, the protein with 55 charged residues might have a different pI compared to the protein with 21 charged residues, depending on the distribution of these residues and their specific pKa values. Proteins with varying numbers of charged residues may interact differently with other molecules or surfaces. For instance, a protein with many positively charged residues might interact more strongly with negatively charged molecules or surfaces.

In contrast, a protein with many negatively charged residues might interact more strongly with positively charged molecules or surfaces. The distribution and number of charged residues can also influence the protein's biological function. For example, proteins with many positively charged residues might be involved in DNA binding. In contrast, proteins with many negatively charged residues might participate in interactions with RNA or other negatively charged molecules.

The total of two particular amino acids, arginine (Arg) and lysine (Lys), is referred to as the "Total number of positively charged residues (Arg + Lys)." Because these amino acids have amino groups that may take up a proton (H+) in solution and form a positively charged amino group (NH3+), these amino acids are positively charged. Depending on the pH of the surrounding environment, arginine and lysine contribute to the overall positive charge of a protein molecule. These residues typically have a positive charge and are protonated at pH values lower than their corresponding pKa values, which indicate the pH at which 50% of the molecules are protonated. They typically contain no net charge and are deprotonated at pH levels higher than their pKa values. Understanding the overall charge distribution of a protein requires knowledge of its total amount of positively charged residues (Arg + Lys). It can affect the behavior of the protein at different pH levels, as well as a range of biological processes and interactions with other molecules. Proteins having a high concentration of positively charged residues may interact more favorably with surfaces or molecules that are negatively charged. Proteins having a high concentration of negatively charged residues, on the other hand, can interact more favorably with positively charged surfaces or molecules. In conclusion, a protein's charge distribution and possible interactions with other molecules or surroundings can be inferred from the total number of positively charged residues (Arg + Lys).

The main differences between the two proteins with 55 and 23 positive charged residues (Arg + Lys) are the overall positive charge distribution and possible interactions. This is where the difference could show up: Net Positive Charge: Compared to a protein with 23 positively charged residues, the protein with 55 positively charged residues will probably have a higher net positive charge. The behavior and interactions of the protein may be significantly affected by this increased net positive charge, particularly in situations where negatively charged molecules or surfaces are present. A protein's distribution and quantity of positively charged residues impact its isoelectric point or pI. A higher pI is typically found in proteins with a greater number of positively charged residues.

Consequently, compared to a protein with 23 positively charged residues, the protein with 55 positively charged residues may have a larger pI. Positively charged residues in proteins may enhance their interaction with negatively charged molecules or surfaces. These contacts might involve attaching to negatively charged membranes, interacting with negatively charged areas of other proteins, or binding to nucleic acids (DNA or RNA). Because of its higher net positive charge, the protein with 55 positively charged residues may interact with negatively charged molecules or surfaces more strongly than the protein with 23 positively charged residues. The quantity and distribution of positively charged residues can affect how a protein functions biologically. Proteins with many positively charged residues may be involved in membrane association, enzymatic activity, or DNA or RNA binding. The overall structure, additional amino acid residues, and the cellular environment in which the proteins with 55 and 23 positively charged residues function will determine their particular roles. In conclusion, differences in the positive charge distribution of two proteins can affect their interactions, stability, and biological functions. These variations are indicated by the difference in the total amount of positively charged residues (Arg + Lys) between the two proteins.

Furthermore, a protein's total number of negative charge residues plays a crucial role in its behavior and function. Negatively charged residues, such as aspartic acid (Asp) and glutamic acid (Glu), contribute to the overall net charge of a protein. These charges help prevent protein aggregation by maintaining solubility. Protein aggregation can lead to dysfunction or disease, while solubility is essential for proper protein folding, interactions, and cellular processes. Charged residues form ion pairs, hydrogen bonds, and other electrostatic interactions. These interactions influence protein structure, folding, binding, and condensation. Long-range electrostatic effects impact protein behavior, including ligand binding and enzymatic reactions. As proteins are synthesized, the nascent polypeptide passes through the negatively charged exit tunnel of the ribosome; positively charged stretches within the nascent peptide can interact with ribosome walls and slow down translation. Thus, charged polypeptides affect protein expression and translation efficiency. Charge ladders involve chemical modification of charged residues to generate derivatives with varying charges[48].

The estimated half-life of a protein refers to the time it takes for half of the protein molecules in a cell or biological system to be degraded or otherwise become inactive. Protein half-life can vary widely depending on several factors, including the specific protein, cell type, organism, and physiological conditions. In general, the half-life of proteins can range from minutes to days or even longer. Some proteins have very short half-lives, meaning they are rapidly turned over within cells, while others are more stable and persist for more extended periods. For example, (a)short-lived proteins: proteins involved in cellular signaling, regulation, or response to environmental changes often have short half-lives. These proteins are rapidly synthesized and degraded as part of the cell's dynamic response to stimuli. Examples include transcription factors, cell cycle regulators, and specific signaling molecules. (b)long-lived proteins: structural proteins, enzymes, and proteins that maintain cellular structure and function tend to have longer half-lives. These proteins are essential for the cell's structure and function and are typically turned over more slowly. Examples include structural components of the cytoskeleton, enzymes involved in primary metabolic processes, and histones[49,50].

The half-life of a protein is influenced by various factors :(a) protein structure- proteins with specific structural features, such as disordered regions or post-translational modifications, may be more susceptible to degradation. (b) -cellular environment: cellular conditions such as nutrient availability, stress, and signaling pathways can affect protein stability and turnover rates; (c) protein interactions: protein-protein interactions and association with other cellular components can influence protein stability and degradation; (d): post-translational modifications - modifications such as ubiquitination or phosphorylation can target proteins for degradation by the proteasome or lysosomes, affecting their half-life. Estimating the half-life of a specific protein often involves experimental approaches such as pulse-chase assays, metabolic labeling, or computational modeling. These techniques help researchers understand protein turnover dynamics and their roles in cellular processes. Additionally, databases and computational tools provide estimates or predictions of protein half-lives based on experimental data and computational algorithms, aiding researchers in studying protein dynamics and cellular regulation. Overall, the inhibitory protein has a half-time of five times greater than the stimulant one. Her biological effect lasts longer and is less susceptible to degradation than the stimulant protein.

The instability index of a protein is a numerical value that predicts the stability of a protein based on its amino acid sequence. It was introduced by Guruprasad et al. in 1990 as a method to estimate the stability of proteins from their primary sequence. The instability index is calculated using a formula that considers various physicochemical properties of amino acids in the protein sequence, including the relative volume of each amino acid, the hydropathy index, and the presence of dipeptides that tend to occur in unstable regions. The instability index can be helpful for researchers in various areas, including protein engineering, protein expression, and structural biology. It provides a quick and rough estimate of a protein's stability based solely on its amino acid sequence, which can help researchers prioritize proteins for further study or experimental manipulation. However, it's important to note that the instability index is just one of many factors that contribute to protein stability, and experimental validation is often necessary to confirm the predicted stability of a protein. The instability index is computed after the following formula:

Instability index=10×(Ntotallarge+ncharged−length total) where: n large is the number of amino acids with high relative volume (Val, Ile, Leu, Phe, Tyr, and Trp), n charged is the number of charged amino acids (Arg, Lys, Asp, and Glu), N total is the total number of amino acids in the sequence., length is the length of the protein sequence. Results show that both proteins are stable[51,52,53].

The aliphatic index of a protein is a measure of its thermostability, specifically related to the aliphatic amino acids present in its sequence. Aliphatic amino acids are those with non-aromatic side chains, which typically include alanine (Ala), valine (Val), isoleucine (Ile), and leucine (Leu). The aliphatic index is calculated based on the relative volume occupied by aliphatic side chains in the protein, contributing to its stability at high temperatures. A higher aliphatic index suggests a more significant proportion of aliphatic amino acids in the protein sequence, which is associated with increased thermostability. The difference in aliphatic index between the two proteins is the following: the inhibitory chimeric model has an index of 86.32. This protein has a high aliphatic index, indicating a significant proportion of aliphatic amino acids in its sequence. Such proteins are typically more stable at high temperatures and may be better adapted to environments with extreme conditions, such as heat or pH extremes. The stimulant chimeric model has an aliphatic index of 54.88 - it suggests a lesser proportion of aliphatic amino acids in its sequence, which may result in lower thermostability than the protein with the higher aliphatic index.

In summary, the difference in aliphatic index between these two proteins suggests differences in their potential thermostability. The protein with the higher aliphatic index (86.32) is likely more thermostable than the protein with the lower aliphatic index (54.88). However, other factors beyond aliphatic amino acids, such as overall protein structure and composition, can also influence a protein's stability[54,55].

The Grand average of hydropathicity (GRAVY) is a measure that quantifies the overall hydrophobicity or hydrophilicity of a protein sequence. It is calculated by averaging the hydropathy values of all amino acids in the sequence. Hydropathy values represent the relative hydrophobicity or hydrophilicity of amino acids. Positive hydropathy values indicate hydrophobic amino acids (which tend to be buried inside the protein structure away from water). In contrast, negative values indicate hydrophilic amino acids (those that tend to be exposed to the aqueous environment). The GRAVY score is calculated by summing the hydropathy values of all amino acids in the sequence and dividing by the number of residues. A negative GRAVY score indicates a predominance of hydrophilic residues in the protein sequence, while a positive GRAVY score indicates a predominance of hydrophobic residues. The inhibitory chimeric model has a GRAVY of -0.258: this protein has a negative GRAVY score, suggesting that, on average, its amino acid sequence is hydrophilic. Such proteins will likely have more polar or charged residues on their surface, making them more soluble and potentially interacting favorably with water molecules.

With a GRAVY of -0.594, the stimulant chimeric model's protein is even more hydrophilic than the first protein, indicating a lower GRAVY score. Its sequence probably has more hydrophilic residues than the protein, with the GRAVY value of -0.258. In conclusion, variations in the GRAVY scores of these two proteins point to variations in their general hydrophilicity. Compared to the protein with the higher GRAVY score (-0.258), the one with the lower value (-0.594) is probably even more hydrophilic[56].

Comprehending the molecular architecture of a protein is crucial for deciphering the correlations between its structure and function, forecasting its biological functions, and developing ligands or modulators that engage with particular protein sections or characteristics. Computational approaches, structural biology methods (such as X-ray crystallography and nuclear magnetic resonance spectroscopy), and bioinformatics tools for sequence and structural analysis can all be used to analyze the chemical space of proteins. Both spaces have the same geometry by comparing the inhibitory and stimulant proteins and chemical space. The inhibitory space is narrower than the stimulant one. Also, the stimulant space is more represented in the negative domain, whereas the inhibitory space occupies both negative and positive domains. These results are based on the chemical space representation by chemical descriptors, which follows the chemical space represented by polynomial equations.

The chimeric models' chemical spaces show both as aspected a dimensional reduction. Both spaces have the same geometry. In opposition to the protein chemical spaces, the chimeric model space is more expansive than the stimulant chimeric model.

The "C-alpha distance map" shows explicitly the distances between C-alpha atoms and often depicts the spatial arrangement of atoms in a protein structure. The C-alpha atom, a component of the protein's backbone, is utilized in protein structure as a point of reference to characterize the general folding pattern. The distances between each pair of C-alpha atoms in a protein structure are shown graphically in the C-alpha distance map. This map can be used to comprehend the spatial interactions between various protein components, detect structural motifs, and examine the overall folding pattern[57].

All three polynomials are of degree 6. The leading coefficients are inhibitory space: -23.758, stimulant space: -1.1017, and combine space: -7.9346. The behavior is determined by the leading term of the polynomial: inhibitory space: As x→±∞x→±∞, y→−∞y→−∞, stimulant space: As x→±∞x→±∞, y→−∞y→−∞., combine space: As x→±∞x→±∞, y→−∞y→−∞.while all three polynomials have the same degree, their leading coefficients and coefficients of the other terms differ, leading to distinct behaviors and shapes.

The leading coefficient in the equation generated from inhibitory space is negative (-23.758), meaning that the polynomial function both increases and reduces quickly as x increases and lowers. The function's general shape is likewise influenced by the other coefficients. For example, the positive coefficient of x4x4 implies that there can be local maxima and minima for the function. Because the coefficients' signs alternate, the function may behave oscillatorily or have several turning points. The function approaches negative infinity as x approaches either positive or negative infinity, showing a decreasing tendency at both extremes. Compared to the other two functions, the leading coefficient (-23.758) indicates a stronger decreasing trend.

In the stimulant space, similar to the inhibitory space, the leading coefficient is negative (-1.1017), indicating a downward trend at both extremes. The coefficients contribute to the shape of the function. For example, the positive coefficient of x4 suggests the presence of local maxima and minima. The function may also exhibit oscillatory behavior or have multiple turning points. As x approaches positive or negative infinity, the function approaches negative infinity. The leading coefficient is less negative (-1.1017), indicating a relatively less steep downward trend than the inhibitory space.

Finally, a downward trend is indicated at both extremities by the negative (-7.9346) leading coefficient in the combined space function. Local maxima and minima may result from the coefficients' influence on the function's form. Similar to other functions, there could be several turning points or oscillatory behavior. The function becomes closer to negative infinity as x gets closer to positive or negative infinity. Although it is likewise negative (-7.9346), the leading coefficient's size places it in between the other two, indicating an intermediate rate of decline.

All three polynomial functions exhibit a downward trend at both extremes, with potential oscillatory behavior and multiple turning points. The specific values of the coefficients will determine each function's exact shape and behavior. Graphing these functions would provide a more precise visualization of their behavior and any distinctive features they may have.

Each polynomial has different coefficients for terms of higher orders (i.e., x4,x5,x6). These coefficients contribute to the shape of the polynomial curve and influence the presence of local extrema (maxima and minima). The Inhibitory Space has more significant magnitude coefficients for most higher-order terms than the other two, potentially leading to more pronounced oscillations or sharper turns in the curve. Stimulant Space and combined space have more moderate coefficients for higher-order terms, suggesting smoother curves than inhibitory space.

Critical points, where the function's derivative is zero, correspond to potential local extrema or inflection points. The locations and nature of these vital points would depend on the specific values of the coefficients in each polynomial. Due to its unique coefficient values, inhibitory space might have critical points at different locations than stimulant and combined space. Inhibitory space may exhibit more erratic behavior than smoother curves of stimulant space and combined space, given its more significant and steeper leading coefficients ( as seen in the figure above).

Overall, regarding the inhibitory space, This polynomial function might represent a scenario where the response or activity is inhibited or suppressed. Inhibitory processes are standard in various biological and physical systems where certain factors decrease the activity or effectiveness of other factors. Multiple roots, critical points, and inflection points suggest a complex behavior with potential oscillations or fluctuations in the inhibitory response. The negative leading coefficient indicates a downward trend, suggesting that as the input x increases, the inhibitory effect becomes more robust, decreasing the response or activity.

Stimulant space- This polynomial function may represent a scenario where the response or activity is stimulated or enhanced. Stimulant processes are often observed in biological, chemical, and physical systems where certain factors increase the activity or effectiveness of other factors. Like the inhibitory space, multiple roots, critical points, and inflection points suggest a complex behavior with potential oscillations or fluctuations in the stimulant response. The negative leading coefficient also indicates a downward trend, suggesting that the stimulant effect strengthens as the input x increases, increasing the response or activity.

The combined space polynomial function combines elements of both inhibitory and stimulant effects, perhaps representing a scenario where both factors simultaneously influence the overall response or activity. Multiple roots, critical points, and inflection points suggest a complex interaction between inhibitory and stimulant processes, leading to potentially intricate behavior. The negative leading coefficient indicates an overall downward trend, but the specific behavior depends on the combined effects of the individual terms in the polynomial.

Overall, these polynomial functions provide mathematical representations of complex processes in inhibitory, stimulant, and combined spaces. Their analysis helps understand the behavior and interactions of factors within these spaces. It can be valuable in various fields, such as biology, chemistry, physics, and economics.

In the context of angiogenesis, the inhibitory space polynomial function might represent factors or processes that inhibit or suppress angiogenesis. The polynomial's complex behavior, with multiple roots, critical points, and inflection points, could represent the intricate interplay of various inhibitory factors in regulating angiogenesis. For example, specific molecules like angiostatin or endostatin inhibit angiogenesis by blocking the activity of pro-angiogenic factors. The polynomial could represent the combined effect of these inhibitory factors.

In the context of angiogenesis, the stimulant space polynomial function might represent factors or processes that stimulate or promote angiogenesis. Like the inhibitory space, the polynomial's complex behavior could represent the multifaceted nature of stimulatory factors in regulating angiogenesis. For instance, vascular endothelial growth factor (VEGF) and fibroblast growth factor (FGF) are potent angiogenesis stimulators. The polynomial could represent the combined effect of these stimulatory factors.

The combined space polynomial function combines inhibitory and stimulant effects on angiogenesis. In the context of angiogenesis, this polynomial could represent the balance between inhibitory and stimulatory factors that determine the net impact on angiogenesis. The polynomial's behavior reflects the complex interactions between factors that promote or inhibit angiogenesis, resulting in intricate regulation of blood vessel formation.

4. Materials and Methods

To explore the chemical space of VEGFR2, ligand docking and protein-protein docking methodologies were used. Firstly, VEGFR2 was energetically minimized protonated at physiological pH and temperature. AMBBER 99 force field was used for all protein preparation and docking computations. Regarding the ligand docking, a set of 278 molecules was retrieved randomly from the ChEMBL 01 database using a randomized extraction protocol using the MtiOpneScreen server [58,59]. The molecules were energetically minimized and protonated at physiological pH and temperature. The VEGFR2 binding site was computed using MOE 2009 software and from the literature[60]. AutoDock software was used to dock the ligands[61]. Docking results were selected based on the total energy of the complex (Kcal/mol). The total energy, solvation energy, and angular energy were computed for each protein-ligand complex. The first 27 energetically favorable ligands are represented in the table below ( the rest of the 278 ligands are described in the supplemental material S₃). Table 6:

Regarding protein-protein docking a series of compounds with demonstrated antiangiogenetic or angiogenetic activity were selected from the literature to explore the chemical space. The preferred compounds were studied computationally. The following molecules have been chosen as angiogenesis stimulators: Insulin-Like-Growth-Factor-1 (IGF-1) (PDB ID 1B9G)[62], Basic Fibroblast Growth Factor (bFGF) ( PDB ID 1BFB)[63], Hepatocyte Growth Factor (HGF) (PDB ID 1GP9)[3], Human Epidermal Growth Factor (EGF) (PDB ID 1JL9)[64], Transforming Growth Factor Beta 1 (TGF beta-1) (PDB ID 1KLA)[65], Human Platelet-Derived Growth Factor Bb (PDGF B) (PDB ID 1PDG)[66], Angiopoietin 2(PDB ID 1Z3S)[67], Human Vascular Endothelial Growth Factor-B (VEGFB) (PDB ID 2C7W)[68], Human Transforming Growth Factor Alpha TGF alpha(PDB ID 2TGF)[69], Vascular Endothelial Growth Factor C (VEGFC) (PDB ID 2X1W)[70], Vascular Endothelial Growth Factor D (VEGF D) (PDB ID 2XV7)[71], Interleukin 8 (IL8) ( PDB ID 3IL8)[72], Platelet-Derived Growth Factor A (PDGF A) (PDB ID 3MJK)[73], Angiopoietin 1 ( PDB ID 4JYO)[74], Human Transforming Growth Factor Alpha (TNF alpha) (PDB ID 4TGF)[75], Vascular Endothelial Growth Factor A (VEGFA) (PDB ID 6Z13)[76], Platelet-Derived Growth Factor C (PDGFC) ( homology model 1 UniProt ID Q9NRA1)[77], Phosphatidylinositol-Glycan Biosynthesis Class F Protein (PIGF) ( homology model 2 UniProt ID Q07326)[78] ,Platelet-Derived Growth Factor D (PDGFD) ( homology model 3, UniProt ID Q9GZP0)[79]. Molecules that have been chosen as inhibitors are the following: Human Interleukin-4 (IL4 )(PDB ID 1BBN)[80], Human Interleukin-12 (IL12) ( PDB ID 1F45)[81], Interferon-gamma ( PDB ID 1HIG)[82], Human Pigment Epithelium-Derived Factor ( PEDF) ( PDB ID 1IMV)[83], Human Angiostatin ( PDB ID 1KI0)[84], Endostatin ( PDB ID 1KOE)[85], Thrombostatin 1 ( PDB ID 1LSL)[86], Human Interferon Alpha ( PDB ID 1RH2)[87], Human Skeletal Muscle Troponin (PDB ID 1YTZ)[88], Thrombospondin 2 ( PDB ID 2RHP)[89], Antithrombin II ( PDB ID 4EB1)[90], Vasostatin ( homology model 4 UniProt ID P10645)[91]. Protein-protein docking was performed using HADDOCK 2.0 server[92] ). The total, solvation, and angular energy were computed for each protein-protein complex. To explore the chemical space of inhibitors and stimulants of tyrosin kinase concerning angiogenesis, molecular descriptors were calculated using ChemDes(Web) software packages[93,94,95] for all proteins ( inhibitory as stimulants). Using molecular descriptors ( number of H bond acceptors, number of H bond donors, polar surface area, shape attribute, the sum of degrees, som of valence degrees), the chemical space for VEGFR stimulators and inhibitors was characterized and represented as radial graphs. Furthermore, the chemical space was computed using the same methodology for the chimeric models. Also, docking of the chimeric models with VEGFR 2 was performed, and the total energy, solvation energy, and angular energy were calculated ( kcal/mol). To further explore the molecular systems of angiogenesis stimulants and inhibitors, c-c atom distances were computed. Based on the carbon-carbon distances matrix, a multidimensional space was represented. Based on the multifaceted space representation of considerable data reduction, a six-degree polynomial equation system was computed for the chimeric and inhibitory chimeric models. The six-degree equations calculated 2D and 3D space maps using the online computational server Wolphram Alpha [96]. Ramachandran plots were also used to assess the stability and reliability of the protein models. Finally, Aa sequences and molecular descriptors data have been compared to get insights into angiogenesis's chemical stimulant and inhibitor space.

5. Conclusions

The chemical space of angiogenesis stimulators and inhibitors is slightly similar. However, the chemical space of inhibitors is more expended than stimulators, indicating a most probable interaction. A most probable interaction with the inhibitor space is due to the inhibitors' expenses of the chemical space -by being more conformationally favorable for a diverse set of molecules compared to the stimulants. Also, a broader chemical space is more energetically and conformationally favorable than a less expanded chemical space. These characteristics are also transposed to the chimeric models, where the inhibitor chimeric model is a larger size molecule than those of the stimulant chimeric model. Also, regarding the molecular interactions, the inhibitors have slightly more favorable complex energies than the stimulants. Mathematically, the inhibitory space has a narrower domain than the stimulant space but expands on negative and positive domains. This means the interactions are possible with distinct and variated conformations compared to the stimulant space. Also, interaction with molecules that pose symmetry is favorable. The stimulant space is expended mainly on the negative larger domain. The consequence of this geometry is primarily a selective, wider domain for more specific and less accessible conformations.

The chemical space and domain distribution are critical factors in VEGFR2 behavior as a stimulant or angiogenesis inhibitor. Further experimental and silico studies are needed to characterize and quantify the complex VEGFR system and its role in angiogenesis and vascular morphogenesis.

Supplementary Materials

The following supporting information can be downloaded at the website of this paper posted on Preprints.org.

Author Contributions

C.N.L. conceptualization, writing original draft and methodology; I.M. writing original draft preparation; G.G. validation, review, and editing; M.C.M. conceptualization and review;

Funding

The manuscript received no funding

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Arjan W Griffioen, Andrew C Dudley. Angiogenesis: a year in review. Angiogenesis 2021, 24, 195–196. [Google Scholar] [CrossRef] [PubMed]
Peace Mabeta, Vanessa Steenkamp. The VEGF/VEGFR Axis Revisited: Implications for Cancer Therapy. Int J Mol Sci. 2022, 23, 15585. [Google Scholar] [CrossRef] [PubMed]
Peter Carmeliet VEGF as a key mediator of angiogenesis in cancer. Oncology, 2005; 69, Suppl. 3, 115585.
Napoleone Ferrara, Hans-Peter Gerber, Jennifer LeCouter. The biology of VEGF and its receptors. Nat med 2003, 9, 669–676. [Google Scholar] [CrossRef] [PubMed]
Masabumi Shibuya. Vascular endothelial growth factor and its receptor system: physiological functions in angiogenesis and pathological roles in various diseases. j Biochem 2013, 153, 13–19. [CrossRef]
Alex Kiselyov, Konstantin V Balakin, Sergey E Tkachenko. VEGF/VEGFR signaling as a target for inhibiting angiogenesis. Expert Opin Investing Drugs 2007, 16, 83–107. [Google Scholar] [CrossRef]
Zhibao Wang, Bo Cao, Peng Ji, Fan Yao. Propofol inhibits tumor angiogenesis through targeting VEGF/VEGFR and mTOR/eIF4E signaling. Biochem Biophys Res Commun 2021, 555, 13–18. [Google Scholar] [CrossRef]
Daniel J Hicklin, Lee M Ellis. Role of the vascular endothelial growth factor pathway in tumor growth and angiogenesis. J Clin Oncol 2005, 23, 1011–1027. [Google Scholar] [CrossRef]
Elizabeth Bowler, Sebastian Oltean. Alternative Splicing in Angiogenesis. Int J Mol Sci. 2019, 20, 2067. [Google Scholar] [CrossRef]
Anna-Karin Olsson, Anna Dimberg, Johan Kreuger, Lena Claesson-Welsh. VEGF receptor signalling - in control of vascular function. Nat Rev Mol Cell Biol. 2006, 7, 359–371. [Google Scholar] [CrossRef]
Fatemeh Zare Shahneh, Behzad Baradaran, Fatemeh Zamani, Leili Aghebati-Maleki. Human Antibodies. 2013, 22, 15–19.
Masabumi Shibuya. VEGFR and type-V RTK activation and signaling. Cold Spring Harb Perspect Biol. 2013, 5, a009092. [Google Scholar]
A F Karamysheva. Mechanisms of angiogenesis. Biochemistry. 2008, 73, 751–762. [Google Scholar]
Chloe J Peach, Viviane W Mignone, Maria Augusta Arruda, Diana C Alcobia, Stephen J Hill, Laura E Kilpatrick, Jeanette Woolard. Molecular Pharmacology of VEGF-A Isoforms: Binding and Signalling at VEGFR2. Int J Mol Sci. 2018, 19, 1264.
Masaubmi Shibuya. Vascular endothelial growth factor receptor-1 (VEGFR-1/Flt-1): a dual regulator for angiogenesis. Angiogenesis 2006, 9, 225–230. [Google Scholar] [CrossRef] [PubMed]
Du Yang, Chunna Jin, Hong Ma, Mingyuan Huang, Guo-Ping Shi, Jianan Wang, Meixiang Xiang. The EphrinB2/EphB4 pathway in postnatal angiogenesis is a potential therapeutic target for ischemic cardiovascular disease. Angiogenesis 2016, 19, 297–309. [Google Scholar] [CrossRef] [PubMed]
H H Marti, W Risau. Angiogenesis in ischemic disease. 1999, 82 (Suppl. 1), 44–52. [Google Scholar]
Böttcher S, Zaitseva E, Ewert S, Hunte C, Welte W, Freund C. 2010. The structure of the complex between a peptide transport receptor and the peptidoglycan-associated lipoprotein TsaP suggests a recognition mode similar to that of lipoprotein signal peptides. Journal of Molecular Biology. 2010, 401, 222–234. [Google Scholar]
Okaniwa, M.; Hirose, M.; Imada, T.; Ohashi, T.; Hayashi, Y.; Miyazaki, T.; Arita, T.; Yabuki, M.; Kakoi, K.; Kato, J.; Takagi, T.; Kawamoto, T.; Yao, S.; Sumita, A.; Tsutsumi, S.; Tottori, T.; Oki, H.; Sang, B.C.; Yano, J.; Aertgeerts, K.; Yoshida, S.; Ishikawa, T. Design and synthesis of novel DFG-out RAF/vascular endothelial growth factor receptor 2 (VEGFR2) inhibitors. 1. Exploration of [5,6]-fused bicyclic scaffolds. J Med Chem 2012, 55, 3452–3478. [Google Scholar] [CrossRef] [PubMed]
Gabriela Bitencourt-Ferreira, Walter Filgueira de Azevedo. Homology Modeling of Protein Targets with MODELLER. Methods Mol Biol. 2019, 2053, 231–249. [Google Scholar]
Michael G Prisant, Christopher J Williams, Vincent B Chen, Jane S Richardson, David C Richardson. New tools in MolProbity validation: CaBLAM for CryoEM backbone, UnDowser to rethink "waters," and NGL Viewer to recapture online 3D graphics. Protein Sci. 2020, 29, 315–329. [Google Scholar] [CrossRef]
Janez Konc. Binding site comparisons for target-centered drug discovery. Expert Opin in Drug Discov. 2019, 14, 445–454. [Google Scholar] [CrossRef] [PubMed]
Tianwen Wang, Ningning Yang, Chen Liang, Hongjv Xu, Yafei An, Sha Xiao, Mengyuan Zheng, Lu Liu, Gaozhan Wang, Lei Nie. Detecting Protein-Protein Interaction Based on Protein Fragment Complementation Assay. Curr Ptrotein Sci. 2020, 21, 598–610. [Google Scholar]
Kazuhiro Takemura, Nobuyuki Matubayasi, Akio Kitao. Binding free energy analysis of protein-protein docking model structures by evERdock. J Chem Phys 2018, 148, 105101. [Google Scholar] [CrossRef] [PubMed]
Martin Zacharias. Protein-protein docking with a reduced protein model accounting for side-chain flexibility. Protein Sci. 2003, 12, 1271–1282. [Google Scholar] [CrossRef]
B K Shoichet, I D Kuntz. Protein docking and complementarity. J Mol Biol. 1991, 221, 327–346. [Google Scholar] [CrossRef] [PubMed]
Mayuko Takeda-Shitaka, Daisuke Takaya, Chieko Chiba, Hirokazu Tanaka, Hideaki Umeyama. Protein structure prediction in structure-based drug design. Curr Med Chem, 2004, 11, 551–558.
Jonathan Hasselmann, Morgan A Coburn, Whitney England, Dario X Figueroa Velez, Sepideh Kiani Shabestari, Christina H Tu, Amanda McQuade, Mahshad Kolahdouzan, Karla Echeverria, Christel Claes, Taylor Nakayama, Ricardo Azevedo, Nicole G Coufal, Claudia Z Han, Brian J Cummings, Hayk Davtyan, Christopher K Glass, Luke M Healy, Sunil P Gandhi, Robert C Spitale, Mathew Blurton-Jones Development of a Chimeric Model to Study and Manipulate Human Microglia In Vivo. Neuron. Development of a Chimeric Model to Study and Manipulate Human Microglia In Vivo. Neuron. 2019, 103, 1016–1033.e10.
Javad Fathi, Shahram Nazarian, Emad Kordbacheh, Nahal Hadi. An in silico Design, Expression and Purification of a Chimeric Protein as an Immunogen Candidate Consisting of IpaD, StxB, and TolC Proteins from Shigella spp. Avicenna J Med Biotechnol. 2022, 14, 247–258. [Google Scholar]
Hassan Dana, Ghanbar Mahmoodi Chalbatani, Elahe Gharagouzloo, Seyed Rouhollah Miri, Fereidoon Memari, Reza Rasoolzadeh, Mohammad Reza Zinatizadeh, Peyman Kheirandish Zarandi, Vahid Marmari. In silico Analysis, Molecular Docking, Molecular Dynamic, Cloning, Expression and Purification of Chimeric Protein in Colorectal Cancer Treatment. Drug Des Devel Ther 2020, 14, 309–329. [Google Scholar] [CrossRef]
Rakesh Trivedi, Hampapathalu Adimurthy Nagarajaram. Substitution scoring matrices for proteins - An overview. Protein Sci. 2020, 29, 2150–2163. [Google Scholar] [CrossRef]
32. William R Pearson. Selecting the Right Similarity-Scoring Matrix. Curr Protoc Bioinformatics. 2013; 43, 3.5.1–3.5.9.
Bing Yan, Lu Luo, Li Liu, Zhenyu Wang, Ruiying Chen, Yi Wu, Xiao Xiao. Serpin family proteins as potential biomarkers and therapeutic drugs in stroke: A systematic review and meta-analysis on clinical/preclinical studies. CNS Neuroscio. Ther. 2023, 29, 1738–1749. [Google Scholar] [CrossRef] [PubMed]
K Suzuki. The multi-functional serpin, protein C inhibitor: beyond thrombosis and hemostasis.J Thromb Haemost. 2008, 6, 2017–2026.
Seung Woo Chung, Myungjin Lee, Sang Mun Bae, Jooho Park, Ok Cheol Jeon, Hui Sun Lee, Han Choe, Han Sung Kim, Beom Suk Lee, Rang-Woon Park, Sang Yoon Kim, Youngro Byun. Potentiation of anti-angiogenic activity of heparin by blocking the ATIII-interacting pentasaccharide unit and increasing net anionic charge. Biomaterilas. 2012, 33, 9070–9079. [Google Scholar] [CrossRef] [PubMed]
Teena Bhakuni, Mohammad Farhan Ali, Irshad Ahmad, Shadabi Bano, Shoyab Ansari, Mohamad Aman Jairajpuri. Role of heparin and non heparin binding serpins in coagulation and angiogenesis: A complex interplay. Arch Biochem Biophys. 2016, 604, 128–142. [Google Scholar] [CrossRef] [PubMed]
Matthias W Laschke, Zeynep Cengiz, Johannes N Hoffmann, Michael D Menger, Brigitte Vollmar. Latent antithrombin does not affect physiological angiogenesis: an in vivo study on vascularization of grafted ovarian follicles. Life Sci. 2004, 75, 203–213. [Google Scholar] [CrossRef] [PubMed]
Naganuma, T.; Nomura, T.; & Yamauchi, A.; & Yamauchi, A. Application of cysteine-bridged peptides to angiogenesis research. Biological & Pharmaceutical Bulletin 2014, 37, 330–334. [Google Scholar]
Förstermann, U.; & Sessa, W.C.; & Sessa, W. C. Nitric oxide synthases: regulation and function. European Heart Journal 2012, 33, 829–837. [Google Scholar] [CrossRef] [PubMed]
Martin, P.; Sutherland, D.; & Watt, F.M. (2019). Wound Healing: An overview. In J. C. Hardingham & A. Muir (Eds.), Extracellular Matrix: Protocols (pp. 41–54). Humana Press.
Myllyharju, J.; Kivirikko, K.I. Collagens modify enzymes and their mutations in humans, flies, and worms. Trends in Genetics 2004, 20, 33–43. [Google Scholar] [CrossRef] [PubMed]
Potente, M.; & Carmeliet, P.; & Carmeliet, P. The Link Between Angiogenesis and Endothelial Metabolism. Annual Review of Physiology 2017, 79, 43–66. [Google Scholar] [CrossRef]
Mellor, A.L.; & Munn, D.H.; & Munn, D. H. Creating immune privilege: active local suppression that benefits friends but protects foes. Nature Reviews Immunology 2008, 8, 74–80. [Google Scholar] [CrossRef]
Christos T Chasapis, Garyfallos Konstantinoudis. Protein isoelectric point distribution in the interactomes across the domains of life. Biophys Chem. 2020, 256, 106269. [Google Scholar]
Zuwei Luo, Jing Li, Jing Qu, Weihua Sheng, Jicheng Yang, Mingzhong Li. Cationized Bombyx mori silk fibroin as a delivery carrier of the VEGF165-Ang-1 coexpression plasmid for dermal tissue regeneration. J Mater Chem B. 2019, 7, 80–94.
46. Samuel C Wolff , Ai-Dong Qi, T Kendall Harden, Robert A Nicholas. Charged residues in the C-terminus of the P2Y1 receptor constitute a basolateral-sorting signal. J Cvell Sci. 2010; 123, Pt 14, 2512–2520.
Pragya Tiwari, M J Khan. Molecular and Computational Studies on Apoptotic Pathway Regulator, Bcl-2 Gene from Breast Cancer Cell Line MCF-7.indian j pharm Sci. 2016, 78, 87–93.
Song-Ho Chong, Sihyun Ham. Distinct role of hydration water in protein misfolding and aggregation revealed by fluctuating thermodynamics analysis. Acc Chem Res. 2015, 48, 956–965. [Google Scholar] [CrossRef]
David M Mauger, B Joseph Cabral, Vladimir Presnyak, Stephen V Su, David W Reid, Brooke Goodman, Kristian Link, Nikhil Khatwani, John Reynders, Melissa J Moore, Iain J McFadyen. mRNA structure regulates protein expression through changes in functional half-life. Proc Natl Acad Sci USA. 2019, 116, 24075–24083. [Google Scholar] [CrossRef]
Igor Hrgovic, Monika Doll, Andreas Pinter, Roland Kaufmann, Stefan Kippenberger, Markus Meissner. Histone deacetylase inhibitors interfere with angiogenesis by decreasing endothelial VEGFR-2 protein half-life in part via a VE-cadherin-dependent mechanism. Exp Dermatol. 2017, 26, 194–201. [Google Scholar] [CrossRef]
Dilani G Gamage, Ajith Gunaratne, Gopal R Periyannan, Timothy G Russell. Applicability of Instability Index for In vitro Protein Stability Prediction. Protein Pept Lett. 2019, 26, 339–347. [Google Scholar] [CrossRef]
Sarika Agrawal, Uttam Kumar Jana, Naveen Kango. Heterologous expression and molecular modelling of L-asparaginase from Bacillus subtilis ETMC-2.Int J Biol Macromol 2021 Dec 1:192:28-37.
Guilu Zhang, Wenjun Zhang. Direct protein-protein interaction network for insecticide resistance based on subcellular localization analysis in Drosophila melanogaster. J Environ Sci Health B. 2020, 55, 732–748. [Google Scholar] [CrossRef]
A Ikai. Thermostability and aliphatic index of globular proteins. J Biochem. 1980, 88, 1895–1898. [Google Scholar]
Pranathi Karnati, Rekha Gonuguntala, Kalyani M Barbadikar, Divya Mishra, Gopaljee Jha, Vellaisamy Prakasham, Priyanka Chilumula, Hajira Shaik, Maruthi Pesari, Raman Meenakshi Sundaram, Kannan Chinnaswami. Performance of Novel Antimicrobial Protein Bg_9562 and In Silico Predictions on Its Properties with Reference to Its Antimicrobial Efficiency against Rhizoctonia solani.Antibiotics (Basel). 2022, 11, 363.
Christin Scheller, Finja Krebs, Robert Minkner, Isabel Astner, Maria Gil-Moles, Hermann Wätzig. Physicochemical properties of SARS-CoV-2 for drug targeting, virus inactivation and attenuation, vaccine formulation and quality control. Electrophoresis. 2020, 41, 1137–1151. [Google Scholar] [CrossRef]
Boojala V B Reddy, Yiannis N Kaznessis. Use of secondary structural information and C alpha-C alpha distance restraints to model protein structures with MODELLER. J Biosci. 2007, 32, 929–936. [Google Scholar] [CrossRef]
ChEMBLhttps://www.ebi.ac.uk/chembl/( acess date 05.05. 2024.
MTiOpenScreen. Available online: https://bioserv.rpbs.univ-paris-diderot.fr/services/MTiOpenScreen/ (accessed on 5 May 2024).
M. Abdel-Aziz,O.M.Aly,S.S.Khan,K.Mukherjee&S.Synthesis, Cytotoxic Properties and Tubulin Polymerization Inhibitory Activity of Novel 2-Pyrazoline Derivates Bane. ArchivderPharmazie 2012, 345, 535–548. [Google Scholar]
Morris, G.M.; Huey, R.; Lindstrom, W.; Sanner, M.F.; Belew, R.K.; Goodsell, D.S. and Olson, A.J. Autodock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Computational Chemistry 2009, 16, 2785–2791. [Google Scholar] [CrossRef]
De Wolf, E.; Gill, R.; Geddes, S.; Pitts, J.; Wollmer, A.; Grotzinger, J. Solution structure of a mini IGF-1. Protein Sci 1996, 5, 2193–2202. [Google Scholar] [CrossRef]
Faham, S.; Hileman, R.E.; Fromm, J.R.; Linhardt, R.J.; Rees, D.C. Heparin structure and interactions with basic fibroblast growth factor. Science 1996, 271, 1116–1120. [Google Scholar] [CrossRef]
Watanabe, K.; Chirgadze, D.Y.; Lietha, D.; De Jonge, H.; Blundell, T.L.; Gherardi, E. A New Crystal Form of the Nk1 Splice Variant of Hgf/Sf Demonstrates Extensive Hinge Movement and Suggests that the Nk1 Dimer Originates by Domain Swapping. J Mol Biol 2002, 319, 283. [Google Scholar] [CrossRef]
Lu, H.S.; Chai, J.J.; Li, M.; Huang, B.R.; He, C.H.; Bi, R.C. Crystal structure of human epidermal growth factor and its dimerization. J Biol Chem 2001, 276, 34913–34917. [Google Scholar] [CrossRef]
Hinck, A.P.; Archer, S.J.; Qian, S.W.; Roberts, A.B.; Sporn, M.B.; Weatherbee, J.A.; Tsang, M.L.; Lucas, R.; Zhang, B.L.; Wenker, J.; Torchia, D.A. Transforming growth factor beta 1: three-dimensional structure in solution and comparison with the X-ray structure of transforming growth factor beta 2. Biochemistry 1996, 35, 8517–8534. [Google Scholar] [CrossRef]
R. V. Honorato, P.I. Koukos, B. Jimenez-Garcia, A. Tsaregorodtsev, M. Verlato, A. Giachetti, A. Rosato and A. M.J.J. Bonvin "Structural biology in the clouds: The WeNMR-EOSC Ecosystem." Frontiers Mol. Biosci. 2021, 8, fmolb.2021.729513. [Google Scholar]
Oefner, C.; D'Arcy, A.; Winkler, F.K.; Eggimann, B.; Hosang, M. Crystal structure of human platelet-derived growth factor B. B. EMBO J 1992, 11, 3921–3926. [Google Scholar] [CrossRef]
Barton, W.A.; Tzvetkova, D.; Nikolov, D.B. Structure of the angiopoietin-2 receptor binding domain and identification of surfaces involved in Tie2 recognition. Structure 2005, 13, 825–832. [Google Scholar] [CrossRef]
Iyer, S.; Scotney, P.D.; Nash, A.D.; Acharya, K.R. Crystal Structure of Human Vascular Endothelial Growth Factor-B: Identification of Amino Acids Important for Receptor Binding. J Mol Biol 2006, 359, 76. [Google Scholar] [CrossRef]
Harvey, T.S.; Wilkinson, A.J.; Tappin, M.J.; Cooke, R.M.; Campbell, I.D. The solution structure of human transforming growth factor-alpha. Eur J Biochem 1991, 198, 555–562. [Google Scholar] [CrossRef]
Leppanen, V.M.; Prota, A.E.; Jeltsch, M.; Anisimov, A.; Kalkkinen, N.; Strandin, T.; Lankinen, H.; Goldman, A.; Ballmer-Hofer, K.; Alitalo, K. Structural Determinants of Growth Factor Binding and Specificity by Vegf Receptor 2. Proc Natl Acad Sci U S A 2010, 107, 2425. [Google Scholar] [CrossRef]
Leppanen, V.M.; Jeltsch, M.; Anisimov, A.; Tvorogov, D.; Aho, K.; Kalkkinen, N.; Toivanen, P.; Yla-Herttuala, S.; Ballmer-Hofer, K.; Alitalo, K. . Structural Determinants of Vascular Endothelial Growth Factor-D - Receptor Binding and Specificity. Blood 2011, 117, 1507. [Google Scholar] [CrossRef]
Baldwin, E.T.; Weber, I.T.; St Charles, R.; Xuan, J.C.; Appella, E.; Yamada, M.; Matsushima, K.; Edwards, B.F.; Clore, G.M.; Gronenborn, A.M.; Wlodawer, A. Crystal structure of interleukin 8: symbiosis of NMR and crystallography. Proc Natl Acad Sci U S A 1991, 88, 502–506. [Google Scholar] [CrossRef]
Shim, A.H.; Liu, H.; Focia, P.J.; Chen, X.; Lin, P.C.; He, X. Structures of a platelet-derived growth factor/propeptide complex and a platelet-derived growth factor/receptor complex. Proc Natl Acad Sci U S A 2010, 107, 11307–11312. [Google Scholar] [CrossRef]
Yu, X.; Seegar, T.C.; Dalton, A.C.; Tzvetkova-Robev, D.; Goldgur, Y.; Rajashankar, K.R.; Nikolov, D.B.; Barton, W.A. Structural basis for angiopoietin-1-mediated signaling initiation. Proc Natl Acad Sci U S A 2013, 110, 7205–7210. [Google Scholar] [CrossRef]
Kline, T.P.; Brown, F.K.; Brown, S.C.; Jeffs, P.W.; Kopple, K.D.; Mueller, L. Solution structures of human transforming growth factor alpha derived from 1H NMR data. Biochemistry 1990, 29, 7805–7813. [Google Scholar] [CrossRef]
Gaucher, J.F.; Reille-Seroussi, M.; Broussy, S. Structural and ITC Characterization of Peptide-Protein Binding: Thermodynamic Consequences of Cyclization Constraints, a Case Study on Vascular Endothelial Growth Factor Ligands. (2022) Chemistry.
X Li, A Pontén, K Aase, L Karlsson, A Abramsson, M Uutela, G Bäckström, M Hellström, H Boström, H Li, P Soriano, C Betsholtz, C H Heldin, K Alitalo, A Ostman, U Eriksson. PDGF-C is a new protease-activated ligand for the PDGF alpha-receptor. Nat Cell Biol. 2000, 2, 302–309. [Google Scholar] [CrossRef]
Smrithi Salian, Hind Benkerroum, Thi Tuyet Mai Nguyen, Sheela Nampoothiri, Taroh Kinoshita, Têmis Maria Félix, Fiona Stewart, Sanjay M Sisodiya, Yoshiko Murakami, Philippe M Campeau. PIGF deficiency causes a phenotype overlapping with DOORS syndrome. Hum Genet. 2021, 140, 879–884. [Google Scholar] [CrossRef]
E Bergsten, M Uutela, X Li, K Pietras, A Ostman, C H Heldin, K Alitalo, U Eriksson. PDGF-D is a specific, protease-activated ligand for the PDGF beta-receptor. Nat Cell Biol. 2001, 3, 512–516. [Google Scholar] [CrossRef]
Powers, R.; Garrett, D.S.; March, C.J.; Frieden, E.A.; Gronenborn, A.M.; Clore, G.M. Three-dimensional solution structure of human interleukin-4 by multidimensional heteronuclear magnetic resonance spectroscopy. Science 1992, 256, 1673–1677. [Google Scholar] [CrossRef]
Yoon, C.; Johnston, S.C.; Tang, J.; Stahl, M.; Tobin, J.F.; Somers, W.S. Charged residues dominate a unique interlocking topography in the heterodimeric cytokine interleukin-12. EMBO J 2000, 19, 3530–3541. [Google Scholar] [CrossRef]
Ealick, S.E.; Cook, W.J.; Vijay-Kumar, S.; Carson, M.; Nagabhushan, T.L.; Trotta, P.P.; Bugg, C.E. Three-dimensional structure of recombinant human interferon-gamma. Science 1991, 252, 698–702. [Google Scholar] [CrossRef]
Simonovic, M.; Gettins, P.G.; Volz, K. Crystal structure of human PEDF, a potent anti-angiogenic and neurite growth-promoting factor. Proc Natl Acad Sci U S A 2001, 98, 11131–11135. [Google Scholar] [CrossRef]
Abad, M.C.; Arni, R.K.; Grella, D.K.; Castellino, F.J.; Tulinsky, A.; Geiger, J.H. The X-ray crystallographic structure of the angiogenesis inhibitor angiostatin. J Mol Biol 2002, 318, 1009–1017. [Google Scholar] [CrossRef]
Hohenester, E.; Sasaki, T.; Olsen, B.R.; Timpl, R. Crystal structure of the angiogenesis inhibitor endostatin at 1.5 A resolution. EMBO J 1998, 17, 1656–1664. [Google Scholar] [CrossRef]
Tan, K.; Duquette, M.; Liu, J.H.; Dong, Y.; Zhang, R.; Joachimiak, A.; Lawler, J.; Wang, J.H. Crystal structure of the TSP-1 type 1 repeats: a novel layered fold and its biological implication. J Cell Biol 2002, 159, 373–382. [Google Scholar] [CrossRef]
Radhakrishnan, R.; Walter, L.J.; Hruza, A.; Reichert, P.; Trotta, P.P.; Nagabhushan, T.L.; Walter, M.R. Zinc mediated dimer of human interferon-alpha 2b revealed by X-ray crystallography. Structure 1996, 4, 1453–1463. [Google Scholar] [CrossRef] [PubMed]
Vinogradova, M.V.; Stone, D.B.; Malanina, G.G.; Karatzaferi, C.; Cooke, R.; Mendelson, R.A.; Fletterick, R.J. Ca2+-regulated structural changes in troponin. Proc Natl Acad Sci U S A 2005, 102, 5038–5043. [Google Scholar] [CrossRef]
Carlson, C.B.; Liu, Y.; Keck, J.L.; Mosher, D.F. Influences of the N700S Thrombospondin-1 Polymorphism on Protein Structure and Stability. J Biol Chem 2008, 283, 20069–20076. [Google Scholar] [CrossRef] [PubMed]
Martinez-Martinez, I.; Johnson, D.J.; Yamasaki, M.; Navarro-Fernandez, J.; Ordonez, A.; Vicente, V.; Huntington, J.A.; Corral, J. Type II antithrombin deficiency caused by a large in-frame insertion: structural, functional and pathological relevance. J Thromb Haemost 2012, 10, 1859–1866. [Google Scholar] [CrossRef] [PubMed]
93. Jas Bhachoo , Thijs Beuming. Investigating Protein-Peptide Interactions Using the Schrödinger Computational Suite. Methods Mol Bio. 2017; 1561, 235–254.
Santiago Vilar, Giorgio Cozza, Stefano Moro. Medicinal chemistry and the molecular operating environment (MOE): application of QSAR and molecular docking to drug discovery. Curr Top Med Chem. 2008, 8, 1555–1572. [Google Scholar] [CrossRef]
95. Jie Dong , Dong-Sheng Cao , Hong-Yu Miao , Shao Liu , Bai-Chuan Deng , Yong-Huan Yun , Ning-Ning Wang , Ai-Ping Lu , Wen-Bin Zeng , Alex F Chen. ChemDes: an integrated web-based platform for molecular descriptor and fingerprint computation. J Cheminform. 2015; 7, 60.
https://www.wolframalpha.com/( access date 24.05.2024).

Figure 1. Homology models of PDGF C, PIGF, PDGF D, and Vasostatin, respectively.

Figure 2. aVEGFR2 is represented as a ribbon; cavity one is defined as water clusters (molecules of water shown in grey space-filling).b. VEGFR2 docked with O=C1Oc2c(cccc2)C=C1( slightly moved compared to a. to show the ligand -colored in pink space filling - in the binding pocket).

Figure 3. The protein-protein complex of an inhibitory and stimulant protein is docked with VEGFR2 represented in ribbons.

Figure 4. Chimeric protein models for the inhibitory and stimulant proteins.

Figure 5. Aa composition (%) of inhibitory and stimulant chimeric models.

Figure 6. Chemical space of inhibitory and stimulant proteins of angiogenesis. The chemical space is characterized by the six molecular descriptors: mol weight, number of H bond acceptors, number of H bond donors, polar surface area, shape attribute, sum of degrees, and sum of valence degrees. The chemical space is represented as radar plots.

Figure 7. Angiogenesis chemical space of inhibitory and stimulating proteinschimeric mdels characterized after the molecular weight, number of hydrogen bonds donor, number of hydrogen bonds acceptors shape attribute of each molecule, polar surface, and the sum of degrees, respectively.

Figure 8. C-alpha distance map plot of inhibitory and stimulant chimeric models together with PEDF ( as an inhibitor example) and Angiopoietin1 ( as a stimulant example).

Figure 9. Chimeric and stimulant model multidimensional data represented as scatter plots.

Figure 10. 2D complex map of the six-degree polynomial equation: a. inhibitory space equation; b. stimulant space equation; c.combine space equation.

Figure 11. Ramachandran plots for the homology models used in the computations.

Table 1. Docking energies of VEGFR2 against a set of Chembl1 structures. For ligands 1-27, see the supplemental file S₁ and methods section. E-total energy ( kcal/mol); E sol -solvation energy(Kcal/mol); E ang (angulation energy (kcal/mol).

Ligand	E	Esol	E ang
1	-45.24	-30.07	36.01
2	-44.69	-25.84	35.11
3	-44.65	-50.25	14.67
4	-41.20	-18.80	46.79
5	-39.96	-55.50	10.14
6	-38.82	-56.32	22.77
7	-38.77	-27.66	9.27
8	-38.59	-109.15	10.85
9	-38.03	-41.77	12.69
10	-37.72	-16.26	37.77
11	-37.49	-10.17	54.20
12	-37.21	-78.65	14.45
13	-36.37	-14.36	34.61
14	-36.09	-29.23	7.45
15	-36.03	-29.10	20.93
16	-35.71	-24.03	10.14
17	-35.69	-7.31	26.02
18	-35.68	-111.58	37.03
19	-35.59	-34.22	24.30
20	-35.44	-22.87	23.73
21	-34.88	-105.90	20.87
22	-34.78	-52.75	7.95
23	-34.56	-3.77	23.67
24	-34.41	-23.55	35.22
25	-34.05	-24.48	20.89
26	-33.78	-25.33	25.66
27	-33.57	-49.07	11.48

Table 2. VEGFR2 inhibitors complex energy (kcal/mol) E – the total complex energy, E sol-solvation energy; E ang -angular energy(the lowest and the highest docking energies values were bolded).

Complex	E	E sol	Eang
1AU1	-90.22	-5836.02	924.59
1BBN	-57.36	-2894.99	206.50
1F45	-85.51	-6777.33	1039.77
1IMV	-72.26	-4854.87	763.33
1KI0	-79.51	-4023.45	700.56
1KOE	-72.79	-1867.34	369.31
1LSL	-68.70	-2006.78	203.36
1YTZ	-90.15	-9148.62	603.85
2RHP	-82.51	-7130.11	1184.05
4EB1	-92.87	-15734.68	2814.55
MODEL4 (Vasostatin)	-62.41	-5337.55	796.00

Table 3. VEGFR2 stimulators complex energy(kcal/mol). E ref- the total complex energy; E sol-solvation energy; E ang -angular energy(the lowest and the highest docking energies values were bolded).

Complex	E	E sol	Eang
1BFB	-71.26	-1936.14	393.64
1GP9	-78.27	-11590.14	1582.94
1JL9	-65.71	-1740.59	180.73
1PDG	-76.04	-3093.65	976.97
1Z3S	-71.71	-2725.12	423.11
2C7W	-72.40	-2490.62	586.69
2TGF	-64.31	-987.23	211.88
2X1W	-99.99	-14554.78	2250.33
2XV7	-70.30	-1715.22	253.52
3IL8	-60.05	-1446.22	334.85
3MJK	-78.40	-13882.82	1795.88
4JYO	-71.03	-2370.64	557.32
4TGF	-63.21	-1157.93	137.47
6Z13	-85.60	-4042.03	603.97
MODEL01	-73.53	-7203.08	1149.46
MODEL02	-62.63	-896.16	189.00
MODEL03	-77.92	-7467.03	1208.71

Table 4. Chimeric proteins Aa sequences.

Table 5. Chimeric models properties.

Property	Inhibitory chimeric model	Stimulant chimeric model
Number of amino acids	419	217
Molecular weight	47642.80	24944.16
Theoretical pI	7.06	8.32
Total number of negatively charged residues (Asp + Glu)	55	21
Total number of positively charged residues (Arg + Lys)	55	23
Formula	C2135H3376N568O629S18	C1115H1648N304O320S16
Total number of atoms	6726	3403
The estimated half-life is	100 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). >10 hours (Escherichia coli, in vivo).	>20 hours (mammalian reticulocytes, in vitro). >20 hours (yeast, in vivo). ? (Escherichia coli, in vivo).
Instability index:	The instability index (II) is computed to be 38.35 This classifies the protein as stable.	The instability index (II) is computed to be 35.30 This classifies the protein as stable.
Aliphatic index	86.32	54.88
Grand average of hydropathicity (GRAVY):	-0.258	-0.594

Table 6. Ligand docked with VEGFR2 PDB structure.

1	O=C1Oc2c(cccc2)C=C1
2	S(C)c1ccc(CC(N)C)cc1
3	S(=O)(=O)(Nc1ncccn1)c1c2c(c(N(C)C)ccc2)ccc1
4	O(C)c1cc2[nH]c3c(c2cc1)ccnc3
5	O(C)c1cc2nc(N3CCNCC3)cnc2cc1
6	BrCC1C(O)C(O)C(N2C(=O)NC(=O)C(F)=C2)O1
7	Fc1c(O)nc(O)nc1
8	O[C@@H]1C(C)(C)Oc2c(C1N/C(=N\C(C)(C)C)/NC#N)cc(C#N)cc2
9	Clc1nnc(NS(=O)(=O)c2c3c(c(N(C)C)ccc3)ccc2)c(C)c1
10	O=C(NCc1cocc1)c1ccc(N(CC#C)Cc2cc3c(O)nc(C)nc3cc2)cc1
11	O(C)c1c2c([nH]c3c2CCN=C3)ccc1
12	O=C1N(C2[C@H](O)[C@@H](O)[C@@H](CO)O2)C=CC(N)=N1
13	S(=O)(=O)(N)c1c2c(ccc1)cccc2
14	N(CC1NCC(c2ccccc2)c2c1cccc2)C
15	Clc1cc2C(c3ccccc3)=NCC(=O)N(C)c2cc1
16	O=C(Nc1c(N2CCN(C(=O)c3[nH]c4c(c3)cccc4)CC2)nccc1)C
17	Oc1nc2c(c(C)c1)ccc(O)c2
18	Fc1cc(Cn2c3ncnc(NC)c3nc2)ccc1
19	S(=O)(=O)(N)c1c(N)cccc1
20	O=C(NCCC)CC1CCN(c2nc(N)c3c(n2)cc(OC)c(OC)c3)CC1
21	N(C)(C)C1(C)C(C)(C)C2CC1CC2
22	Clc1cc2C(=O)N(C)Cc3c(C(=O)OCCC(C)(C)C)ncn3-c2cc1
23	Oc1cc(Cn2c3ncnc(NC)c3nc2)ccc1
24	O=C(OCC)C1=C(O)C(=O)N(Cc2ncccc2)C1
25	O=C(OCC(O)CNC(C)(C)C)c1ccc(OCC=C)cc1
26	O=C1Oc2c3[C@@H](O)[C@@H](C)[C@@H](C(C)C)Oc3c3c(OC(C)(C)C=C3)c2C(CCC)=C1
27	O=C(c1[nH]c2c(c1)cc(C)cc2)c1[nH]c2c(c1)cccc2

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.