Spike Receptor Binding Domain (RBD)
Spike protein is a homotrimer with two subunits, namely S1 and S2 in each of the monomer. S1 contains the RBD which binds specifically to the host ACE2 to enable entry into the cells, and is prone to key mutations found in the different variants of concern [
8]. On the other hand, S2 subunit arbitrates the viral-cell fusion followed by viral replication by infusing the viral genetic material [30, 31]. Therefore, blocking the S1 subunit will help prevent the fusion with ACE2 preventing the viral entry and its replication. When we carried out the docking studies of the shortlisted drugs with the viral spike protein (PDB: 6M0J) by giving key interacting residues with ACE2 as the active site grid, the drugs natamycin and alendronate had the least docking score of -7.82 kcal/mol and -7.69 kcal/mol respectively, making them promising candidates. The residues Tyr449, Tyr453, Ser494, Gln498, Asn501 and Gly502 have shown interactions with natamycin. Among these, most of the interactions are due to hydroxyl groups of the drug. Only two residues Ser494 and Tyr453 interacted with the 4-amino group of mycosamine sugar moiety (
Figure 3a). Interestingly, these residues are among those that interacted with the ACE2 receptor during viral entry [
32]. As, natamycin binds to the same residue, binding of the ACE2 receptor is not possible. Similar binding pattern is seen in case of alendronate, wherein the residue Arg403, Tyr453, Ser494 and Gly496 interacts with this drug. Therefore, due to better binding scores, there is a chance that these two drugs could prevent the entry of virus as well as inhibit the spike protein from viral replication.
To determine the behavior of ligand in a biological system, we used molecular dynamics simulation and found that natamycin was stable for the initial 20ns as the residue from the docked complex was retained during this period, but the ligand started losing few contacts after 20ns and moved slightly out of the active site pocket resulting in the increased RMSD and fluctuations until 60ns. After that, the ligand maintained interactions with Asn437, and to a lesser extent with Asn440, until the end of the simulation time. As shown in Figures 3b and 3c, the active site residue has low fluctuations in the RMSF plot; c.f. time frame analysis for pose view at different time frames of natamycin (
Figure 3d). A different trend was observed in case of alendronate, wherein, the complex was stable for the initial 3ns and the contacts were same as that of docked complex. After 3ns, the molecule completely moved out of the assigned active site resulting in escalated RMSD to 60 Å and then declined to 32 Å at 5ns. During this, alendronate deviated from the active site and attached to the other side of the S1 protein by interacting with Asp427 and Asp428 (c.f. time frame analysis,
Figure 3e). Multiple strong interactions were seen with Asp427 and 428 while few contacts were observed with Phe429 and Thr430. From 6ns, the ligand was stable and the interactions with Asp427 and Asp428 were retained until the end of the simulation with minor deviations (
c.f. supplementary file 3.1). As natamycin stays in the pocket till the end of simulation, it is most likely to inhibit the spike protein, and
in vitro studies could show promising results.
Nucleocapsid Protein RNA Binding Domain (NPRBD)
Nucleocapsid protein has a significant role in the viral structure as it binds to the virus’s genetic material (RNA) and facilitates the folding process of hammerhead ribozyme that catalyzes reversible cleavage and ligation reaction at specific RNA sites. This results in preventing the formation of unproductive conformations of the RNA and production of a helical ribonucleoprotein. Further, it also regulates various cellular processes such as, cell cycle progression, apoptosis and actin reorganization [
33]. The nucleocapsid protein has two different domains which aid in different functions. The N-terminal domain (NTD) is very important in case of viral replication and transcription; while the C-terminal domain (CTD) has conserved dimerization mechanisms by forming hydrophilic and hydrophobic interactions [
33]. Since, N-protein has a major role in viral replication, inhibition of this target can be beneficial for treating SARS-CoV-2 infection. Upon performing docking with NPRBD (PDB: 6VYO), the drug with the best score was found to be cromolyn, a mast cell stabilizer with the docking score of -9.54 kcal/mol. The residues that interacted with cromolyn include Ala55, Asn77, Arg92, Arg107, Arg149, Asn154 and Ala173. Except for Ala55, Asn77 and Ala173, all other residues interacted with the carboxyl terminals of the drug’s chromene moiety, while the other residues (Asn77 and Ala173) interacted with the hydroxyl at 2
nd position of propane linker, and Ala55 interacted with the moiety’s keto group at the 4
th position (
Figure 4a).
MD analysis of cromolyn in complex with 6VYO shows good RMSD plot as consistent deviations were observed with the protein residues (
Figure 4b). For the initial 48ns, the ligand has slight variations between RMSD of 3Å-5Å, during which the residues Trp52, Asn153, Asn154, Ala155 of chain A and Ala55, Thr57, Arg107, Tyr109, Arg149, Pro151, Ala156 and Asa173 of chain D shown strong interactions with the cromolyn molecule. After that, some of the interacting residues with the ligand were broken resulting in escalated RMSD between 6Å-10Å up to 75ns (
Figure 4c).
Time frame analysis between 48ns-100ns shown in
Figure 4d reveals that the ligand started uprooting from the binding site and was prevented by the strong interactions of Arg107 and Arg149 which held the cromolyn molecule within the site. The alkyl chain (hydroxyl at the 2
nd position of propane) linker between two chromene moieties were likely responsible for these higher fluctuations. The protein had a similar fluctuating pattern as that of the ligand. Thr76 of chain A formed a new interaction while the residues Asn154, Ala155 of chain A and Arg107, Arg149, Ala156 of chain D retained their strong interactions with the cromolyn molecule until the end of the simulation. Further, from the ligand protein (LP) contacts, it is clear that the residues Arg107, Arg149 and Ala156 of chain D contributed to most of the interactions while the other residues only made moderate contributions along with water mediated interactions (
c.f. supplementary file 3.2).
Figure 4.
(a) Docking pose of chromolyn representation in 3D (right) and 2D (left) (b) RMSD plot of cromolyn showing deviations during the complete simulation (c) RMSF plot of N protein RNA binding domain showing fluctuations of the residues during the simulation (d) time frame analysis of cromolyn showing poses at different time intervals.
Figure 4.
(a) Docking pose of chromolyn representation in 3D (right) and 2D (left) (b) RMSD plot of cromolyn showing deviations during the complete simulation (c) RMSF plot of N protein RNA binding domain showing fluctuations of the residues during the simulation (d) time frame analysis of cromolyn showing poses at different time intervals.
Papain Like Protease (PL pro)
PL proteases are multifunctional enzymes essential for replication of viruses. It is capable of recognizing and hydrolyzing ubiquitin and Interferon-stimulated gene 15 (ISG15; Ubiquitin-like protein) that covalently form an isopeptide bond with the target protein, thereby favouring the entry/replication of the viral cells within the host. Hence, inhibiting PL
pro could be a good approach to prevent the viral replication in the host system [
34]
. From our studies, three drugs, natamycin, alendronate and metaproterenol, were found to have close docking scores of -7.39 kcal/mol, -7.16 kcal/mol and -7.13 kcal/mol respectively. The active site of PL
pro is between two chains (A and C) that gives a proper shape for the ligand to fit (
Figure 5a). Between these two chains, the residues that interacted with natamycin include Asp108 of chain A and Lys157, Leu162, Gly163 and Glu167 of chain C. The hydroxyl on the 25
th carboxylic group of natamycin interacted with Lys157 and Leu162, while hydroxyl groups at 1
st and 26
th positions interacted with Gly163 and Asp108 respectively. Additionally, the hydroxyl at 6
th position of the substituted sugar (mycosamine) interacted with Glu167.
Further MD analysis shows low fluctuations of the target residues and the ligand during the complete simulation as the RMSD was <3.2Å. After major fluctuations during 18-24ns and 44-56ns, the protein residues attained equilibrium for the rest of the simulation time. Change in Natamycin’s conformation from the docked pose resulted in the RMSD of 4Å, and we observed at 1ns that the residue Gln269 formed a contact with the drug while Asp108 and Asn109 formed water mediated interactions. The fluctuations were varying up to 7Å until 30ns and during 31ns-34ns and 40ns-48ns, the ligand had no contact with the protein resulting higher RMSD of 12Å. After 48ns, Tyr268 and Asn109 formed contact with natamycin resulting in decline of RMSD to 9Å and the ligand attained equilibrium as the fluctuations were not high towards the end (
Figure 5b). RMSF analysis shows low fluctuations of the residues except those in the active site region due to drug interaction. The residues between 266-270 formed interactions with the ligand and resulted in higher RMSF due to fluctuations of the interacting atoms of the ligand (>4.0 Å) (
Figure 5c). The residues that were in contact predominantly with natamycin were Trp106, Asp108, Asn109, Asn267 and Tyr268. Among these, Asp108 and Asn267 retained the interactions and prevented the ligand from moving outside the active site (c.f.
Figure 5d). These two residues contributed to bulk of the simulation time (42% and 31% respectively), compared to Asn109, Thr265 and Tyr268 (10-15%) (
c.f. supplementary file 3.3).