3.1. Effect of sulfation on PG16 dynamics and interactions
To investigate the role of tyrosine sulfation on the PG16-gp120 interaction, residue Y100H of PG16 was modified to sulfotyrosine (TYS). The sulfo group could be accommodated in the complex structure (PDB: 6ULC [
16]) without any steric clashes after energy minimization (
Figure 1C). 1
molecular dynamics (MD) simulations were performed for the unmodified PG16 (TYR-PG16) and the sulfated PG16 (TYS-PG16) in complex with HIV-1 Env.
The backbone RMSD of the CDRH3, which harbors the sulfation site Y100H, is rather similar for the TYR-PG16 and TYS-PG16 simulations (
Figure 2).
Apart from some transient fluctuations in the first half of the simulations, all RMSD values level off at 0.5Å–1.0Å in the second half of the simulations, indicating that the CDRH3 remains stably folded. This stable behavior is confirmed by an analysis of the secondary structure over the simulation time (
Figure 3). In all simulations, the characteristic structure of the hammerhead consisting of a
-hairpin centered around a turn at position D100C/D100D is retained. Posttranslational modification at position Y100H neither affects the conformation of this hairpin, nor the structure of the C-terminally adjacent structural element (classified either as a turn or a 3
10-helix that spans F100J-D100L according to dssp analysis;
Figure 3). In summary, this data shows that the CDRH3 backbone secondary structure is stable for both the TYR-PG16 and TYS-PG16.
In the next step, we investigated the energetics of the TYR-PG16 and TYS-PG16 interaction with HIV-1 Env in more detail. A residue-level decomposition of the binding energy (
Figure 4A) reveals the hotspots of the PG16-Env interaction. The largest energetic contributions come from the aromatic residues W100A, Y100G, Y100H, F100J. This data is in line with a previous mutagenesis study of Pejchal [
14], which found that the Y100G-F100J sequence stretch is crucial for binding and neutralization of HIV-1. In addition, the same study showed that a W100A single mutation increased the IC
50 for HIV-1 neutralization by more than 50-fold. When comparing TYR-PG16 and TYS-PG16 (
Figure 4B), differences in energetic contributions are not only observed for the site of modification itself (Y100H), but also for adjacent residues like K100F and D100L, or for the more distant D101. We have therefore inspected the interactions of these residues in more detail and compared them between the unmodified and the sulfated PG16.
The sulfate group of sY100H forms polar interactions with K168 of gp120 (
Figure 5A). This interaction persists over the entire simulation time, although some transient fluctuations of the distance are observed (
Figure 5B). In contrast, the Y100H-K168 distance is considerably larger for the unmodified tyrosine (
Figure 5C). The absence of the sulfo group not only increases the distance of the two side chains, but also reduces electrostatic complementarity due to the lack of a negative charge. Together, these two factors explain the significant energetic benefit of sulfation at this position (~ 3
/
;
Figure 4B). This role of Y100H sulfation to increase electrostatic complementarity is also in line with the previous study of Pancera et al. [
22], who found that the sulfo group forms an intermolecular salt-bridge in a complex between PG16 and the V1-V2 fragment of gp120. In addition to this intermolecular interaction, sY100H also forms an intramolecular interaction with K100F (
Figure 5D). This interaction is rather stable over the simulation time in TYS-PG16 (
Figure 5E) but cannot be formed by the shorter unmodified sidechain in TYR-PG16 (
Figure 5F).
This intramolecular interaction that stabilizes the relative positions of sY100H and K100F also contributes to the stabilization of the intermolecular interactions of K100F. K100F forms a salt-bridge with D167 of gp120 (
Figure 6A). This salt-bridge is present in TYS-PG16 (
Figure 6B) and TYR-PG16 (
Figure 6C); however, larger distance fluctuations (> 5 Å) are only observed for TYR-PG16 (
Figure 6C), indicating that the network of polar interactions is less stable in unmodified PG16. A similar trend is observed for the salt-bridge formed between D100L and R170 of gp120 (
Figure 6D). Again, larger distances (> 5 Å) are observed only for TYR-PG16 (
Figure 6F) but not for TYS-PG16 (
Figure 6E).
The enhanced stability upon sulfation also affects the interactions between PG16 and the glycans of gp120 (
Figure 7). One example is H100R, which interacts with the MAN4 glycan (
Figure 7A). This interaction is stable for TYS-PG16 throughout the entire simulation (
Figure 7B), whereas it is lost within the first 100
in both simulation runs of TYR-PG16 (
Figure 7C). For D101, which interacts with the MAN7 glycan (
Figure 7D), the interpretation is more complicated. For TYS-PG16, this interaction is completely lost for short (< 100
) periods of the simulation time, while at least one hydrogen bond is formed between the D101 side chain and a hydroxyl group of MAN7 in the remaining parts of the simulation, as indicated by a distance of <
Å (
Figure 7E). The corresponding hydrogen bond is very stable and only marginal distance fluctuations are observed in the respective time windows (e.g. from 150
–400
, 500
–750
, and 850
–1000
for run2; light orange line in
Figure 7E). In contrast, in the TYR-PG16 simulation (
Figure 6F) the magnitude of the fluctuations in the D101-MAN7 distance is smaller; however, short distances, indicative of a stable hydrogen bonding, are observed less frequently than for PG16-TYS. For PG16-TYR, the distance mostly fluctuates around the threshold of
Å, which is considered as upper limit for a hydrogen bond (considering the hydrogen-acceptor distance; [
51]). The difference in the strength of this hydrogen bond may contribute to the stronger interaction of D101 in the TYS-PG16 system.
In summary, the data above indicate that PG16 sulfation does not only enhance gp120-binding at the site of the modification, but also has a favorable influence on polar interactions in its vicinity. In case of K100F, this is a rather direct influence by stabilizing the K100F position, which facilitates interaction with D167. In case of the remaining interactions investigated here, the effect is more indirect, i.e. the stabilization of the interface at the site of sulfation lowers fluctuations of other intermolecular interactions. The importance of tyrosine sulfation for enhancing binding affinity has been described previously for a number or protein-protein interactions [
18]. The present analysis confirms this trend and gives additional insight into the atomistic details underlying this effect. The analysis performed here shows a general trend, which, however, cannot readily be quantified. Depending on the HIV-1 isolate, Env proteins differ significantly in protein sequence [
52], glycation sites [
53] and glycan composition [
54], making quantitative statements difficult. However, PG16 is a broadly neutralizing Ab efficient against 70–80 of all HIV-1 isolates [
14,
15], suggesting that the overall principles of recognition and effect of sulfation is similar for different HIV-1 isolates.
3.2. Design of PG16-derived peptides
The investigation above demonstrated that the CDRH3 of the sulfated PG16 has favorable gp120 binding properties. This prompted us to investigate whether peptides presenting the CDRH3 would retain the binding properties of PG16. We initially generated a 33-residue peptide (SL.pg16.lin;
Figure 8A) that covers all residues with energetically favorable contributions to the PG16-gp120 interaction (
Figure 4A). Since the sulfo group was found to be hydrolysis sensitive during peptide synthesis, phosphotyrosine (pY100H) was used as a surrogate, because it exhibits physico-chemical properties similar to those of sulfotyrosine (see
Table 1 for a summary of all peptide sequences investigated). Recombinant, soluble HIV-1 gp120 (HxBc2) was found to bind with an EC
50 of
to linear SL.pg16.lin (
Figure 8B), making it a promising candidate for further optimization. Several previous studies have used artificial disulfide bonds to enhance the binding properties of peptides [
50,
55]. We also used this concept for our PG16 CDRH3 peptides by introducing disulfide bonds at different sites (
Figure 8A). Initially, two different disulfide bonding patterns were investigated: In LW40.4, residues P99 and K100F were replaced by cysteine to allow for the formation of a disulfide bond that fixes the CDRH3
-hairpin. In LW40.9 the native C92 was retained, and a second cysteine was introduced at position 103 to allow for the formation of a disulfide bond that fixes the proximity between termini of the peptide. The experimentally determined EC
50-values (
Figure 8B) show that gp120 binds with higher affinity to LW40.4 than to SL.pg16.lin, whereas the disulfide bonding pattern in LW40.9 leads to a significant decrease in affinity.
To understand the structural basis of these results, MD simulations of the free LW40.4 and LW40.9 peptides were performed. Snapshots of representative structures from the LW40.4 simulation (
Figure 8C) show that the disulfide bond efficiently fixes the central
-hairpin, whereas the termini are rather flexible and deviate significantly from the conformation present in intact PG16. In contrast, the terminal disulfide bond in LW40.9 (
Figure 8D) is insufficient to stabilize the central
-hairpin. The differences in
-hairpin stability between LW40.4 and LW40.9 also become evident from a plot of the secondary structure over simulation time (
Figure 8E, F).
A closer inspection of the termini in LW40.9 (
Figure 8D) reveals that they remain close, as expected from the presence of the disulfide bond; however, their conformation becomes significantly distorted over simulation time. For example, transient helical turns are formed near the disulfide bond (
Figure 8D), which are not present in the initial structure. This finding suggests that the properties of LW40.9 might be improved by changing the position of the terminal disulfide bond. Consequently, we designed LW10.13, in which the sequence position of the C-terminal cysteine is shifted by two residues compared to LW40.9 (
Figure 8A). This change leads to a significantly improved EC
50 of LW10.13 compared to LW40.9 (
Figure 8B) demonstrating that the exact position of the disulfide bond critically affects binding affinity. However, even the optimized terminal disulfide bond in LW10.13 does not result in an improved EC
50 compared to the SL.pg.16.lin precursor (
Figure 8B). This observation suggests that stabilization of the central
-hairpin is more efficient for improving binding properties than fixing the spatial proximity of the CDRH3 termini. The role of intramolecular interactions for the maintenance of the PG16 CDRH3 structure has been previously shown in a study by Kondo et al. [
56]. The authors found that a hydrogen bond between Y100Q and P99 is critical for the structural rigidity of the CDRH3. Position 99 is also used for stabilization by a disulfide bond in LW40.4, suggesting that stabilization at this site might be beneficial for CDRH3 binding properties.
In addition to the stability of the central
-hairpin, the LW40.4 simulations also revealed a high flexibility of the peptide termini (
Figure 8C). In intact PG16, the residues near the termini show favorable energy contributions to the PG16-gp120 interaction (
Figure 4A), which raises the question whether this role is still retained in LW40.4 despite the high flexibility of the free peptide.
To address this question, two terminally truncated versions of LW40.4 (termed LW40.3 and LW40.2) were synthesized (
Figure 9A). The experimental binding data shows that even a moderate truncation (LW40.3) causes a decrease in the EC
50 by almost one order of magnitude, whereas the even shorter LW40.2 shows no detectable binding at all (
Figure 9B). This result demonstrates that those residues, which were mapped as energetically important for PG16 binding (
Figure 4A), are also required for the high-affinity binding of LW40.4. This finding also indicates that the binding mode of LW40.4 is similar to that of PG16. In contrast, an N- and C-terminal extension (peptide LW40.5) did not result in an enhanced EC
50 compared to LW40.4 (
Figure 9A,B) indicating that LW40.4 already contains all CDRH3 residues important for gp120-binding. For all peptides investigated in this study, the dose-dependent binding of gp120 is shown in
Figure 9C. It indicates that LW40.4 exhibits the most favorable binding properties. This peptide thus represents a promising starting point for further optimization to eventually obtain a potent inhibitor that efficiently blocks HIV-1 infection.
Further optimization may include the incorporation of non-natural amino acids, D-amino acids, or the methylation of amide groups to improve proteolytic stability and/or binding affinity. In addition, LW40.4 may be covalently linked to another CDR-derived peptide that binds to an adjacent site of gp120 to create bispecific ligands.
It should be stressed here that the approach of the present study is not limited to antibodies inhibiting HIV-1 infection, but can also be applied to other infectious diseases, for which structural information on antibody-antigen complexes is available. This is for example the case for the interaction between antibodies and the spike protein of SARS-CoV-2, which plays a central role in viral infectivity and transmissibility [
57]. Therefore, the design of inhibitory peptides derived from antibodies may represent a promising alternative therapeutic option for the treatment of infectious diseases.