3.1. Comparative Analysis of the Secondary Structure of Single and Dual Ion Forms of EndoT5
The spatial structure of EndoT5 in complex with catalytic Zn
2+ and regulatory Ca
2+ (EndoT5-Zn
2+Ca
2+, PDB ID: 8P3A) was obtained by high-resolution NMR. For comparative analysis, we used the previously solved 3D structure of EndoT5 in complex with catalytic Zn
2+ ion (EndoT5-Zn
2+, PDB ID 2MZX). An accurate evaluation of the secondary structure content in both forms of the protein was carried out based on the analysis of the average spatial structures of each of the ensembles obtained on the basis of NMR data. It was shown that upon binding of regulatory Ca
2+, the α-helical content decreased from 49 to 36%, while the content of β-sheet structure increased from 12 to 16%. The arrangement of secondary structure elements for both forms of EndoT5 is shown in
Figure 1.
It is interesting that the C-terminal part of the α3-helix of the single-ionic form EndoT5-Zn2+ (I51–A52) transforms into an irregular structure upon Ca2+ binding. In this case, two residues flanking the N-terminus of this helix, V44 and A45, on the contrary, take on an α-helical conformation. Further, the residues L108–G111 flanking the calcium-binding regulatory loop change from an irregular conformation upon Ca2+ binding to a β-structure, forming an additional β-strand that is part of the globular domain. In this case, residues D113-A116 of the beginning of the regulatory loop, which formed an α-helix in the absence of Ca2+, take on an irregular conformation, in which the configuration of only three residues - N115-S117 - is close to the parameters of an α-helix. The short C-terminal α-helix I124–R126 after Ca2+ binding includes one additional residue, E123. The secondary structure of the globular domain also undergoes a moderate transformation upon Ca2+ ion binding. The lengths of α-helical (L20-L30) and β-structural (A71-I78) segments are shortened by one residue from the C- and N-terminal ends, respectively. In contrast, β-strands F35-Q39 and H133-E135 each add one residue from the N- or C-end. In addition, as noted above, some residues form an additional β-strand L108-G111. Thus, the changes in secondary structure induced by binding of regulatory Ca2+ are observed both in the composition of extended loops and in the globular part of the macromolecule.
Translated with DeepL.com (free version)
Figure 1.
Comparative analysis of the secondary structures of EndoT5-Zn2+ and EndoT5-Zn2+Ca2+. Cylinders indicate α-helical sections, arrows indicate β-structural sections. The Ca2+ coordinating residues of the regulatory loop are shown in red. Sequence fragments forming the globular domain are shown in blue, whereas regions located in extended loops are shown in red.
Figure 1.
Comparative analysis of the secondary structures of EndoT5-Zn2+ and EndoT5-Zn2+Ca2+. Cylinders indicate α-helical sections, arrows indicate β-structural sections. The Ca2+ coordinating residues of the regulatory loop are shown in red. Sequence fragments forming the globular domain are shown in blue, whereas regions located in extended loops are shown in red.
3.2. Binding of Ca2+ Ion by the Regulatory Loop and Its Effect on Intramolecular Mobility
Details of the binding of regulatory calcium by the EF-like calcium-binding loop formed by amino acid residues 113 to 123 are presented in
Figure 2. The figure clearly illustrates that polar amino acid residues D113, N115 and S117, corresponding to positions 1, 3, and 5 of the canonical EF loop, are involved in the coordination of the Ca
2+ ion. The emerging coordination bonds fix the spatial position of these residues. As a result, the length of the moving sections of the regulatory loop is significantly reduced, stabilizing it. In this stable conformation of the loop, an increase in the number of van der Waals contacts of the amino acid residues with the residues of the globular core of the protein is observed.
Intramolecular movements in proteins play an important role in many biological processes, in particular enzymatic catalysis and its allosteric regulation. The presence of Ca
2+-mediated allosteric regulation is well illustrated by the differences in the local RMSD of the main chain of the ensembles of EndoT5-Zn
2+ and EndoT5-Zn
2+Ca
2+ structures (
Figure 3). Despite the spatial distance of the extended loops 40-70 and 111-132 and the absence of direct contact between them, the ordering of their conformational states has a symbatic character. Therefore, the Zn
2+Ca
2+ form of EndoT5 manifests itself as a single conformational cooperative unit [
24]. This property of the catalytically active “open” conformation of the enzyme molecule is necessary for all stages of the catalytic cycle.
A comparative analysis of
15N{
1H} NOE amides of the main chains of both forms of the protein (
Figure 4A and
Figure 4B) shows that in the subnanosecond time range, only the backbone fragment of the extended G40–D70 loop has significant intramolecular mobility. At the same time, the regulatory Ca
2+-binding loop (G111–G132) is globally immobile. However, a detailed comparison shows that after calcium binding, the
15N{
1H}NOE values for amide pairs neighboring along the polypeptide chain tend to align throughout the whole structure, indicating the emergence of the dynamic conjugation of residues adjacent within the chain. These data indicate the emergence of conformational relationships in the structure of EndoT5-Zn
2+ macromolecules when they bind Ca
2+ ions, which is consistent with the structural data obtained by analyzing the local RMSD of the main chain of ensembles of EndoT5-Zn
2+ and EndoT5-Zn
2+Ca
2+ structures (see
Figure 3).
The coordination of the intramolecular movements of extended loops in the presence of the Ca
2+ ion is necessary for the implementation of all the stages of catalysis (binding – hydrolysis – substrate release), regardless of the view on the recognition processes: whether to adhere to the theory of induced fit or the theory of conformational selection, the debate around which is still ongoing. From the point of view of Koshland’s theory of induced fit [
25], intramolecular mobility provides the protein globule with the ability to interact with the substrate, and the binding of regulatory calcium with a subsequent change in the loop structure can be considered as a stage of this process and at the same time a special case of induced loop-ligand fit. If one discusses the intramolecular mobility in terms of the conformational selection [
26], mobility provides variability in conformational substates in a population of macromolecules, and the calcium ion, interacting with protein globules that have a conformation with the lowest free binding energy, shifts the equilibrium towards catalytically active forms. Therefore, the binding of the Ca
2+ ion is accompanied by the selection of functionally significant conformational states of the enzyme’s extended loops (40-70 and 111-132).
3.3. Structural Basis of Activation of EndoT5 by Regulatory Ca2+
Figure 5A and
Figure 5B compare the structure of EndoT5 with one Zn
2+ ion (PDB ID: 2MXZ) and two ions, Zn
2+ and Ca
2+ (PDB ID: 8P3A). Both ions in the EndoT5-Zn
2+Ca
2+ structure are located in close proximity, but their roles are different. The zinc ion in EndoT5 is catalytic, tightly coordinated to H66, H133, and D73, and, together with the D130 residue, is involved in the nucleophilic substitution that occurs during hydrolysis of the peptide bond in the substrate by endolysin. Calcium plays a regulatory role and is bound by the mobile EndoT5 loop, which has convergent similarity to the canonical EF loop of the eukaryotic calcium buffers and sensors. The affinity of Ca
2+ for the enzyme is low: the dissociation constant determined previously is 2×10
-4 M [
7].
There are no analogues of the EndoT5-Zn
2+Ca
2+ two-ion structure in the databases yet. Among other members of the family of zinc-containing peptidases M15 of subfamily C homologous to EndoT5, there are several representatives whose structures (or parts of structures) were determined by X-ray diffraction analysis. These include peptidoglycan hydrolase ChiX from the Gram-negative bacterium
Serratia marcescens (PDB IDs: 5OPZ and 5OQ1, [
27]); the LysB4 endolysin from
Bacillus cereus-targeting bacteriophage B4 (PDB ID: 6AKV, [
28]); and catalytic domain of the
Listeria bacteriophage endolysin Ply500 (PDB ID: 2VO9, [
20]). All of these structures contain a zinc ion coordinated by an H-D-H triad, similar to EndoT5, but they do not contain a calcium ion or a calcium-binding loop.
Among the two-ionic structures of endolysins, we can mention the structure of the CHAP catalytic domain of the staphylococcal endolysin LysK, containing Ca
2+ and Zn
2+ [
29]. However, the differences with EndoT5 are radical. CHAP LysK domain is a papain-type cysteine protease that hydrolyzes the D-Ala-Gly bond in staphylococcal peptidoglycan (between D-alanine of the tetra-peptide stem and the first glycine of the penta-glycine cross-bridge). In the CHAP structure (PDB ID: 4CSH, [
29]), unlike EndoT5, the calcium ion is tightly bound by the canonical EF loop. The zinc ion in the LysK structure is coordinated not by the H-D-H triad, as in EndoT5, but by the C57 cysteine residue, is weakly bound, and its role is presumably to regulate the admission of the substrate to the catalytic site. Interestingly, the structure of the CHAP domain of a close LysK homologue, LysGH15 (PDB ID: 4OLK), does not contain zinc ions, whereas calcium is bound similarly to the classical EF loop [
30].
Comparative analysis of EndoT5 protein globules with one and two ions shows that the mechanism of calcium activation of EndoT5 is based on the formation and stabilization of the catalytically active “open” conformation of the active site. Binding of Ca
2+ by side chains D113, N115, and S117 (positions 1, 3, and 5 of the regulatory loop) locally changes the secondary structure, reduces the mobility of the loop, fixing it on the globular domain. We recently showed that tryptophan residue W114, which is strictly conserved among metallopeptidases of the M15_C subfamily, plays an important role in maintaining the affinity for regulatory Ca
2+ [
31]. The presence of tryptophan at position 2 of the calcium-binding loop is not typical for canonical EF motifs. However, in endolysins of the M15_C peptidase family, this residue affects the configuration of the hydrophobic core of the globule, and is possibly involved in the binding of the substrate, peptidoglycan [
31]. Apparently, calcium binding by the EndoT5 regulatory loop regulates the spatial position of the conserved tryptophan residue, which mediates fixation of the loop on the globular domain, while simultaneously entering into hydrophobic interactions with the substrate. As a result, the entire protein globule of the enzyme passes into a catalytically active conformation, which is open relative to the inactive single-ion form, which is clearly visible while comparing
Figure 5A and
Figure 5B.
It should be noted that NMR experiments on the relaxation dispersion of populations of the conformational ensemble of enzyme molecules using the example of adenylate kinase, discussed in the review [
32], showed that the conformational exchange between open and closed conformations is not exclusively a function of catalytic turnover, since it is also observed in free enzyme [
33]. This was confirmed by relaxation experiments using the example of a ligand-dependent enzyme,
E. coli dihydrofolate reductase [
34]. The conformational dynamics of the enzyme was ligand dependent, where the binary complexes with a bound cofactor or product fluctuated into a conformation resembling a ternary complex (with ligand and substrate/product), which correlated with the dynamics of chemical shifts. Therefore, binding (or release) of a ligand can change the ratio of substates in the conformational ensemble, as well as the kinetic and thermodynamic parameters governing conformational equilibrium. From the viewpoint of the conformational selection theory, EndoT5 calcium binding should be considered not as a mechanism for inducing a conformational transition, but as a tool for shifting the equilibrium in a population of molecules towards a catalytically active open conformation with higher energy. In this case, the high value of the calcium dissociation constant reflects the height of the energy barrier between the free and calcium-bound states.
The structural basis for the activation of EndoT5 by calcium ions is similar to mechanism that underlies the work of calcium “sensors”, proteins, the binding of calcium to which causes intramolecular changes leading to the exposure of the hydrophobic regions of the protein globule to the surface of the “sensor”, thereby promoting interaction with other proteins [
34]. A similar mechanism is characteristic, for example, of Ca
2+-calmodulin. Here, two domains of this universal protein form a hydrophobic channel capable of binding target proteins, such as kinases, phosphatases, transcription factors, cytoskeletal components, and transporters [
35]. Therefore, calcium ions mediate signal transduction in the cells, mainly of eukaryotic origin. Conformational rearrangements also underlie the work of the Extracellular Calcium-Sensing Receptor (CaR), where upon the Ca
2+ binding, the open cleft of the Venus Flytrap domain closes inducing, in turn, conformational changes in both the transmembrane domain and the intracellular domains that are believed to initiate signal transduction [
36].
In our case, obviously, calcium binding plays a similar structural role, promoting the enzymatic function of the protein, and not signaling. The low affinity of EndoT5 for regulatory Ca
2+ indicates that the binding of the ion does not occur in the cytoplasm of the host cell of this
E. coli phage, where the Ca
2+ concentration is much lower than the dissociation constant, but in the periplasm serving as a calcium “depot”, where, as was shown by Jones et al. [
37]), the Ca
2+ concentration reaches 1 mM, and where the EndoT5 substrate, peptidoglycan, is located. Probably, like calcium sensors, binding of calcium by EndoT5 triggers conformational changes (or conformational selection) leading to the predominance of protein forms exposure of hydrophobic amino acids, primarily those located in the W114 regulatory loop involved in the interaction with the substrate. The activated enzyme destroys the layer of peptidoglycan, which is the main factor in the strength of bacterial cell walls and the last obstacle to the exit of phage progeny from the host cell. Therefore, activation of the bacteriophage T5 endolysin by periplasmic calcium is one of the triggers for cell lysis by this lytic bacteriophage.
Finally, results of this analysis should be considered from the viewpoint of the proteoform concept, according to which, a single gene can encode a diverse set of structurally and functionally distinct protein molecules [
38]. The structural and functional diversification of proteoforms is achieved through various mechanisms that alter the chemical structure of gene-encoded proteins: at the DNA level via allelic variations such as single or multiple point mutations, indels, and SNPs; at the mRNA level through alternative splicing and other pre-translational events; and at the protein level via a wide array of PTMs [
38,
39,
40,
41,
42,
43]. Additionally, intrinsic disorder and structural alterations induced by protein function contribute to this structural and functional diversification [
44].
One should also keep in mind that protein structure is a highly dynamic entity characterized by an exceptional spatio-temporal heterogeneity and containing a continuous spectrum of differently structured/disordered conformations [
45]. Proteins structure can be envisioned as a complex conformational mosaic containing foldons (independently foldable protein units), inducible foldons (intrinsically disordered regions (IDRs) capable of at least partial folding triggered by the interactions with binding partners), morphing inducible foldons (IDRs with the potential to fold differently at binding to different partners), semi-foldons (regions that are always in a semi-folded form), non-foldons (IDRs that never fold), and even unfoldons (ordered regions undergoing order-to-disorder transition to make protein active). Importantly, these differently (dis)ordered regions can be found within one protein molecule, where they might have different functions. This defines the protein structure-function continuum model postulating that a given protein exists as a dynamic conformational ensemble containing multiple proteoforms of different origin (conformational/basic, inducible/modified, and functioning) characterized by a broad spectrum of structural features and possessing different functional potentials [
46,
47,
48,
49]. The existence of different proteoforms combined with the highly heterogeneous spatio-temporal organization of functional proteins challenges the oversimplified "one gene – one protein – one function" model in favor of a more nuanced "one gene – many proteins – many functions" paradigm [
44,
46,
47,
49,
50,
51,
52,
53,
54].
In line with these considerations, the present study provides important structural grounds for better understanding of the ion-dependent functioning proteoforms of EndoT5. In fact, with its Ca
2+-binding induced structural and dynamic changes, EndoT5 serves as an illustrative example of basic (or conformational, or intrinsic) and functioning proteoforms. Here, basic (or intrinsic, or conformational) proteoforms originate from the fact that structurally, a functional protein represents a dynamic conformational ensemble, members of which have different structures and might have different functions. Furthermore, since protein structural ensemble can be affected by protein function, interaction with specific partners, or even just placement of a protein inside its natural cellular environment that contains high concentrations of various biological macromolecules [
55,
56,
57], has limited available volume [
58], and contains restricted amounts of free water [
55,
59,
60,
61,
62,
63]) protein functionality itself represents a factor generating functioning proteoforms [
44,
50].
Figure 6 illustrates this ideas showing the Ca
2+ binding-induced transformation of a fuzzy EndoT5-Zn
2+ (which is a functioning basic proteoform itself, as it is rather flexible and contains a bound Zn
2+ ion) into a less flexible EndoT5-Zn
2+Ca
2+ thereby promoting the enzymatic function of the protein. In other words, Ca
2+ binding reduces structural flexibility of the basic/intrinsic/conformational proteoform and creates a less flexible functioning proteoform capable of catalytic activity.