3.1. AlphaFold predicted Versus experimentally determined hGDH1 and hGDH2 structure
The AlphaFold provided a satisfactory prediction of the experimental 3D-structures of the hGDH1 and hGDH2 protein (Figure 2). The predicted protein structures presented in this study, show all the important domains found in each subunit of the hexameric glutamate dehydrogenases. These domains include a glutamate binding region towards the N terminus, a NAD binding domain, and a regulatory domain consisting of the antenna and the pivot helix.
We initially explored whether the hGDH1 and hGDH2 structures predicted from their sequences using AlphaFold Colab were accurate. To answer this, the predicted hGDH1 structure derived from AlphaFold Colab and the experimentally determined hGDH1 structure (PDB entry 1L1F) were superimposed (at a total of 3,418 atoms) using PyMOL. The RMSD value between the two superimposed structures was estimated to be 1.745 Å (Figure 2a). Similarly, the predicted hGDH2 structure derived from AlphaFold Colab and the experimental hGDH2 structure (PDB entry 6G2U) were superimposed using PyMOL, at a total of 3,278 atoms. The RMSD between the AlphaFold Colab structure and the experimental template was 0.895Å (Figure 2b). Thus, the comparisons of the AlphaFold predicted structures with the experimentally determined, reported in PDB, structures highlight the ability of this approach to adequately predict the structures of the individual domains.
Also, comparison of the AlphaFold derived structures corresponding to the nodes B, C, D and E in
Figure 1 with the experimentally determined hGDH1 and hGDH2 structures gave comparable results with these described above (
Figure 3 and
Figure 4). Specifically, the comparison of experimental hGDH1 with proteins predicted for nodes B, C, D and E gave RMSD values of 1.695 Å, 1.714 Å, 1.766 Å and 1.680 Å, respectively (
Figure 3). For hGDH2, these values were calculated to be 0.905Å, 0.896Å, 0.943Å and 0.895Å, respectively (
Figure 4).
3.2. hGDH2 AlphaFold Colab predicted structures during evolution that led to humans
Based on the phylogenetic tree of the primates and the gene sequence in different modern-day species, it was found that the retrotransposition event, that led to the emerge of the
GLUD2 gene, occurred after the separation of the phylogenetic branches of the great apes of the Old World and the African green monkey, almost 23 billion years ago [
4]
. The human
GLUD1 gene that encodes for hGDH1 has remained unchanged for the last 23 million years. This indicates that it is an ortholog of and essentially identical to the original GDH gene (node A,
Figure 1) in the common ancestor of modern great apes which gave rise to the
GLUD2 gene through retrotransposition. Thus, we have good reason to support that the experimentally determined hGDH1 structure corresponds to that of the common ancestral enzyme.
In the common ancestor of humans and modern apes, seven amino acid substitutions occurred during the first evolutionary step following the retrotransposition event (node B, Figure 1). These were Ala3Val, Glu34Lys, Asp142Glu, Ser174Asn, Arg443Ser, Gly456Ala and Asn498Ser. During the second evolutionary step, after the separation of the gibbon branch, six amino acid substitutions (Val3Leu, Arg39Gln, Lys299Arg, Ser331Thr, Met370Leu, Arg470His) appeared (node C, Figure 1). Finally, on the last two steps (nodes D and E, Figure 1) three (Ile166Val, Gly247Arg, Ala321Val) and two (Arg299Lys, Met415Leu) substitutions, respectively, led to the current hGDH2 protein in humans.
The GDH predicted structures corresponding to node A and node B were superimposed (at a total of 3,171 atoms) using PyMOL and the RMSD value between the two models was 0.112Å (Figure 1, Figure 5). Similarly, the GDH2 node B and node C predicted structures were superimposed (at 3,212 atoms) and the RMSD value was 0.102Å (Figure 1, Figure 5). The RMSD value between the superimposed GDH2 node C and node D structures (at 3,088 atoms) as well as the GDH2 D and node E structures (at 3,1997 atoms) were 0.095Å and 0.080Å, respectively (Figure 1, Figure 5). Finally, GDH2 node A and node E predicted structures were superimposed (at 3,088 atoms) and the RMSD value was 0.122Å (Figure 1, Figure 5).
3.4. Great ape GDH2 AlphaFold Colab predicted structures during evolution
During great apes’ evolution, after the separation of the gibbon branch (node B, Figure 1) three substitutions (Thr101Ala, Leu377Val, Glu8Lys) emerged and led to the establishment of the current gibbon GDH2 protein. Similarly, five amino acid substitutions (Ile239Asn, Leu240Val, Ile275Val, Leu375Val, Gln441Arg) appeared after the separation of the orangutan branch (Node C, Figure 1) that led to emerge of the current orangutan GDH2 enzyme. The establishment of the gorilla and chimpanzee protein was due to the emergence of four (Ser66Cys, Lys362Arg, Leu365Gln, Glu439Asp) and two (Ile305Leu, Val321Ile) amino acid substitutions, respectively, after the separation of their phylogenetic branches (node D and E, respectively, Figure 1).
The model structure corresponding to the common ape ancestor was superimposed, using PyMOL, with every predicted structure model for each ape (
Figure 7). The RMSD value between the chimpanzee protein and the common ancestor protein was 0.151Å (3,106 atoms), while the RMSD value between the gorilla predicted structure and the ancestor predicted structure was 0.139Å (3,180 atoms). Similarly, RMSD values from the superposition of the gibbon protein and the orangutan protein with the common ancestor predicted structure were 0.109Å (3,289 atoms) and 0.147Å (3,280 atoms), respectively. These results are comparable to the same calculations for modern day hGDH2 (0.122Å).
Figure 7.
Superposition of the model structure corresponding to the common ape ancestor (blue) with every predicted structure model for each ape(green). (a) chimpanzee GDH2-common ape ancestor. The RMSD value was estimated to be 0.151Å. (b) gorilla-common ape ancestor. ). The RMSD value was estimated to be 0.139Å. (c) orangutan-common ape ancestor. The RMSD value was estimated to be 0.129Å. (d) Gibbon-common ape ancestor. The RMSD value was estimated to be 0.109Å.
Figure 7.
Superposition of the model structure corresponding to the common ape ancestor (blue) with every predicted structure model for each ape(green). (a) chimpanzee GDH2-common ape ancestor. The RMSD value was estimated to be 0.151Å. (b) gorilla-common ape ancestor. ). The RMSD value was estimated to be 0.139Å. (c) orangutan-common ape ancestor. The RMSD value was estimated to be 0.129Å. (d) Gibbon-common ape ancestor. The RMSD value was estimated to be 0.109Å.
3.4. Mutational and intramolecular interactions analysis
Totally 18 evolutionary amino acid substitutions (with 15 of them still present in modern humans) were analyzed to predict the result of each amino acid substitution during hGDH2 evolution (nodes A to E, Figure 1). The evaluation of the effect of the amino acid substitutions on protein stability by the consensus indicated that 50% of the mutated sites generated a stabilizing effect and 50% a destabilizing effect (Table 2 and Table S1, Figure S1, Figure 8, 9 and 10). Since our findings revealed that the amino acid substitutions occurring during great apes’ evolution are altering the free energy and the dynamicity of the enzyme, we aimed to investigate the impact of these amino acid replacements on the intramolecular interactions (Table 3). Structure-based analysis by DynaMut, using the hGDH1 structure as template, revealed that the amino acid substitutions were significantly affecting these intramolecular interactions (Table 3).
In specific, during the separation of the phylogenetic branches of the Old-World apes and the African green monkey seven amino acid changes emerged (Node A, Figure 1). Ala3Val, Asp142Glu, Ser174Asn and Gly456Ala increased protein stability based on the consensus of methods (Table 2, Figure 9). On the other hand, Glu34Lys, Arg443Ser and Asn498Ser decreased protein stability (Table 2, Figure 9). The Ala3Val substitution led to loss of a bond with Ser1 and an interaction with Ala5. The substitution of the negatively charged Glu34 by Lys led to the establishment of new interatomic interactions. The Asp142Glu substitution led to significant changes in the interatomic interactions as old were lost, and new interactions were observed. New interactions with Tyr99 and Pro137 were observed in the Ser174Asn substitution. Interactions with Glu439 and Phe440, and Ala447 were lost in the Arg443Ser substitution, while new ones were observed with Phe440 and Ala447. The Gly456Ala substitution led to significant changes in the interatomic interactions, as two bonds with His454 and Tyr459 and one with Thr460 were lost. On the other hand, 2 new hydrophobic interactions with Phe387, 3 bonds with Val453, His454 and Tys459, and one with Ile452 were observed. Finally, the Asn498Ser substitution led to the loss of the hydrophobic interactions with Phe494 and Ile52. On the other hand, new interactions were observed with Gly501, Ala500 and Phe494, respectively.
Five of the six amino acid substitutions that occurred after the separation of the gibbon phylogenetic branch (node B, Figure 1; Table 2) had a destabilizing effect on protein structure, with only Met370Leu increasing protein stability. No significant changes were observed in the interatomic interactions during the Val3Leu and Ser331Thr substitutions as only a bond with Ser1 was gained and an interaction with Gln334 was lost, respectively. Lysine 299 from α1 helix is able to make H-bonds and electrostatic interactions with residues form α2 helix and β1 strand (as it is depicted on Figure 8), therefore connecting all these elements together. The Lys299Arg substitution leads to even more interactions (Figure 8D) and a higher intraconnection of these secondary structure elements (Figure 8). Several hydrophobic interactions with Ile347, Phe230, Met237 were lost in the Met370Leu substitution, while new hydrophobic interactions were observed with Tyr236, Leu479 and Leu481. Also, new bonds were observed with Ile347. The Arg470His substitutions led to the loss of a bond with Met473 and Ala472. Finally, no changes in the intramolecular interactions were noted in the Arg39Gln mutant.
During the separation of the orangutan phylogenetic branch (node C, Figure1) three amino acid substitutions emerged. Ile166Val and Gly247Arg were found to destabilize the protein structure whereas Ala321Val was found to have an opposite stabilizing effect (Table 2; Figure 9; Figure 10). The Ile166Val substitution led to significant changes in the hydrophobic interactions as new bonds with Gly160 and Ile162 emerged. Also, bonds with Gly163 were observed. A bond with Ile318 was lost in the Ala321Val mutant, while several new hydrophobic interactions were observed with Tyr314, Ile318, Val252, Cys323. Additionally, two new bonds were observed with Cys323 and Lys344. Significant changes were observed in the interatomic interactions during the Gly247Arg substitution as new r bonds with Lys249 were gained.
Arg299Lys and Met415Leu that emerged during the separation of the Homo branch from the chimpanzee branch (node E,
Figure 1), decreased and increased protein stability, respectively. On the other hand, new hydrophobic interactions were observed with Gln301 and Phe256. Finally, several interatomic interactions were lost in the Met415Leu mutant. On the other hand, new hydrophobic interactions were observed with Gln301 and Phe256.
Figure 1. Table 2). Results showed that most of these substitutions were destabilizing.
Figure 1.
Effect of Lysine 299 substitution by Arginine. Intramolecular interactions from position 299 (green GDH1/PDB ID 1L1F and cyan GDH2/PDB ID6G2U) when it is occupied by a lysine (panels A, B and C) and an arginine (modelled in 6G2U PDB file). Lys 299 lies on α1 helix and makes H-bonds with the carbonyl group of Phe 256 (α2 helix) and electrostatic interactions/H-bonds with Aso 279 ( β1 strand). When this position is occupied by an Arginine the number of possible interactions with the same elements increases.
Figure 1.
Effect of Lysine 299 substitution by Arginine. Intramolecular interactions from position 299 (green GDH1/PDB ID 1L1F and cyan GDH2/PDB ID6G2U) when it is occupied by a lysine (panels A, B and C) and an arginine (modelled in 6G2U PDB file). Lys 299 lies on α1 helix and makes H-bonds with the carbonyl group of Phe 256 (α2 helix) and electrostatic interactions/H-bonds with Aso 279 ( β1 strand). When this position is occupied by an Arginine the number of possible interactions with the same elements increases.