Crystal Structure of Pyrrolysyl-tRNA Synthetase from a Methanogenic Archaeon ISO4-G1 and its Structure-Based Engineering for Highly-Productive Cell-Free Genetic Code Expansion with Non-Canonical Amino Acids

Tatsuo Yanagisawa; Eiko Seki; Hiroaki Tanabe; Yoshifumi Fujii; Kensaku Sakamoto; Shigeyuki Yokoyama

doi:10.20944/preprints202302.0445.v1

Submitted: 22 February 2023

Posted: 27 February 2023

Abstract

Pairs of pyrrolysyl-tRNA synthetase (PylRS) and tRNA^Pyl from Methanosarcina mazei and Methanosarcina barkeri are widely used for site-specific incorporations of non-canonical amino acids into proteins (genetic code expansion). Previously, we achieved full productivity of cell-free protein synthesis for bulky non-canonical amino acids, including N^ε-((((E)-cyclooct-2-en-1-yl)oxy)carbonyl)-L-lysine (TCO*Lys), by using Methanomethylophilus alvus PylRS with structure-based mutations in and around the amino acid binding pocket (first-layer and second-layer mutations, respectively). Recently, the PylRS•tRNA^Pyl pair from a methanogenic archaeon ISO4-G1 was used for genetic code expansion. In the present study, we determined the crystal structure of the methanogenic archaeon ISO4-G1 PylRS (ISO4-G1 PylRS) and compared it with those of structure-known PylRSs. Based on the ISO4-G1 PylRS structure, we attempted the site-specific incorporation of N^ε-(p-ethynylbenzyloxycarbonyl)-L-lysine (pEtZLys) into proteins, but it was much less efficient than that of TCO*Lys with M. alvus PylRS mutants. Nevertheless, the first-layer mutations (Y125A and M128L) of ISO4-G1 PylRS, with no additional second-layer mutations, increased the protein productivity with pEtZLys up to 57 ± 8% of that with TCO*Lys, at high enzyme concentrations in the cell-free protein synthesis.

Keywords: non-canonical amino acids; genetic code expansion; crystal structure; tRNA; cell-free protein synthesis

Subject: Biology and Life Sciences - Biochemistry and Molecular Biology

1. Introduction

Expanding the genetic code with non-canonical amino acids is useful for developing novel structures and functions of proteins [reviewed in 1,2]. Site-specific incorporation of non-canonical amino acids into proteins in response to specified (e.g., UAG) codons has been achieved by pairs of an engineered aminoacyl-tRNA synthetase (aaRS) and tRNA, including the pairs of pyrrolysyl-tRNA synthetase (PylRS) and tRNA^Pyl_(CUA) [2,3,4,5,6,7,8,9,10]. The PylRS•tRNA^Pyl pair was first found in methanogenic archaea, including Methanosarcina barkeri [11,12], and in bacteria, including Desulfitobacterium hafniense [13,14]. The PylRS•tRNA^Pyl pairs from M. barkeri and Methanosarcina mazei have been extensively studied [reviewed in 10,15,16,17,18,19,20,21]. Recently, the site-specific incorporations of non-canonical amino acids into proteins have also been achieved by using the pairs of PylRS and tRNA^Pyl from Methanomethylophilus alvus, the methanogenic archaeon ISO4-G1, the methanogenic archaeon ISO4-H5, Methanomassiliicoccus intestinalis, and Methanomassiliicoccus luminyensis [22,23,24,25,26,27,28,29,30].

The PylRS•tRNA^Pyl pairs are useful for non-canonical amino acid incorporation because of their “orthogonality” (non-reactivity) to the 20 canonical aaRS•tRNA pairs in many organisms [15,16,17,18,19]. PylRS and its mutants showed broad specificity for substrate amino acids, and by using the PylRS•tRNA^Pyl pairs, site-specific incorporations of more than 200 non-canonical amino acids into proteins have been achieved in bacteria including Escherichia coli, and eukaryotes including Saccharomyces cerevisiae, mammalian cells, and multicellular organisms [reviewed in 10,15,16,17,18,19,20,21,31], and by cell-free protein synthesis based on an E. coli cell extract [32,33,34,35,36,37]. Cell-free protein synthesis systems, which are novel protein expression platforms, are particularly suitable for synthesizing cell-toxic proteins and transmembrane proteins that are difficult to synthesize in cellular systems, and can efficiently introduce non-canonical amino acids into such proteins for pharmaceutical research.

The efficiencies of non-canonical amino acid incorporations into proteins at the UAG codon are usually lower than that of the standard protein synthesis with a canonical amino acid at the corresponding position. The incorporation efficiency of a non-canonical amino acid at multiple sites in a protein is much lower than that of the single-site incorporation. Therefore, by using E. coli strains lacking translation termination factor 1 (RF-1) to achieve the complete reassignment of the UAG codon [32,38,39,40,41,42,43], we have increased the incorporation efficiencies of non-canonical amino acids to the maximum level, which was previously designated as the full productivity of the expanded genetic code [26].

M. mazei PylRS (MmPylRS) and M. barkeri PylRS (MbPylRS) consist of the N- and C-terminal domains (PylRSn and PylRSc, respectively). Another group of PylRSs from bacteria, including D. hafniense, is composed of two separate gene products (PylSn and PylSc), which are homologous to PylRSn and PylRSc, respectively [12,13,14,44]. The PylRSc protein exhibited higher solubility than the full-length PylRS protein and was easily crystallized [45]. However, the PylRSc protein retained insufficient tRNA binding and aminoacylation activities [46,47]. Consequently, both the N- and C-terminal domains of PylRS (i.e., full-length PylRS) have been regarded as essential components for the efficient incorporation of non-canonical amino acids into proteins. Notably, recently discovered methanogenic archaea, including M. alvus, M. intestinalis, M. luminyensis, ISO4-G1, and ISO4-H5, have PylSc homologs, but lack the genes encoding PylSn homologs in their genomes [22,48]. The high solubility of the PylSc-type M. alvus PylRS makes it suitable for crystallographic analysis and cell-free protein synthesis for non-canonical amino acid incorporation [26].

Crystal structures of PylRSs from M. mazei and D. hafniense and their complexes have been extensively investigated [13,14,44,47,49]. The catalytic fragment (residues 185–454) of M. mazei PylRS (MmPylRSc) has been crystallized [45], and the structures of MmPylRSc and its mutants in complex with numerous substrate amino acids, aminoacyladenylates, and ATP and its analogs have been determined [8,45,47,49,50,51,52,53,54,55,56,57,58,59,60]. Based on the structural information and random screening, we obtained MmPylRS with the Y306A and Y384F mutations [MmPylRS(Y306A/Y384F)] [8]. The pair of MmPylRS(Y306A/Y384F) and tRNA^Pyl is one of the useful aaRS•tRNA pairs for cellular and cell-free genetic code expansion with bulky non-natural lysine derivatives, including N^ε-benzyloxycarbonyl-L-lysine (ZLys), N^ε-(o-azidobenzyloxycarbonyl)-L-lysine (oAzZLys), and N^ε-(m-azidobenzyloxycarbonyl)-L-lysine (mAzZLys). A variety of applications using the pair have been developed [8,26,61,62]. The crystal structures of the catalytic fragment of MmPylRS(Y306A/Y384F) [MmPylRSc(Y306A/Y384F)] in complex with 14 bulky non-natural lysine derivatives, including ZLys, mAzZLys, and N^ε-((((E)-cyclooct-2-en-1-yl)oxy)carbonyl)-L-lysine (TCO*Lys), revealed the structural bases for their amino acid binding modes [58].

We recently determined the crystal structure of M. alvus PylRS (MaPylRS) [26]. The MaPylRS with the Y126A (corresponding to the Y306A mutation in M. mazei PylRS) and M129L mutations efficiently incorporated bulky non-canonical amino acids, including ZLys and mAzZLys, into proteins in E. coli cells [24,26]. Tyr126 and Met129 are the “first-layer residues”, located within ~5 Å of (and in direct contact with) substrate amino acids, while His227 and Tyr228 are the “second-layer residues”, which are located within ~9 Å of (and do not directly contact) substrates but might affect the first-layer residues. This resembles the concept of first-shell and second-shell residues, respectively, as reported by others [63]. The MaPylRS with both the first-layer (Y126A and M129L) and second-layer (H227I and Y228P) mutations enhanced the protein productivities drastically to the maximum level (i.e., full productivities) for ZLys, mAzZLys, and the more difficult amino acid TCO*Lys by E. coli cell-free protein synthesis, as compared with those obtained with the MmPylRS(Y306A/Y384F) and MaPylRS(Y126A/M129L) pairs [26]. Recently, the crystal structures of MaPylRS(N166S/V168C/W239T) and MaPylRS(N166A/C168G/W239C) mutants in complex with the fluorescent non-canonical amino acid acridonylalanine and ATP (or its non-hydrolyzable analog, adenylyl imidodiphosphate (AMPPNP)) were determined, and the conformational changes of the residues including His227 upon non-canonical amino acid-binding were discussed [64]. It should be emphasized that MaPylRS is much more useful than MmPylRS for cell-free genetic code expansion with bulky non-canonical amino acids [26]. However, some bulky non-canonical amino acids still cannot be incorporated by using the MaPylRS mutant system. To synthesize proteins in which these bulky non-canonical amino acids are efficiently incorporated, we should compare the structures of PylRSs from archaea, and create active mutants for inefficient bulky non-canonical amino acids based on these structures.
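The distance-based part of the first-layer/second-layer definition above can be illustrated with a minimal Python sketch. It is not the procedure used in this work: the function names and the toy coordinates are hypothetical, the cutoffs are simply the approximate ~5 Å and ~9 Å values quoted in the text, and the additional criterion of direct contact with the substrate is not modeled.

```python
import math

# Approximate cutoffs quoted in the text (angstroms); treated here as hard limits,
# which is an assumption made only for this illustration.
FIRST_LAYER_CUTOFF = 5.0
SECOND_LAYER_CUTOFF = 9.0


def min_distance(atoms_a, atoms_b):
    """Smallest distance between any atom of one group and any atom of the other."""
    return min(math.dist(a, b) for a in atoms_a for b in atoms_b)


def classify_layer(residue_atoms, substrate_atoms):
    """Label a residue by its closest approach to the substrate."""
    d = min_distance(residue_atoms, substrate_atoms)
    if d <= FIRST_LAYER_CUTOFF:
        return "first-layer"
    if d <= SECOND_LAYER_CUTOFF:
        return "second-layer"
    return "outside"


# Toy coordinates (hypothetical; not taken from any PDB entry).
substrate = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]
near_residue = [(4.0, 0.0, 0.0)]  # 2.5 Å from the closest substrate atom
far_residue = [(9.0, 0.0, 0.0)]   # 7.5 Å from the closest substrate atom

print(classify_layer(near_residue, substrate))
print(classify_layer(far_residue, substrate))
```

In practice the atom coordinates would come from a substrate-bound structure, and the labels would still need visual inspection, since the text defines the layers by contact as well as by distance.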

The PylSc-type PylRS from the methanogenic archaeon ISO4-G1 (ISO4-G1 PylRS) is highly similar to MaPylRS with 64% sequence identity, while ISO4-G1 PylRS and MaPylRS both share around 38% sequence identity with M. mazei PylRSc. However, the pair of PylRS and tRNA^Pyl from the methanogenic archaeon ISO4-G1 (ISO4-G1 PylRS•tRNA^Pyl) is orthogonal to the M. mazei pair, but the M. alvus pair is not [23,27]. The mechanism by which ISO4-G1 PylRS recognizes tRNA^Pyl and a variety of non-canonical amino acids in its active site must be elucidated for understanding its substrate specificity and orthogonality, and for achieving excellent genetic code expansion systems. Recently, the crystal structure of ISO4-G1 PylRS with the L124A, Y125L, V167A, Y204W, and A221S mutations for the incorporation of the non-canonical amino acid m-cyanopyridylalanine was determined in the apo form [29].

In the present study, we solved the crystal structure of the wild-type ISO4-G1 PylRS apo form, and compared it with that of the multiple mutant of ISO4-G1 PylRS [29]. In the structure of the wild-type ISO4-G1 PylRS, His225 appears to be located in a different position from that in the multiple mutant ISO4-G1 PylRS structure. In this context, in MaPylRS, the conformational changes of the corresponding His227 residue are considered to be important for the PylRS activity. Therefore, the structural changes of the conserved His residues may be common features between the two PylRSs, and might be the driving force for the movement of the specific hairpin (the β5-β6 hairpin, which will be described later) and thus crucial for the PylRS activity. Furthermore, in the present study, the ISO4-G1 PylRS mutants engineered based on the ISO4-G1 PylRS structures have been applied for the milligram-scale preparation of proteins containing useful non-canonical amino acids, including TCO*Lys and N^ε-(p-ethynylbenzyloxycarbonyl)-L-lysine (pEtZLys), in the cell-free protein synthesis system. These rationally engineered ISO4-G1 PylRS mutants will be more useful than ever before for genetic code expansion technologies.

2. Results

2.1. Overall Structure of ISO4-G1 PylRS

The genome of the methanogenic archaeon ISO4-G1 encodes an ISO4-G1 PylRS protein consisting of 273 amino acids [48], which is quite similar to MaPylRS [22,48](Figure S1). ISO4-G1 PylRS was expressed very well as a soluble protein in Escherichia coli cells. The yield of the ISO4-G1 PylRS protein was over 100 mg per liter E. coli culture (Figure S2), and the protein could be concentrated without aggregation to more than 20 mg/mL, which is comparable to that of M. alvus PylRS. For crystallographic analysis, ISO4-G1 PylRS was purified to homogeneity by three column chromatography steps (Materials and Methods). The crystallization of the ISO4-G1 PylRS protein was successful when using PEG3350 as the precipitant, and the crystal structure of the ISO4-G1 PylRS apo form has been determined at 2.78-Å resolution (Figure 1, Materials and Methods). The structure of the ISO4-G1 PylRS protein is shown in Figure 1. The asymmetric unit contains ten molecules of PylRS (five PylRS dimers, A/B, C/G, D/F, E/H, J/I). The final model shows good geometry and all residues are within the allowed regions of the Ramachandran plot, as evaluated by Procheck [65] and Molprobity [66] (Table S1).

2.2. Structure-Based Sequence Comparison of ISO4-G1 PylRS with Other PylRSs

In the ISO4-G1 PylRS structure, the residues from Met1 to Pro58 contain two α-helices (α1 and α2) (Figures 1b and S1). The following residues Asn60-Asp273 constitute the catalytic domain, with the characteristic topology of the class-II aaRSs: an extended six-strand, anti-parallel β-sheet (β3, β4, β5, β6, β7, β8) surrounded by α-helices. The sequence motifs (motifs 1, 2, and 3), which are conserved in class-II aaRSs [67,68,69], correspond to the residues Gly78-Val88, Cys147-Leu164, and Ala238-Lys251, respectively, in ISO4-G1 PylRS (Figures 1b and S1). The structures of the ordering loop (residues Ile97-Gln106) and the motif-2 loop (residues Lys150-Glu159) are quite similar in the ten PylRS molecules, while the β5-β6 hairpin (residues Thr195-Ile212) adopts open and closed conformations (Figures 1 and S1). The β5-β6 hairpin in ISO4-G1 PylRS corresponds to the β5-β6 hairpin in MaPylRS and the β7-β8 hairpin in MmPylRSc, which randomly adopt open and closed conformations as described previously [26,47].

2.3. Structural Comparison of ISO4-G1 PylRS with MaPylRS, DhPylSc, and MmPylRSc

The structure of ISO4-G1 PylRS is highly homologous with those of MaPylRS, DhPylSc, and MmPylRSc (Figure 2). A DALI search [http://www.embl-ebi.ac.uk/dali] revealed that the ISO4-G1 PylRS structure (molecule B) resembles those of the PylRS proteins (PDBs: 6JP2, 6EZD, 7R6O, 2ZNI, 2ZNJ, 3DSQ, 2E3C, 4CH3, 2ZIN, and 2ZIM), with Z-scores of 31.4–33.7, 30.0–30.6, 33.0–33.3, 30.5–30.7, 29.6–30.9, 30.2, 28.9, 28.6, 28.4, and 27.6, respectively. Residues Asp61–Asp273 of ISO4-G1 PylRS constitute the catalytic core and superimposed well on MaPylRS, DhPylSc (residues Ala72–Asn286), and MmPylRSc (residues Tyr242–Asn453), but the two α-helices (α1 and α2, residues Met1–Ala56) in the N-terminal 59 residues of ISO4-G1 PylRS are slightly different from those of the other PylRSs. The superimposition of the ISO4-G1 PylRS structure on those of MaPylRS (PDB: 6JP2), MmPylRSc (PDB: 2ZIM), and DhPylSc•tRNA^Pyl (PDB: 2ZNI) revealed that the N-terminal two α-helices of ISO4-G1 PylRS and MmPylRSc cause steric hindrance with tRNA^Pyl, in contrast to those of MaPylRS and DhPylSc (Figures 2 and S3). Accordingly, the two α-helices of PylRSs might undergo conformational changes upon tRNA^Pyl binding.

2.4. Structural Comparison of the Amino Acid Binding Residues of ISO4-G1 PylRS with Those of MaPylRS and MmPylRSc

Based on the structure-based sequence alignments of PylRSs (Figure S1), Ala121, Leu124, Tyr125, Asn165, Leu227, and Trp237 in the amino acid binding pocket of ISO4-G1 PylRS are conserved among MaPylRS (Ala122, Leu125, Tyr126, Asn166, Leu229, and Trp239, respectively) and MmPylRSc (Ala302, Leu305, Tyr306, Asn346, Leu407, and Trp417, respectively), whereas the counterparts of Met128, Val167, and Ala221 in ISO4-G1 PylRS correspond to Leu309, Cys348, and Val401, respectively, in MmPylRSc. The present crystallographic analysis revealed the structural differences between ISO4-G1 PylRS, MaPylRS, and MmPylRSc (PDB: 2ZIM) (Figure 3). The ISO4-G1 PylRS Met128 and Val167 residues, which are respectively conserved as Met129 and Val168 in MaPylRS, are bulkier than the corresponding Leu309 and Cys348 residues in MmPylRSc, respectively. Therefore, the internal volumes of the amino acid binding pockets of ISO4-G1 PylRS and MaPylRS are smaller than that of MmPylRSc.

2.5. Structural Changes of Tyr204 and His225 in ISO4-G1 PylRS and Comparison with Those of MmPylRS and MaPylRS

Tyr204, at the tip of the β5-β6 hairpin in ISO4-G1 PylRS molecules A, C, E, F, and J, is located far from the amino acid binding pocket (the open conformation), while in ISO4-G1 PylRS molecules B, D, G, H, and I, Tyr204 is located inside the amino acid binding pocket (the closed conformation) (Figure 2, Figure 4, Figure 5, and Figure S4).

Therefore, the β5-β6 hairpin undergoes random conformational changes. The Tyr204 side-chain in molecule D is disordered and thus might be in an intermediate form (Figure 5f). The β5-β6 hairpin in ISO4-G1 PylRS is similar to that of MaPylRS, while a remarkable difference exists between the β5-β6 hairpin in ISO4-G1 PylRS and the β7-β8 hairpin in MmPylRSc. As described previously, the β7-β8 hairpin in MmPylRSc is very flexible and adopts multiple conformations regardless of the substrate binding [47] (Figure 2 and Figure 4). In the MmPylRSc structure (PDB: 2ZIM), the β7-β8 hairpin is bent in the middle, and the tip half of the β-hairpin is elevated. Tyr384, at the tip of the bent β7-β8 hairpin, is buried deeply within the active site [47,49]. While the β5-β6 hairpin in ISO4-G1 PylRS still assumes bent conformations, Tyr204 of ISO4-G1 PylRS is not as deeply accommodated within the active site as Tyr384 of MmPylRSc (Figure 2, Figure 3, Figure 4a,b,h). In the crystal structure of the ISO4-G1 PylRS mutant for cyanopyridylalanine [29], Trp204, which is substituted for the strictly conserved Tyr residue in the PylRS family, at the tip of the bent β5-β6 hairpin is only slightly inserted into the active site as compared to Tyr204 of the ISO4-G1 PylRS apo form (Figure 4c and Figure S5a). In contrast, in the crystal structures of the acridonylalanine (and ATP/AMPPNP)-bound MaPylRS mutants [64], Tyr206 at the tip of the bent β5-β6 hairpin seems to penetrate more deeply within the active site than Tyr206 in the apo form of MaPylRS (Figure 4f and Figure S5b). The Tyr204/Trp204 residues of the ISO4-G1 PylRSs in the apo form are shallowly inserted within their active sites, as compared to Tyr206 of the MaPylRS mutant. Accordingly, the two structures of ISO4-G1 PylRS are ligand-free forms, in which Tyr204/Trp204 adopt open and partially closed conformations (Figure 4a–c, and Figure S5a), whereas the MaPylRS mutant structure represents the amino acid substrate and ATP/AMPPNP-bound form, and Tyr206 adopts the completely closed conformation (Figure 4f,h, Figure S5b,c). Notably, Tyr206 of the AMPPNP-bound form of the MaPylRS mutant was completely disordered (Figure S5b). Consequently, in the case of MaPylRS, the bound amino acid substrate may induce Tyr206 at the tip of the β5-β6 hairpin to adopt the closed conformation.
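The chain-by-chain assignments of the β5-β6 hairpin given at the start of this section, together with the dimer pairings listed in Section 2.1, can be tabulated with a short Python sketch. This is only a bookkeeping aid under the assignments quoted in the text; the variable and function names are hypothetical, and no conclusion beyond the tabulation itself is implied.

```python
# Tyr204 / hairpin state per molecule in the asymmetric unit, as stated in the text.
OPEN_CHAINS = set("ACEFJ")
CLOSED_CHAINS = set("BDGHI")  # molecule D has a disordered Tyr204 side-chain

# Dimers in the asymmetric unit, as listed in Section 2.1.
DIMERS = [("A", "B"), ("C", "G"), ("D", "F"), ("E", "H"), ("J", "I")]


def hairpin_state(chain):
    """Return 'open' or 'closed' for a chain identifier."""
    if chain in OPEN_CHAINS:
        return "open"
    if chain in CLOSED_CHAINS:
        return "closed"
    raise KeyError(f"unknown chain: {chain}")


def summarize_dimers():
    """List each dimer with the hairpin state of both subunits."""
    return [(a, hairpin_state(a), b, hairpin_state(b)) for a, b in DIMERS]


for a, state_a, b, state_b in summarize_dimers():
    print(f"{a}/{b}: {a}={state_a}, {b}={state_b}")
```

With the assignments as quoted, every listed dimer pairs one open subunit with one closed subunit; whether this reflects crystal packing or an intrinsic property of the dimer is not addressed in the text.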

Interestingly, the ISO4-G1 PylRS structure revealed that the His225 residue (corresponding to His227 in MaPylRS) undergoes drastic conformational changes in accordance with the location of the Tyr204 residue (corresponding to Tyr206 in MaPylRS) in the β5-β6 hairpin (Figure 2, Figure 4 and Figure 5). On the one hand, when Tyr204 is far from the amino acid binding pocket (the open conformation), a π-π stacking interaction is observed between the imidazole ring of His225 and the indole ring of Trp237. On the other hand, when Tyr204 is located inside the amino acid binding pocket (the closed conformation), the imidazole ring of His225 shifts and is stabilized by a π-π stacking interaction with the aromatic ring of Tyr204. This conformational change is not observed in the corresponding His227 residue of MaPylRS, according to the structure of the MaPylRS apo form (Figure 4d,e, Figure 5, and Figure S5b). However, the recently determined structure of the MaPylRS mutant in complex with the non-canonical amino acid acridonylalanine (and AMPPNP) revealed the conformational changes of residues 224–230, and the movement of His227 away from the active site upon acridonylalanine-binding (Figure 4f and Figure S5b) [64]. In contrast, no conformational changes of the corresponding Ile405 residue in MmPylRS are observed (Figure 4g,h, and Figure S5c). Accordingly, structural changes of the conserved His residue (His225/His227) are a common feature of ISO4-G1 PylRS and MaPylRS, but are not shared by the corresponding Ile405 of MmPylRS.

2.8. Structure-Based Engineering of the First-Layer Residues in ISO4-G1 PylRS for Site-Specific Incorporation of Bulky Lysine Derivatives into Proteins by Cell-Free Protein Synthesis

Previously, we developed a system for genetic code expansion with bulky ZLys derivatives by using the MmPylRS(Y306A/Y384F)•tRNA^Pyl pair [8,58,62] and the MaPylRS(Y126A/M129L)•tRNA^Pyl pair [24,26]. To examine whether the ISO4-G1 PylRS•tRNA^Pyl pair is useful for genetic code expansion, we rationally engineered two ISO4-G1 PylRS mutants from the PylRS structures. In the previous study, the MaPylRS(Y126A/M129L)•tRNA^Pyl and MaPylRS(Y126A/M129A)•tRNA^Pyl pairs successfully facilitated the site-specific incorporation of TCO*Lys and mAzZLys into proteins in an E. coli cell-free protein synthesis system [26]. Therefore, the Y126A/M129L and Y126A/M129A mutations of the “first-layer residues”, which directly contact the substrate amino acids, were transplanted into the corresponding sites (Tyr125 and Met128) of ISO4-G1 PylRS. The Y125A mutation (corresponding to Y306A in M. mazei PylRS and Y126A in M. alvus PylRS) enlarges the ISO4-G1 PylRS active site pocket, which then becomes suitable for accommodating bulky non-canonical amino acids [26]. In ISO4-G1 PylRS, the Met128 side-chain protrudes into the amino acid binding pocket (Figure 3), which would reduce the pocket size as compared with that of MmPylRSc. The Leu and Ala mutations at position 128 would enlarge the inner space of the active site pocket (Figure 3). All of the ISO4-G1 PylRS mutant proteins were quite soluble, and over 100 mg quantities of the purified ISO4-G1 PylRS proteins per liter E. coli culture were obtained (Materials and methods).
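The transplantation of mutations between PylRS numbering schemes can be sketched as a simple lookup, using only the residue correspondences listed in Section 2.4. The table layout and the `transplant` helper below are hypothetical illustrations, not part of the authors' workflow; the MaPylRS counterpart of ISO4-G1 Ala221 is not stated in that section and is therefore left empty.

```python
import re

# Binding-pocket residue correspondences as listed in Section 2.4:
# (ISO4-G1 PylRS, MaPylRS, MmPylRSc). None marks a counterpart not stated there.
CORRESPONDENCE = [
    ("A121", "A122", "A302"),
    ("L124", "L125", "L305"),
    ("Y125", "Y126", "Y306"),
    ("M128", "M129", "L309"),
    ("N165", "N166", "N346"),
    ("V167", "V168", "C348"),
    ("A221", None, "V401"),
    ("L227", "L229", "L407"),
    ("W237", "W239", "W417"),
]
COLUMN = {"ISO4-G1": 0, "Ma": 1, "Mm": 2}


def transplant(mutation, source, target):
    """Rename a point mutation (e.g. 'Y126A') from one enzyme's numbering to another's."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a point mutation: {mutation}")
    wild_type, position, replacement = match.groups()
    for row in CORRESPONDENCE:
        if row[COLUMN[source]] == wild_type + position:
            site = row[COLUMN[target]]
            if site is None:
                raise KeyError(f"no {target} counterpart listed for {mutation}")
            return site + replacement
    raise KeyError(f"{wild_type}{position} is not in the correspondence table")


print(transplant("Y126A", "Ma", "ISO4-G1"))
print(transplant("M129L", "Ma", "ISO4-G1"))
```

The helper keeps the wild-type residue of the target enzyme in the new name, so a mutation defined at a non-conserved site (for example, position 309 of MmPylRSc) is renamed with the target's own residue.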

Using the ISO4-G1 PylRS mutants, we tested cell-free protein synthesis for the site-specific incorporation of bulky non-canonical amino acids, such as ZLys, mAzZLys, N^ε-(p-azidobenzyloxycarbonyl)-L-lysine (pAzZLys), N^ε-(p-ethynylbenzyloxycarbonyl)-L-lysine (pEtZLys), and TCO*Lys (Scheme 1), into the N11-GFPS1 protein in response to an amber (UAG) codon at position 17, using the cell extract of the RF-1 (prfA) deletion E. coli strain B-60.ΔA::Z [26,42]. The yields of the N11-GFPS1 proteins containing ZLys, TCO*Lys, pEtZLys, mAzZLys, and pAzZLys by using the ISO4-G1 PylRS(Y125A/M128A)•tRNA^Pyl pair were 2.8, 0.9, 0.3, 2.8, 0.9 mg protein/mL reaction, respectively (132%, 41%, 15%, 131%, and 41% productivities, respectively, relative to the N11-GFPS1 control protein) (Figure 6a).

In contrast, with the ISO4-G1 PylRS(Y125A/M128L)•tRNA^Pyl pair, the yields of the N11-GFPS1 proteins containing ZLys, TCO*Lys, pEtZLys, mAzZLys, and pAzZLys were 2.7, 2.9, 0.3, 3, and 2.3 mg protein/mL reaction, respectively (125%, 136%, 14%, 140%, and 106% productivities, respectively, relative to the N11-GFPS1 control protein) (Figure 6a). The ISO4-G1 PylRS(Y125A/M128L) mutant achieved more than 100% protein productivities for ZLys, mAzZLys, pAzZLys, and TCO*Lys (Figure 6a), while the ISO4-G1 PylRS(Y125A/M128A) did so only for ZLys and mAzZLys. The amino acid-binding pocket of the ISO4-G1 PylRS(Y125A/M128A) mutant might not be suitable for TCO*Lys and pAzZLys, unlike that of the ISO4-G1 PylRS(Y125A/M128L) mutant. Unfortunately, both ISO4-G1 PylRS mutants achieved only 14–15% of the protein productivity for pEtZLys, indicating that this substrate is too large for them. In contrast, the protein productivities for pEtZLys with the MmPylRS(Y306A/Y384F)•tRNA^Pyl, MaPylRS(Y126A/M129L)•tRNA^Pyl, and MaPylRS(Y126A/M129L/H227I/Y228P)•tRNA^Pyl pairs were 0.1, 0.14, and 0.34 mg/mL reaction (5, 7, and 13% protein productivities, respectively, relative to the N11-GFPS1 control protein), which were comparable to or less than that of ISO4-G1 PylRS(Y125A/M128L) (Figure 6b). Consequently, the protein productivities of the ISO4-G1 PylRS(Y125A/M128L)•tRNA^Pyl pair with TCO*Lys were drastically enhanced, and comparable to that obtained using the MaPylRS(Y126A/M129L/H227I/Y228P)•tRNA^Pyl pair [26].
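The percent productivities above are yields expressed relative to the N11-GFPS1 control protein, i.e. productivity (%) = 100 × yield / control yield. The control yield itself is not stated in this section, so the following Python sketch only back-calculates the value implied by each reported yield/percentage pair as a consistency check; the names are hypothetical, and the results carry the rounding of the reported figures.

```python
# (yield in mg protein/mL reaction, productivity in % of control), as reported in the text.
REPORTED = {
    "Y125A/M128A": {
        "ZLys": (2.8, 132), "TCO*Lys": (0.9, 41), "pEtZLys": (0.3, 15),
        "mAzZLys": (2.8, 131), "pAzZLys": (0.9, 41),
    },
    "Y125A/M128L": {
        "ZLys": (2.7, 125), "TCO*Lys": (2.9, 136), "pEtZLys": (0.3, 14),
        "mAzZLys": (3.0, 140), "pAzZLys": (2.3, 106),
    },
}


def implied_control_yield(yield_mg_per_ml, productivity_percent):
    """Control yield implied by productivity = 100 * yield / control."""
    return yield_mg_per_ml / (productivity_percent / 100.0)


def implied_controls():
    """Implied control yield for every mutant/amino acid pair."""
    return {
        (mutant, amino_acid): implied_control_yield(y, p)
        for mutant, table in REPORTED.items()
        for amino_acid, (y, p) in table.items()
    }


for (mutant, amino_acid), control in implied_controls().items():
    print(f"{mutant:12s} {amino_acid:8s} implied control = {control:.2f} mg/mL")
```

All ten pairs imply a control yield of roughly 2.0 to 2.2 mg/mL reaction, so the reported yields and percentages are mutually consistent within rounding.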

2.9. Effects of the ISO4-G1 PylRS(Y125A/M128L) Concentration on Cell-Free Protein Synthesis with the Inefficient Amino Acid pEtZLys

Previously, we found that the protein productivities for the inefficient, bulky amino acid TCO*Lys can be enhanced by increasing the concentration of the M. alvus PylRS mutant [26]. To examine the effects of higher concentrations of the ISO4-G1 PylRS protein on non-canonical amino acid incorporation, we performed cell-free protein synthesis with the extremely inefficient, bulky, non-canonical amino acid pEtZLys, which is useful for alkyne-azide click chemistry [58,70], by using various concentrations of the ISO4-G1 PylRS protein. The protein productivities for pEtZLys were enhanced from 8% (0.19 mg protein/mL reaction) to 57% (1.32 mg protein/mL reaction) of the N11-GFPS1 control protein when the concentration of the ISO4-G1 PylRS(Y125A/M128L) protein was increased from 10 to 75 μM (Figure 6c). Therefore, we achieved the highest-ever protein productivity for pEtZLys.

The incorporations of the non-canonical amino acids into the N11-GFPS1 protein were confirmed by mass spectrometry analyses (Figure S6). The PMF analysis of the tryptic digests by MALDI-TOF mass spectrometry revealed major peaks (obsd: m/z 1,901.88 [M+H]⁺, m/z 1,925.82 [M+H]⁺, m/z 1,919.93 [M+H]⁺), which match the theoretical masses of the tryptic peptides HEHAHXENLYFQSK, where X represents ZLys, pEtZLys, and TCO*Lys, respectively (calcd: m/z 1,901.63 [M+H]⁺, m/z 1,925.64 [M+H]⁺, m/z 1,919.69 [M+H]⁺). The ESI mass analysis revealed the tryptic peptides containing mAzZLys (obsd: m/z 971.95 [M+2H]²⁺, calcd: m/z 971.95 [M+2H]²⁺; obsd: m/z 648.63 [M+3H]³⁺, calcd: m/z 648.30 [M+3H]³⁺; obsd: m/z 486.73 [M+4H]⁴⁺, calcd: m/z 486.48 [M+4H]⁴⁺) and pAzZLys (obsd: m/z 971.95 [M+2H]²⁺, calcd: m/z 971.95 [M+2H]²⁺; obsd: m/z 648.64 [M+3H]³⁺, calcd: m/z 648.30 [M+3H]³⁺; obsd: m/z 486.48 [M+4H]⁴⁺, calcd: m/z 486.48 [M+4H]⁴⁺). These results confirmed that the efficient synthesis of the full-length N11-GFPS1 protein containing non-canonical amino acids occurs without any non-specific suppression of the UAG codon with canonical amino acids in the cell-free system.
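The calculated m/z values for the multiply charged ESI ions follow from the standard relation m/z = (M + z·m_p)/z, where M is the neutral peptide mass and m_p the proton mass. A minimal Python sketch, assuming a proton mass of 1.007276 Da and taking the calculated [M+2H]²⁺ value from the text as the starting point (the function names are hypothetical):

```python
PROTON_MASS = 1.007276  # Da; standard value, assumed here for the illustration


def neutral_mass(mz, charge):
    """Neutral mass M recovered from an [M+zH]z+ ion."""
    return charge * (mz - PROTON_MASS)


def mz_for_charge(mass, charge):
    """m/z of the [M+zH]z+ ion for a neutral mass M."""
    return (mass + charge * PROTON_MASS) / charge


# Calculated [M+2H]2+ of the mAzZLys/pAzZLys-containing peptide, from the text.
mass = neutral_mass(971.95, 2)
for z in (2, 3, 4):
    print(f"[M+{z}H]{z}+ : m/z {mz_for_charge(mass, z):.2f}")
```

Starting from 971.95 for the doubly charged ion, the sketch reproduces the calculated values of 648.30 and 486.48 quoted for the triply and quadruply charged ions; mAzZLys and pAzZLys are positional isomers, which is why their calculated values coincide.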

2.10. Effects of the Second-Layer Mutations of ISO4-G1 PylRS for Site-Specific Incorporation of Bulky Lysine Derivatives into Proteins by Cell-Free Protein Synthesis

We found that the productivities of N11-GFPS1 proteins containing ZLys, mAzZLys, and TCO*Lys obtained with the ISO4-G1 PylRS(Y125A/M128L) system were comparable to or higher than those from the MaPylRS(Y126A/M129L/H227I/Y228P) system (Figure 6a) [26]. Previously, the second-layer IP (H227I/Y228P) additional mutations of MaPylRS(Y126A/M129L) extensively improved the protein productivities for mAzZLys and TCO*Lys [26]. However, mutations of the second-layer residues His227 and Tyr228 in MaPylRS [corresponding to the Ile405 and Pro406 residues in MmPylRSc, respectively (Figure 4, Figures S1 and S5)] might affect the first-layer residues, which interact directly with substrate amino acids within the amino acid binding pocket [26].

Interestingly, the ISO4-G1 PylRS structure revealed that ISO4-G1 PylRS His225, corresponding to MaPylRS His227, undergoes drastic conformational changes in accordance with the location of Tyr204 (corresponding to Tyr206 in MaPylRS) in the β5-β6 hairpin (Figure 5 and Figure S5). We introduced the H225A mutation in ISO4-G1 PylRS and examined its effects on protein productivity. The ISO4-G1 PylRS(H225A) mutant had significantly decreased protein productivities for non-canonical amino acids (Figure 7). The yields of the N11-GFPS1 proteins containing N^ε-(t-butyloxycarbonyl)-L-lysine (BocLys) and N^ε-propargyloxycarbonyl-L-lysine (PocLys) (Scheme 1) were only 0.096 and 0.048 mg protein/mL reaction, respectively (4% and 2% productivities, respectively, of the N11-GFPS1 control protein) (Figure 7). These biochemical and crystallographic analyses confirmed that the His225 residue is crucial for the ISO4-G1 PylRS activity and cannot be replaced.

3. Discussion

In the present study, we determined the crystal structure of ISO4-G1 PylRS, and by its structure-based engineering, we achieved full productivity of cell-free protein synthesis according to the expanded genetic code with a variety of bulky non-canonical amino acids. By introducing two mutations into the first layer of the amino acid-binding pocket in ISO4-G1 PylRS, we achieved full productivity of cell-free synthesis with ZLys, TCO*Lys, mAzZLys, and pAzZLys. The first-layer mutant of ISO4-G1 PylRS required no additional second-layer mutations for the full productivity with these bulky non-canonical amino acids. Even with the much bulkier and most inefficient non-canonical amino acid, pEtZLys, we finally achieved the highest-ever levels of protein productivity by using the ISO4-G1 PylRS(Y125A/M128L) protein at a 7.5-fold higher concentration than the standard protocol. So far, this drastic improvement of protein productivity for pEtZLys has never been accomplished with the M. mazei and M. alvus systems.

Previously, we introduced the Y126A mutation of MaPylRS (corresponding to the Y306A mutation of MmPylRS), and the M129L or M129A mutation in the first-layer residues [26]. We found that simply transplanting the MaPylRS(Y126A/M129L or Y126A/M129A) mutations into ISO4-G1 PylRS was appropriate for bulky non-canonical amino acids. The two ISO4-G1 PylRS mutants (Y125A/M128L and Y125A/M128A) with enlarged amino acid binding pockets achieved full productivity and showed much higher activities than those of MmPylRS(Y306A/Y384F) for ZLys, mAzZLys, pAzZLys, and TCO*Lys (Figure 3a and Figure 6a). However, the full productivity level has not yet been achieved for more difficult non-canonical amino acids, such as pEtZLys. Because ISO4-G1 PylRS, as well as MaPylRS, is highly water-soluble, ISO4-G1 PylRS mutants can be used in the cell-free reaction at much higher concentrations than that of the standard protocol. Consequently, the yield of the pEtZLys-incorporated protein reached 1.3 mg/mL per cell-free reaction (57% productivity level of the control protein synthesis) when the concentration of the ISO4-G1 PylRS(Y125A/M128L) protein was increased up to 75 μM (Figure 6c).

The higher catalytic activity of ISO4-G1 PylRS than that of MaPylRS in the cell-free system was achieved for the site-specific incorporation of N^ε-(2-(trimethylsilyl)ethoxycarbonyl)-L-lysine into proteins [28]. The molecular mechanism underlying this higher catalytic activity of ISO4-G1 PylRS than those of MaPylRS and MmPylRS remains unknown. Based on the crystal structure of ISO4-G1 PylRS (Figures 2, 4, and 5), the β5-β6 hairpin may exist in a dynamic open-closed equilibrium, and the location and conformational change of the His225 residue appear to be important for the catalytic activity. The ISO4-G1 PylRS His225 residue is conserved as His227 in MaPylRS, and undergoes a drastic conformational change upon non-canonical amino acid (and AMPPNP)-binding (Figure 4 and Figure S5) [64]. However, in MaPylRS, His227 does not interact with Tyr206 and Trp239, in contrast to the interactions of His225 with Tyr204 and Trp237 in ISO4-G1 PylRS. The ISO4-G1 PylRS mutant with His225 replaced by Ala abolished the protein productivities for non-canonical amino acids (Figure 7). In the case of ISO4-G1 PylRS, the His225Ala mutation might eliminate the interactions of the residue at position 225 with Tyr204 and Trp237. In the above-mentioned dynamic open-closed equilibrium of the hairpin, the degree of movement of the hairpin in ISO4-G1 PylRS may be comparable to those in MaPylRS and MmPylRS, concerning the tip positions between the open and closed forms, although we still lack ISO4-G1 PylRS structures bound to amino acid substrates (Figure 2, Figure 3, Figure 4, and Figure S5). The interactions of His225 with Tyr204 and Trp237 in ISO4-G1 PylRS (Figure 5), which are not observed in MaPylRS, appear to be a driving force for the rapid conformational changes of the β5-β6 hairpin. The elucidation of the molecular mechanism underlying the higher catalytic activities of the ISO4-G1 PylRS mutants based on the PylRS structures will lead to the development of a next-generation platform for producing non-canonical amino acid-incorporated proteins.

In the present study, we demonstrated that the ISO4-G1 PylRS system extensively improved the protein productivities, even for the very difficult non-canonical amino acid pEtZLys, which had not been achieved by the MmPylRS and MaPylRS systems. The ISO4-G1 PylRS•tRNA^Pyl pair, rationally engineered based on the ISO4-G1 PylRS crystal structures, will serve as a more useful tool for next-generation genetic code expansion technologies.

4. Materials and Methods

4.1. Materials

Biochemical and molecular biological procedures were performed with commercially available materials, enzymes, and chemicals. ZLys was purchased from Bachem (Switzerland). mAzZLys, pEtZLys, and pAzZLys were purchased from Sundia (China). BocLys was purchased from Watanabe Chemical (Japan). PocLys and TCO*Lys were purchased from SiChem GmbH (Germany).

4.2. Bacterial Strains and Plasmids

The DNA fragment encoding PylRS (ISO4-G1 PylRS, residues 1–273) from the methanogenic archaeon ISO4-G1 was chemically synthesized (Integrated DNA Technologies), PCR-amplified, and cloned into the pET28c vector. The E. coli BL21-Gold(DE3) strain (Novagen) was used for the expression of PylRS proteins.

4.3. Expression and Purification of PylRS Proteins

The pET28c vector plasmids containing the ISO4-G1 PylRS gene were transformed into the E. coli BL21-Gold(DE3) strain, and selected on LB agar plates supplemented with 50 μg/ml kanamycin. A single colony was grown at 37 °C in broth culture, containing 15 g tryptone, 7.5 g yeast extract, and 15 g NaCl per liter, supplemented with 50 μg/ml kanamycin. The protein expression was induced with 1 mM IPTG when the OD₆₀₀ reached 0.6. The cultivation temperature was then lowered to 20 °C, and the culture was continued overnight. The E. coli cells were collected by centrifugation and stored at –80 °C. The cells were resuspended in 50 mM potassium phosphate buffer (pH 7.4), containing 500 mM NaCl, 25 mM imidazole, 5 mM β-mercaptoethanol, and protease inhibitor cocktail (Complete-EDTA free ULTRA, Roche) (buffer C), and were sonicated on ice. The cell lysate was centrifuged at 15,000 × g for 15 min at 4 °C, and the supernatant was applied to a HisTrap column (Cytiva), which was equilibrated with buffer C. The protein was eluted with buffer C containing 400 mM imidazole, instead of 25 mM imidazole, and peak fractions were collected. The protein fractions were pooled, concentrated, and applied to a HiLoad 16/60 Superdex 200 column (Cytiva), equilibrated with 30 mM potassium phosphate buffer (pH 7.4), containing 200 mM NaCl and 1 mM DTT. The eluted fraction was collected and dialyzed against 40 mM potassium phosphate buffer (pH 7.4), containing 50 mM NaCl and 1 mM DTT (buffer B). The histidine-tag peptide derived from the pET28c vector was cleaved with thrombin protease (1 u per 0.1 mg PylRS protein, Sigma-Aldrich) at 4 °C overnight. The dialyzed fraction was then loaded on a HiTrap Q column (Cytiva), and after washing the column with buffer B, the bound proteins were eluted by a linear gradient of 50–635 mM NaCl.
The eluted fractions were pooled, concentrated, and applied to a HiLoad 16/60 Superdex 200 column (Cytiva), equilibrated with 30 mM potassium phosphate buffer (pH 7.4), containing 200 mM NaCl and 1 mM DTT. The eluted fractions were collected, dialyzed against 10 mM Tris-HCl buffer (pH 8.0), containing 150 mM NaCl, 10 mM MgCl2, and 10 mM β-mercaptoethanol, and concentrated by ultracentrifugation to 16.2 mg/ml. Aliquots of the ISO4-G1 PylRS protein were flash-cooled in liquid nitrogen and stored at –80 °C. The MaPylRS and MmPylRS proteins were purified as described previously [26]. The histidine-tagged PylRS proteins were purified by chromatography on HisTrap and Superdex 200 HiLoad columns. After dialysis, the eluted PylRS proteins were concentrated by ultracentrifugation.

4.4. Preparation of tRNA^Pyl Transcripts

The tRNA^Pyls from the methanogenic archaeon ISO4-G1, M. alvus, and M. mazei were transcribed in vitro with T7 RNA polymerase, using the PCR-amplified DNA fragment as the template. The tRNA transcripts were precipitated with isopropanol, applied onto a Resource Q column (Cytiva) equilibrated with 10 mM Tris-HCl buffer (pH 7.5), containing 5 mM MgCl₂ and 50 mM NaCl, and eluted by a linear gradient of 0.05–0.7 M NaCl. The purified tRNA^Pyl transcripts were precipitated with ethanol and dissolved in 10 mM Tris-HCl buffer (pH 7.5) containing 5 mM MgCl₂.

4.5. Crystallization, Data Collection, and Structure Determination

All crystallization screenings were performed by the sitting-drop vapor-diffusion method, by mixing 0.2 μl of the ISO4-G1 PylRS protein solution with 0.2 μl of reservoir solution, using a Mosquito liquid handling robot (TTP Labtech). Crystals were grown at 20 °C in conditions with 100 mM HEPES-NaOH buffer (pH 7.2), 20% PEG3350, and 200 mM KCl. The crystal was transferred to 100 mM HEPES-NaOH buffer (pH 7.2) containing 20% PEG3350, 200 mM KCl, and 18% trehalose, mounted on a nylon loop, and flash-cooled in liquid nitrogen. The X-ray diffraction datasets were collected at the beamline BL32XU in SPring-8 (Harima, Japan) at –173 °C and were processed with XDS [71]. The crystal of ISO4-G1 PylRS belongs to the space group P2₁2₁2₁, with unit cell parameters of a=98.51 Å, b=102.68 Å, c=349.86 Å, and α=β=γ=90°. The phases were determined by the molecular replacement method with Phaser, using the MaPylRS structure (PDB: 6JP2) as the search model. Ten ISO4-G1 PylRS molecules were found per asymmetric unit, with a solvent content of 56.6%. Iterative cycles of model refinement by PHENIX [72] and manual model building with Coot [73] were performed. The final model was validated with MolProbity [66] and PROCHECK [65]. Graphical images were prepared with PyMOL [http://pymol.sourceforge.net/]. The statistics of the data collection and refinement, including the R_work and R_free factors, are summarized in Table S1.
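
The reported solvent content can be cross-checked against the unit cell with the Matthews coefficient, V_M = V_cell / (Z × n × M), and the standard approximation solvent fraction ≈ 1 − 1.23 / V_M. The Python sketch below uses the cell parameters and the ten molecules per asymmetric unit given above. The protomer mass is not stated in the text; the value of about 31.2 kDa used here is an assumption for a 273-residue chain, and the function name is illustrative.

```python
# Orthorhombic unit cell lengths from the text, in angstroms.
A, B, C = 98.51, 102.68, 349.86

ASU_PER_CELL = 4        # space group P2(1)2(1)2(1) has four asymmetric units per cell
MOLECULES_PER_ASU = 10  # from the text

# ASSUMPTION: approximate mass of one 273-residue protomer, in Da.
# This value is not given in the text and should be replaced by the real mass.
ASSUMED_PROTOMER_MASS_DA = 31_200.0


def matthews(mass_da, molecules_per_asu=MOLECULES_PER_ASU):
    """Return (V_M in cubic angstroms per Da, estimated solvent fraction)."""
    cell_volume = A * B * C  # all cell angles are 90 degrees
    v_m = cell_volume / (ASU_PER_CELL * molecules_per_asu * mass_da)
    # 1.23 corresponds to a typical protein partial specific volume (~0.74 cm^3/g).
    solvent_fraction = 1.0 - 1.23 / v_m
    return v_m, solvent_fraction


v_m, solvent = matthews(ASSUMED_PROTOMER_MASS_DA)
print(f"V_M = {v_m:.2f} A^3/Da, solvent = {100 * solvent:.1f}%")
```

Under the assumed mass, the estimate falls close to the reported 56.6%, which is consistent with ten protomers per asymmetric unit; a different protomer mass shifts the estimate accordingly.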

4.6. Cell-Free Protein Synthesis and Purification of GFP Proteins Containing Non-Canonical Amino Acids

Cell-free coupled transcription/translation was performed as described previously [32,33,35,74], using pCR2.1-TOPO bearing the gene encoding an N11-tagged superfolder-type green fluorescent protein mutant (N11-GFPS1) [75]. The tRNA^Pyl transcripts were prepared by in vitro transcription [35,47]. The PylRS proteins were overproduced in E. coli BL21-Gold(DE3) cells and purified as described previously [8]. The pCR2.1-N11-GFPS1 plasmids containing the wild-type N11-GFPS1 gene or the mutant with a single UAG codon at position Ala17 were used as the template DNAs for cell-free protein synthesis with S30 extracts from RF-1 (prfA) deletion E. coli B-60ΔA::Z cells [42] harboring a pMINOR plasmid encoding rare codon tRNAs [76]. The reaction components for the incorporation of non-natural lysine derivatives at position 17 in N11-GFPS1 were as follows: 2 μg/ml template plasmid, 10 μM PylRS, 10 μM tRNA^Pyl, and 1 mM non-natural lysine derivative. After an overnight incubation at 25 °C, the synthesized full-length N11-GFPS1 proteins were quantified as described previously, using an ARVO Victor2 V Multilabel Counter plate fluorescence reader (PerkinElmer) [35]. The N11-GFPS1 proteins were purified as follows. After centrifugation of the reaction solution, the supernatant fractions were loaded on a Ni-Sepharose High Performance column (Cytiva). The column was washed with 50 mM potassium phosphate buffer (pH 7.4), containing 500 mM NaCl, 25 mM imidazole, and 5 mM β-mercaptoethanol, and the proteins were then eluted with 50 mM potassium phosphate buffer (pH 7.4), containing 500 mM NaCl, 400 mM imidazole, and 5 mM β-mercaptoethanol. The PMF analyses of the N11-GFPS1 proteins, containing ZLys, mAzZLys, pAzZLys, pEtZLys, and TCO*Lys, were performed as described above.
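
Setting up the reaction from stock solutions is a simple dilution calculation (stock volume = total volume × final concentration / stock concentration). The Python sketch below uses the final concentrations listed above; the stock concentrations and the 30 μl reaction volume are hypothetical placeholders that are not taken from this study, and the function name is illustrative.

```python
# Final concentrations from the text.
FINAL = {
    "template plasmid (ug/ml)": 2.0,
    "PylRS (uM)": 10.0,
    "tRNAPyl (uM)": 10.0,
    "non-natural lysine derivative (mM)": 1.0,
}

# ASSUMPTION: hypothetical stock concentrations, in the same units as FINAL.
STOCK = {
    "template plasmid (ug/ml)": 100.0,
    "PylRS (uM)": 500.0,
    "tRNAPyl (uM)": 200.0,
    "non-natural lysine derivative (mM)": 50.0,
}


def stock_volumes(total_ul, final=FINAL, stock=STOCK):
    """Return the stock volume (in ul) of each component for one reaction."""
    volumes = {}
    for name, final_conc in final.items():
        stock_conc = stock[name]
        if stock_conc < final_conc:
            raise ValueError(f"stock of {name} is too dilute")
        volumes[name] = total_ul * final_conc / stock_conc
    return volumes


# ASSUMPTION: a 30 ul reaction, chosen only for illustration.
for name, volume in stock_volumes(30.0).items():
    print(f"{name}: {volume:.2f} ul")
```

The same function can be reused for the higher enzyme concentrations tested in Figure 6c by passing a modified `final` dictionary, provided that the PylRS stock is concentrated enough.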

6. Patents

A PCT international patent application [WO2020/045656 A1] related to this work has been filed.

Supplementary Materials

The following supporting information can be downloaded at: www.mdpi.com/xxx/s1, Figure S1: Structure-based sequence alignments of ISO4-G1 PylRS and other PylRSs; Figure S2: Chromatogram for the purification of ISO4-G1 PylRS by Superdex 200 size-exclusion chromatography; Figure S3: Structural comparison of ISO4-G1 PylRS with MaPylRS, MmPylRSc, and DhPylSc•tRNA^Pyl; Figure S4: Electron density map of the β5-β6 region in the ISO4-G1 PylRS structure; Figure S5: Conformational changes of the active-site residues in the open and closed forms of the ISO4-G1 PylRS, MaPylRS, and MmPylRS structures; Figure S6: Mass spectrometry analysis of N11-GFPS1 proteins containing non-canonical amino acids; Table S1: Data collection and refinement statistics.

Author Contributions

Conceptualization, T.Y., S.Y.; Investigation, T.Y., H.T., Y.F., E.S.; Methodology, T.Y., E.S., H.T., Y.F.; Project Administration, K.S., S.Y.; Writing – Original Draft, T.Y.; Writing – Review & Editing, T.Y., S.Y.; Funding Acquisition, T.Y., S.Y.

Funding

This work was supported by the following grants: the Platform Project for Supporting Drug Discovery and Life Science Research (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED under Grant Number JP17am0101081 (S.Y.); Leading Advanced Projects for Medical Innovation (LEAP) from AMED under Grant Number JP19gm0010001 (S.Y.); the Takeda Science Foundation (S.Y.); and MEXT grant numbers 16K05859 and 24550203 (T.Y.).

Data Availability Statement

The coordinates and structure factors have been deposited in the RCSB Protein Data Bank (ID code 8IFJ for ISO4-G1 PylRS).

Acknowledgments

We would like to thank the staff of the beamline BL32XU at SPring-8 (Harima, Japan), as well as Dr. Takaho Terada, Takako Imada, and Tomoko Nakayama for clerical assistance. We thank the Support Unit for Bio-Material Analysis, RIKEN CBS Research Resources Division, especially Kaori Otsuki and Shino Kurata, for the mass spectrometry analysis.

Conflicts of Interest

T.Y., E.S., K.S. and S.Y. are co-inventors on the patent [WO2020/045656 A1] related to this work. S.Y. is a founder and shareholder of LiberoThera Co., Ltd.

Abbreviations

Mm: Methanosarcina mazei; Mb: Methanosarcina barkeri; Ma: Methanomethylophilus alvus; Dh: Desulfitobacterium hafniense; ISO4-G1: The methanogenic archaeon ISO4-G1; BocLys: N^ε-(t-butyloxycarbonyl)-L-lysine; PocLys: N^ε-propargyloxycarbonyl-L-lysine; ZLys: N^ε-benzyloxycarbonyl-L-lysine; mAzZLys: N^ε-(m-azidobenzyloxycarbonyl)-L-lysine; pAzZLys: N^ε-(p-azidobenzyloxycarbonyl)-L-lysine; pEtZLys: N^ε-(p-ethynylbenzyloxycarbonyl)-L-lysine; TCO*Lys: N^ε-((((E)-cyclooct-2-en-1-yl)oxy)carbonyl)-L-lysine; MALDI-TOF MS: matrix-assisted laser desorption/ionization time-of-flight mass spectrometry; PMF: Peptide mass fingerprinting.

References

Wang, L.; Xie, J.; Schultz, P.G. Expanding the genetic code. Annu. Rev. Biophys. Biomol. Struct. 2006, 35, 225–249.
Liu, C.C.; Schultz, P.G. Adding New Chemistries to the Genetic Code. Annu. Rev. Biochem. 2010, 79, 413–444.
Blight, S.K.; Larue, R.C.; Mahapatra, A.; Longstaff, D.G.; Chang, E.; Zhao, G.; Kang, P.T.; Green-Church, K.B.; Chan, M.K.; Krzycki, J.A. Direct charging of tRNACUA with pyrrolysine in vitro and in vivo. Nature 2004, 17, 503–507.
Polycarpo, C.; Ambrogelly, A.; Bérubé, A.; Winbush, S.A.M.; McCloskey, J.A.; Crain, P.F.; Wood, J.L.; Söll, D. An aminoacyl-tRNA synthetase that specifically activates pyrrolysine. Proc. Natl. Acad. Sci. USA 2004, 101, 12450–12454.
Ambrogelly, A.; Gundllapalli, S.; Herring, S.; Polycarpo, C.; Frauer, C.; Söll, D. Pyrrolysine is not hardwired for cotranslational insertion at UAG codons. Proc. Natl. Acad. Sci. USA 2007, 104, 3141–3146.
Neumann, H.; Peak-Chew, S.Y.; Chin, J.W. Genetically encoding Nε-acetyllysine in recombinant proteins. Nat. Chem. Biol. 2008, 4, 232–234.
Mukai, T.; Kobayashi, T.; Hino, N.; Yanagisawa, T.; Sakamoto, K.; Yokoyama, S. Adding L-lysine derivatives to the genetic code of mammalian cells with engineered pyrrolysyl-tRNA synthetases. Biochem. Biophys. Res. Commun. 2008, 371, 818–822.
Yanagisawa, T.; Ishii, R.; Fukunaga, R.; Kobayashi, T.; Sakamoto, K.; Yokoyama, S. Multistep Engineering of Pyrrolysyl-tRNA Synthetase to Genetically Encode Nε-(o-Azidobenzyloxycarbonyl) lysine for Site-Specific Protein Modification. Chem. Biol. 2008, 15, 1187–1197.
Chen, P.R.; Groff, D.; Guo, J.; Ou, W.; Cellitti, S.; Geierstanger, B.H.; Schultz, P.G. A facile system for encoding unnatural amino acids in mammalian cells. Angew. Chem. Int. Ed. 2009, 48, 4052–4055.
Wan, W.; Tharp, J.M.; Liu, W.R. Pyrrolysyl-tRNA synthetase: An ordinary enzyme but an outstanding genetic code expansion tool. Biochim. Biophys. Acta Proteins Proteomics 2014, 1844, 1059–1070.
Hao, B.; Gong, W.; Ferguson, T.K.; James, C.M.; Krzycki, J.A.; Chan, M.K. A new UAG-encoded residue in the structure of a methanogen methyltransferase. Science 2002, 296, 1462–1466.
Srinivasan, G.; James, C.M.; Krzycki, J.A. Pyrrolysine encoded by UAG in archaea: Charging of a UAG-decoding specialized tRNA. Science 2002, 296, 1459–1462.
Lee, M.M.; Jiang, R.; Jain, R.; Larue, R.C.; Krzycki, J.; Chan, M.K. Structure of Desulfitobacterium hafniense PylSc, a pyrrolysyl-tRNA synthetase. Biochem. Biophys. Res. Commun. 2008, 374, 470–474.
Nozawa, K.; O’Donoghue, P.; Gundllapalli, S.; Araiso, Y.; Ishitani, R.; Umehara, T.; Söll, D.; Nureki, O. Pyrrolysyl-tRNA synthetase-tRNAPyl structure reveals the molecular basis of orthogonality. Nature 2009, 457, 1163–1167.
Chin, J.W. Expanding and Reprogramming the Genetic Code of Cells and Animals. Annu. Rev. Biochem. 2014, 83, 379–408.
Crnković, A.; Suzuki, T.; Söll, D.; Reynolds, N.M. Pyrrolysyl-tRNA synthetase, an aminoacyl-tRNA synthetase for genetic code expansion. Croat. Chem. Acta 2016, 89, 163–174.
Brabham, R.; Fascione, M.A. Pyrrolysine Amber Stop-Codon Suppression: Development and Applications. ChemBioChem 2017, 18, 1973–1983.
Chin, J.W. Expanding and reprogramming the genetic code. Nature 2017, 550, 53–60.
Wang, L. Engineering the Genetic Code in Cells and Animals: Biological Considerations and Impacts. Acc. Chem. Res. 2017, 50, 2767–2776.
Vargas-Rodriguez, O.; Sevostyanova, A.; Söll, D.; Crnković, A. Upgrading aminoacyl-tRNA synthetases for genetic code expansion. Curr. Opin. Chem. Biol. 2018, 46, 115–122.
Tharp, J.M.; Ehnbom, A.; Liu, W.R. tRNAPyl: Structure, function, and applications. RNA Biol. 2017, 15, 441–452.
Willis, J.C.W.; Chin, J.W. Mutually orthogonal pyrrolysyl-tRNA synthetase/tRNA pairs. Nat. Chem. 2018, 10, 831–837.
Meineke, B.; Heimgärtner, J.; Lafranchi, L.; Elsässer, S.J. Methanomethylophilus alvus Mx1201 Provides Basis for Mutual Orthogonal Pyrrolysyl tRNA/Aminoacyl-tRNA Synthetase Pairs in Mammalian Cells. ACS Chem. Biol. 2018, 13, 3087–3096.
Yamaguchi, A.; Iraha, F.; Ohtake, K.; Sakamoto, K. Pyrrolysyl-tRNA synthetase with a unique architecture enhances the availability of lysine derivatives in synthetic genetic codes. Molecules 2018, 23, 2460.
Beránek, V.; Willis, J.C.W.; Chin, J.W. An Evolved Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetase/tRNA Pair Is Highly Active and Orthogonal in Mammalian Cells. Biochemistry 2019, 58, 387–390.
Seki, E.; Yanagisawa, T.; Kuratani, M.; Sakamoto, K.; Yokoyama, S. Fully Productive Cell-Free Genetic Code Expansion by Structure-Based Engineering of Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetase. ACS Synth. Biol. 2020, 9, 718–732.
Meineke, B.; Heimgärtner, J.; Eirich, J.; Landreh, M.; Elsässer, S.J. Site-Specific Incorporation of Two ncAAs for Two-Color Bioorthogonal Labeling and Crosslinking of Proteins on Live Mammalian Cells. Cell Rep. 2020, 31, 107811.
Abdelkader, E.H.; Qianzhu, H.; Tan, Y.J.; Adams, L.A.; Huber, T.; Otting, G. Genetic Encoding of N6-(((Trimethylsilyl)methoxy)carbonyl)-L-lysine for NMR Studies of Protein-Protein and Protein-Ligand Interactions. J. Am. Chem. Soc. 2021, 143, 1133–1143.
Abdelkader, E.H.; Qianzhu, H.; George, J.; Frkic, R.L.; Jackson, C.J.; Nitsche, C.; Otting, G.; Huber, T. Genetic Encoding of Cyanopyridylalanine for In-Cell Protein Macrocyclization by the Nitrile-Aminothiol Click Reaction. Angew. Chem. Int. Ed. Engl. 2022, 61, e202114154.
Avila-Crump, S.; Hemshorn, M.L.; Jones, C.M.; Mbengi, L.; Meyer, K.; Griffis, J.A.; Jana, S.; Petrina, G.E.; Pagar, V.V.; Karplus, P.A.; Petersson, E.J.; Perona, J.J.; Mehl, R.A.; Cooley, R.B. Generating Efficient Methanomethylophilus alvus Pyrrolysyl-tRNA Synthetases for Structurally Diverse Non-Canonical Amino Acids. ACS Chem. Biol. 2022, 17, 3458–3469.
Yanagisawa, T.; Umehara, T.; Sakamoto, K.; Yokoyama, S. Expanded genetic code technologies for incorporating modified lysine at multiple sites. ChemBioChem 2014, 15, 2181–2187.
Mukai, T.; Yanagisawa, T.; Ohtake, K.; Wakamori, M.; Adachi, J.; Hino, N.; Sato, A.; Kobayashi, T.; Hayashi, A.; Shirouzu, M.; et al. Genetic-code evolution for protein synthesis with non-natural amino acids. Biochem. Biophys. Res. Commun. 2011, 411, 757–761.
Yanagisawa, T.; Takahashi, M.; Mukai, T.; Sato, S.; Wakamori, M.; Shirouzu, M.; Sakamoto, K.; Umehara, T.; Yokoyama, S. Multiple site-specific installations of Nε-monomethyl-L-lysine into histone proteins by cell-based and cell-free protein synthesis. ChemBioChem 2014, 15, 1830–1838.
Chemla, Y.; Ozer, E.; Schlesinger, O.; Noireaux, V.; Alfonta, L. Genetically expanded cell-free protein synthesis using endogenous pyrrolysyl orthogonal translation system. Biotechnol. Bioeng. 2015, 112, 1663–1672.
Seki, E.; Yanagisawa, T.; Yokoyama, S. Cell-free protein synthesis for multiple site-specific incorporation of noncanonical amino acids using cell extracts from RF-1 deletion E. coli strains. Methods Mol. Biol. 2018, 1728, 49–65.
Adachi, J.; Katsura, K.; Seki, E.; Takemoto, C.; Shirouzu, M.; Terada, T.; Mukai, T.; Sakamoto, K.; Yokoyama, S. Cell-free protein synthesis using S30 extracts from Escherichia coli RFzero strains for efficient incorporation of non-natural amino acids into proteins. Int. J. Mol. Sci. 2019, 20, 492.
Gerrits, M.; Budisa, N.; Merk, H. Site-Specific Chemoselective Pyrrolysine Analogues Incorporation Using the Cell-Free Protein Synthesis System. ACS Synth. Biol. 2019, 8, 381–390.
Mukai, T.; Hayashi, A.; Iraha, F.; Sato, A.; Ohtake, K.; Yokoyama, S.; Sakamoto, K. Codon reassignment in the Escherichia coli genetic code. Nucleic Acids Res. 2010, 38, 8188–8195.
Johnson, D.B.F.; Xu, J.; Shen, Z.; Takimoto, J.K.; Schultz, M.D.; Schmitz, R.J.; Xiang, Z.; Ecker, J.R.; Briggs, S.P.; Wang, L. RF1 knockout allows ribosomal incorporation of unnatural amino acids at multiple sites. Nat. Chem. Biol. 2011, 7, 779–786.
Lajoie, M.J.; Rovner, A.J.; Goodman, D.B.; Aerni, H.R.; Haimovich, A.D.; Kuznetsov, G.; Mercer, J.A.; Wang, H.H.; Carr, P.A.; Mosberg, J.A.; et al. Genomically recoded organisms expand biological functions. Science 2013, 342, 357–360.
Hong, S.H.; Ntai, I.; Haimovich, A.D.; Kelleher, N.L.; Isaacs, F.J.; Jewett, M.C. Cell-free protein synthesis from a release factor 1 deficient Escherichia coli activates efficient and multiple site-specific nonstandard amino acid incorporation. ACS Synth. Biol. 2014, 3, 398–409.
Mukai, T.; Hoshi, H.; Ohtake, K.; Takahashi, M.; Yamaguchi, A.; Hayashi, A.; Yokoyama, S.; Sakamoto, K. Highly reproductive Escherichia coli cells with no specific assignment to the UAG codon. Sci. Rep. 2015, 5, 9699.
Fredens, J.; Wang, K.; de la Torre, D.; Funke, L.F.H.; Robertson, W.E.; Christova, Y.; Chia, T.; Schmied, W.H.; Dunkelmann, D.L.; Beránek, V.; et al. Total synthesis of Escherichia coli with a recoded genome. Nature 2019, 569, 514–518.
Suzuki, T.; Miller, C.; Guo, L.T.; Ho, J.M.L.; Bryson, D.I.; Wang, Y.S.; Liu, D.R.; Söll, D. Crystal structures reveal an elusive functional domain of pyrrolysyl-tRNA synthetase. Nat. Chem. Biol. 2017, 13, 1261–1266.
Yanagisawa, T.; Ishii, R.; Fukunaga, R.; Nureki, O.; Yokoyama, S. Crystallization and preliminary X-ray crystallographic analysis of the catalytic domain of pyrrolysyl-tRNA synthetase from the methanogenic archaeon Methanosarcina mazei. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 2006, 62, 1031–1033.
Herring, S.; Ambrogelly, A.; Gundllapalli, S.; O’Donoghue, P.; Polycarpo, C.R.; Söll, D. The amino-terminal domain of pyrrolysyl-tRNA synthetase is dispensable in vitro but required for in vivo activity. FEBS Lett. 2007, 581, 3197–3203.
Yanagisawa, T.; Ishii, R.; Fukunaga, R.; Kobayashi, T.; Sakamoto, K.; Yokoyama, S. Crystallographic Studies on Multiple Conformational States of Active-site Loops in Pyrrolysyl-tRNA Synthetase. J. Mol. Biol. 2008, 378, 634–652.
Borrel, G.; Parisot, N.; Harris, H.M.B.; Peyretaillade, E.; Gaci, N.; Tottey, W.; Bardot, O.; Raymann, K.; Gribaldo, S.; Peyret, P.; et al. Comparative genomics highlights the unique biology of Methanomassiliicoccales, a Thermoplasmatales-related seventh order of methanogenic archaea that encodes pyrrolysine. BMC Genomics 2014, 15, 679.
Kavran, J.M.; Gundllapalli, S.; O’Donoghue, P.; Englert, M.; Söll, D.; Steitz, T.A. Structure of pyrrolysyl-tRNA synthetase, an archaeal enzyme for genetic code innovation. Proc. Natl. Acad. Sci. USA 2007, 104, 11268–11273.
Takimoto, J.K.; Dellas, N.; Noel, J.P.; Wang, L. Stereochemical basis for engineered pyrrolysyl-tRNA synthetase and the efficient in vivo incorporation of structurally divergent non-native amino acids. ACS Chem. Biol. 2011, 6, 733–743.
Schneider, S.; Gattner, M.J.; Vrabel, M.; Flügel, V.; López-Carrillo, V.; Prill, S.; Carell, T. Structural insights into incorporation of norbornene amino acids for click modification of proteins. ChemBioChem 2013, 14, 2114–2118.
Yanagisawa, T.; Sumida, T.; Ishii, R.; Yokoyama, S. A novel crystal form of pyrrolysyl-tRNA synthetase reveals the pre- and post-aminoacyl-tRNA synthesis conformational states of the adenylate and aminoacyl moieties and an asparagine residue in the catalytic site. Acta Crystallogr. Sect. D Biol. Crystallogr. 2013, 69, 5–15.
Flügel, V.; Vrabel, M.; Schneider, S. Structural basis for the site-specific incorporation of lysine derivatives into proteins. PLoS ONE 2014, 9, e96198.
Schmidt, M.J.; Weber, A.; Pott, M.; Welte, W.; Summerer, D. Structural basis of furan-amino acid recognition by a polyspecific aminoacyl-tRNA-synthetase and its genetic encoding in human cells. ChemBioChem 2014, 15, 1755–1760.
Guo, L.T.; Wang, Y.S.; Nakamura, A.; Eiler, D.; Kavran, J.M.; Wong, M.; Kiessling, L.L.; Steitz, T.A.; O’Donoghue, P.; Söll, D. Polyspecific pyrrolysyl-tRNA synthetases from directed evolution. Proc. Natl. Acad. Sci. USA 2014, 111, 16724–16729.
Englert, M.; Nakamura, A.; Wang, Y.S.; Eiler, D.; Söll, D.; Guo, L.T. Probing the active site tryptophan of Staphylococcus aureus thioredoxin with an analog. Nucleic Acids Res. 2015, 43, 11061–11067.
Lee, Y.J.; Schmidt, M.J.; Tharp, J.M.; Weber, A.; Koenig, A.L.; Zheng, H.; Gao, J.; Waters, M.L.; Summerer, D.; Liu, W.R. Genetically encoded fluorophenylalanines enable insights into the recognition of lysine trimethylation by an epigenetic reader. Chem. Commun. 2016, 52, 12606–12609.
Yanagisawa, T.; Kuratani, M.; Seki, E.; Hino, N.; Sakamoto, K.; Yokoyama, S. Structural Basis for Genetic-Code Expansion with Bulky Lysine Derivatives by an Engineered Pyrrolysyl-tRNA Synthetase. Cell Chem. Biol. 2019, 26, 936–949.
Jiang, H.K.; Wang, Y.H.; Weng, J.H.; Kurkute, P.; Li, C.L.; Lee, M.N.; Chen, P.J.; Tseng, H.W.; Tsai, M.D.; Wang, Y.S. Probing the Active Site of Deubiquitinase USP30 with Noncanonical Tryptophan Analogues. Biochemistry 2020, 59, 2205–2209.
Vatansever, E.C.; Yang, K.S.; Geng, Z.Z.; Qiao, Y.; Li, P.; Xu, S.; Liu, W.R. A Designed, Highly Efficient Pyrrolysyl-tRNA Synthetase Mutant Binds o-Chlorophenylalanine Using Two Halogen Bonds. J. Mol. Biol. 2022, 434, 167534.
Kato, A.; Kuratani, M.; Yanagisawa, T.; Ohtake, K.; Hayashi, A.; Amano, Y.; Kimura, K.; Yokoyama, S.; Sakamoto, K.; Shiraishi, Y. Extensive Survey of Antibody Invariant Positions for Efficient Chemical Conjugation Using Expanded Genetic Codes. Bioconjug. Chem. 2017, 28, 2099–2108.
Yamaguchi, A.; Matsuda, T.; Ohtake, K.; Yanagisawa, T.; Yokoyama, S.; Fujiwara, Y.; Watanabe, T.; Hohsaka, T.; Sakamoto, K. Incorporation of a Doubly Functionalized Synthetic Amino Acid into Proteins for Creating Chemical and Light-Induced Conjugates. Bioconjug. Chem. 2016, 27, 198–206.
Baumann, T.; Hauf, M.; Richter, F.; Albers, S.; Möglich, A.; Ignatova, Z.; Budisa, N. Computational aminoacyl-tRNA synthetase library design for photocaged tyrosine. Int. J. Mol. Sci. 2019, 20, 2343.
Gottfried-Lee, I.; Perona, J.J.; Karplus, P.A.; Mehl, R.A.; Cooley, R.B. Structures of Methanomethylophilus alvus Pyrrolysine tRNA-Synthetases Support the Need for De Novo Selections When Altering the Substrate Specificity. ACS Chem. Biol. 2022, 17, 3470–3477.
Collaborative Computational Project, No. 4. The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. Sect. D Biol. Crystallogr. 1994, 50, 760–763.
Davis, I.W.; Leaver-Fay, A.; Chen, V.B.; Block, J.N.; Kapral, G.J.; Wang, X.; Murray, L.W.; Arendall, W.B.; Snoeyink, J.; Richardson, J.S.; et al. MolProbity: All-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 2007, 35, 375–383.
Eriani, G.; Delarue, M.; Poch, O.; Gangloff, J.; Moras, D. Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature 1990, 347, 203–206.
Cusack, S.; Berthet-Colominas, C.; Härtlein, M.; Nassar, N.; Leberman, R. A second class of synthetase structure revealed by X-ray analysis of Escherichia coli seryl-tRNA synthetase at 2.5 Å. Nature 1990, 347, 249–255.
Ruff, M.; Krishnaswamy, S.; Boeglin, M.; Poterszman, A.; Mitschler, A.; Podjarny, A.; Rees, B.; Thierry, J.C.; Moras, D. Class II aminoacyl transfer RNA synthetases: Crystal structure of yeast aspartyl-tRNA synthetase complexed with tRNA(Asp). Science 1991, 252, 1682–1689.
Kolb, H.C.; Finn, M.G.; Sharpless, K.B. Click Chemistry: Diverse Chemical Function from a Few Good Reactions. Angew. Chem. Int. Ed. 2001, 40, 2004–2021.
Kabsch, W. XDS. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010, 66, 125–132.
Adams, P.D.; Afonine, P.V.; Bunkóczi, G.; Chen, V.B.; Davis, I.W.; Echols, N.; Headd, J.J.; Hung, L.W.; Kapral, G.J.; Grosse-Kunstleve, R.W.; et al. PHENIX: A comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. Sect. D Biol. Crystallogr. 2010, 66, 213–221.
Emsley, P.; Cowtan, K. Coot: Model-building tools for molecular graphics. Acta Crystallogr. Sect. D Biol. Crystallogr. 2004, 60, 2126–2132.
Kigawa, T.; Yabuki, T.; Matsuda, N.; Matsuda, T.; Tanaka, A.; Yokoyama, S. Preparation of Escherichia coli cell extract for highly productive cell-free protein expression. J. Struct. Funct. Genomics 2004, 5, 63–68.
Seki, E.; Matsuda, N.; Yokoyama, S.; Kigawa, T. Cell-free protein synthesis system from Escherichia coli cells cultured at decreased temperatures improves productivity by decreasing DNA template degradation. Anal. Biochem. 2008, 377, 156–161.
Chumpolkulwong, N.; Hori-Takemoto, C.; Hosaka, T.; Inaoka, T.; Kigawa, T.; Shirouzu, M.; Ochi, K.; Yokoyama, S. Effects of Escherichia coli ribosomal protein S12 mutations on cell-free protein synthesis. Eur. J. Biochem. 2004, 271, 1127–1134.
Thompson, J.D.; Higgins, D.G.; Gibson, T.J. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22, 4673–4680.

Figure 1. Overview of the ISO4-G1 PylRS structure. (a) The ISO4-G1 PylRS dimer. The protomers of the dimer are colored gray-white and blue. The crystallographic 2-fold axis is perpendicular to the paper. (b) The α-helices, 3₁₀ helices, and β-sheets are colored wine red, olive, and green, respectively.

Figure 2. Structural comparisons of ISO4-G1 PylRS with MaPylRS, DhPylSc, and MmPylRSc. (a) The β5-β6 hairpins in the ISO4-G1 PylRS molecules A, B, C, D, E, F, G, H, I, and J are colored green, cyan, magenta, yellow, vermilion, white, lavender, orange, light green, and turquoise blue, respectively. Tyr204 is shown as stick models. (b-e) Surface models of ISO4-G1 PylRS, MaPylRS (PDB: 6JP2), DhPylSc (PDB: 2ZNI), and MmPylRSc (PDB: 2ZIM). (f-h) Superimpositions of the crystal structures of ISO4-G1 PylRS, MaPylRS (PDB: 6JP2), DhPylSc•tRNA^Pyl complex (PDB: 2ZNI), and the pyrrolysyladenylate (Pyl-AMP)-bound MmPylRSc (PDB: 2ZIM), represented by surface models. The β5-β6 hairpins in ISO4-G1 PylRS and MaPylRS and the β7-β8 hairpins in MmPylRSc are colored cyan, yellow, green, and orange, respectively. The catalytic core structures of ISO4-G1 PylRS, MaPylRS, DhPylSc, and MmPylRSc superimposed well.

Figure 3. Comparison of the amino acid binding pocket of ISO4-G1 PylRS with those of MaPylRS and MmPylRSc. (a) Ribbon models of ISO4-G1 PylRS molecule B (brown) superimposed on the MaPylRS (light green) and MmPylRSc (gray) monomers. (b, c) Close-up views of different angles of the ISO4-G1 PylRS active site. The active site residues in the ISO4-G1 PylRS apo form (brown) are superimposed on those of MaPylRS (light green) and Pyl-AMP-bound MmPylRSc (gray). Along with pyrrolysine (Pyl), the active site residues in ISO4-G1 PylRS (Tyr125, Met128, Asn164, Val166, and Tyr204), MaPylRS (Tyr126, Met129, Asn166, Val168, and Tyr206), and MmPylRSc (Tyr306, Leu309, Asn346, and Cys348) are represented as stick models. The β5-β6 and Tyr204 in ISO4-G1 PylRS, the β5-β6 and Tyr206 in MaPylRS, and the β7-β8 and Tyr384 in MmPylRSc are highlighted in cyan, yellow, and orange, respectively.

Figure 4. Open and closed conformations of the β5-β6 (ISO4-G1 PylRSs), β5-β6 (M. alvus PylRSs), and β7-β8 (M. mazei PylRScs) hairpins. Translucent surface models of the active-site pockets in the ISO4-G1 PylRS apo form (brown, a; gold, b), the ISO4-G1 PylRS mutant for cyanopyridylalanine in the apo form (light pink, c) (PDB: 7R6O), the MaPylRS apo form (yellow green, d; grass green, e) (PDB: 6JP2), the acridonylalanine and AMPPNP-bound MaPylRS mutant (light orange, f) (PDB: 8DQG), the M. mazei PylRSc apo form (blue, g) (PDB: 2E3C), and the Pyl-AMP-bound MmPylRSc (gray, h) (PDB: 2ZIM) are shown. The β5-β6 (ISO4-G1 PylRSs), β5-β6 (M. alvus PylRSs), and β7-β8 (M. mazei PylRScs) hairpins are colored differently. Tyr204, Tyr206, and Tyr384 are highlighted in magenta. The bound TCO*Lys in the PylRS active site is shown as a stick model (PDB: AAO).

Figure 5. His225 undergoes drastic conformational changes in accordance with the β5-β6 hairpin. Superimposition of the ISO4-G1 PylRS molecules A to J (a). The superimposed Tyr204, His225, and Trp237 residues are shown as stick models (b). The Tyr204, His225, and Trp237 residues in the ten ISO4-G1 PylRS molecules are each colored green (c, molecule A), cyan (d, molecule B), magenta (e, molecule C), yellow (f, molecule D), vermilion (g, molecule E), white (h, molecule F), lavender (i, molecule G), orange (j, molecule H), light green (k, molecule I), and turquoise blue (l, molecule J).

Scheme 1. Chemical structures of pyrrolysine (Pyl) and the non-canonical amino acids BocLys, PocLys, ZLys, mAzZLys, pAzZLys, pEtZLys, and TCO*Lys.

Figure 6. Cell-free protein synthesis for site-specific incorporation of non-canonical amino acids into the N11-GFPS1 protein by using ISO4-G1 PylRS with the first-layer mutations. The N11-GFPS1 proteins containing non-canonical amino acids were synthesized with the S30 extract from the E. coli RF1 deletion strain B-60ΔA::Z/pMINOR cells. Non-canonical amino acids were site-specifically incorporated into the N11-GFPS1 protein at position 17 in response to the UAG codon, by using the ISO4-G1 PylRS(Y125A/M128L)•tRNA^Pyl and ISO4-G1 PylRS(Y125A/M128A)•tRNA^Pyl pairs. Protein productivities with non-canonical amino acids were compared with that of the cell-free synthesis of wild-type N11-GFPS1 protein containing Ala at position 17 (WT control) and are shown above the bars. (a) The yields of the N11-GFPS1 proteins containing ZLys, mAzZLys, pAzZLys, pEtZLys, and TCO*Lys, estimated by fluorescence. The values represent the means of three independent experiments with standard deviations. (b) Cell-free synthesis of the N11-GFPS1 protein containing pEtZLys by using 10 μM of the M. mazei PylRS(Y306A/Y384F), M. alvus PylRS(Y126A/M129L), and M. alvus PylRS(Y126A/M129L/H227I/Y228P) proteins. The values represent the means of three independent experiments with standard deviations. (c) Cell-free protein synthesis of the N11-GFPS1 protein containing pEtZLys, using increased concentrations (from 10 to 75 μM) of the ISO4-G1 PylRS(Y125A/M128L) protein. The values represent the means of three independent experiments with standard deviations.

Figure 7. Cell-free protein synthesis with non-canonical amino acids using ISO4-G1 PylRS with the H225A mutation. The N11-GFPS1 proteins were synthesized with the S30 extracts from E. coli B-60ΔA::Z/pMINOR cells in the presence of non-canonical amino acids. Non-canonical amino acids were site-specifically incorporated into the N11-GFPS1 protein at position 17 in response to the UAG codon, by using the ISO4-G1 PylRS•tRNA^Pyl and ISO4-G1 PylRS(H225A)•tRNA^Pyl pairs for BocLys and PocLys. The yields of the N11-GFPS1 proteins containing non-canonical amino acids were estimated by fluorescence. Protein productivities with non-canonical amino acids were compared with that of the cell-free synthesis of wild-type N11-GFPS1 protein containing Ala at position 17 (WT control) and are shown on the bars. The values represent the means of three independent experiments with standard deviations.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

1. Introduction