2.1. Strains, Media and Growth Conditions
The yeast strains and plasmids used in this study are listed in
Table 1. The
Escherichia coli DH5α strain (F’/ endA1hsdR17 (rK-mK+) glnV44 thi-1 recA1 gyrA (NaIr) relA1Δ (lacZYA-argF) U169 deoR Φ80dlac Δ(LacZ) M15) [
49], was used for cloning and plasmid propagation, and was grown in Luria broth containing 1% tryptone, 0.5% yeast extract, 0.5% sodium chloride, pH 7.0, and 100 mg/L ampicillin when required (Sigma-Aldrich Brazil Ltda., São Paulo, SP, Brazil).
Yeasts were grown in rich YP medium (1% yeast extract, 2% Bacto peptone, Sigma-Aldrich), or in synthetic complete (YNB) medium (0.67% yeast nitrogen base without amino acids, supplemented with 1.92 g/L of yeast synthetic Drop-out media without uracil, or 1.82 g/L without uracil and tryptophan, Sigma-Aldrich), with 20 g/L glucose or cellobiose as carbon source. The pH of the medium was adjusted to pH 5.0 with HCl, and when required, 2% Bacto agar (Sigma-Aldrich) or 0.1 g/L nourseothricin (cloNAT, WERNER BioAgents GmbH, Jena, Germany) were added to the medium. The laboratory strains transformed with plasmids were pre-grown in YNB with glucose as carbon source, and used to inoculate new YNB medium containing 20 g/L cellobiose with an initial cell concentration of 0.1 optical density at 600 nm (A600nm), measured with a Cary 60 UV-VIS spectrophotometer (Agilent Technologies, Santa Clara, CA, USA). Growth was performed aerobically in cotton plugged Erlenmeyer flasks filled to 1/5 of the volume with medium at 28°C with 160 rpm orbital shaking. Cellular growth was followed by absorbance measurements at 600 nm (A600nm). For batch fermentations, cells were collected in the exponential phase of growth, centrifuged at 6,000 g for 5 min at 4°C, and washed twice with sterile water, and inoculated at a high cell density (~10 g dry cell weight/L) into a 25 mL flask containing 20 mL of rich YP medium containing 20 g/L cellobiose at 30°C, with shaking at 100 rpm. For the industrial strains, fermentations were also performed in rich YP medium containing 20 g/L glucose or cellobiose, or 20 g/L cellobiose plus 20 g/L xylose. Culture samples were harvested regularly, centrifuged (5,000 g, 1 min at 40C), and supernatants used for the quantification of substrates and fermentation products.
2.2. Molecular Biology Techniques
Standard procedures for DNA manipulation and analysis, as well as bacterial and yeast transformation, were employed [
59,
60]. Purification of plasmids, PCR products or Gibson Assembly products was performed using the QIAquick PCR Purification Kit (QIAGEN Antwerp, Belgium). The genomic DNA from the yeast strains was purified using a YeaStar Genomic DNA kit (Zymo Research, Irvine, CA, USA). For plasmid extraction, we used the manual mini-prep method [
60], or using the commercial QIAprep Spin Miniprep Kit (QIAGEN). DNA fragments for cloning, sequencing, or transformations were PCR-amplified using Phusion High-Fidelity (Thermo Fisher Scientific Inc., Waltham, MA, USA) or PrimeSTAR® GXL DNA polymerases (Takara Bio Europe SAS, Saint-Germain-en-Laye, France). Purified plasmids, products of Gibson Assembly, or cells lysed by incubation at 100°C in 20 mM NaOH for 10 min (colony PCR) served as DNA templates.
Based on the genome of
C. tropicalis and
M. guilliermondii [
61],
Sc. stipitis [
62] and
Sp. passalidarum [
63], primers were designed (
Table 2) to amplify genes encoding a β-glucosidase from
Sp. passalidarum (
SpBGL7, NCBI Gene ID: 18871961, primers SpBGL7-F and SpBGL7-R), the cellobiose transporters
HXT2.4 from
Sc. illinoinensis (the
HXT2.4 gene from
Sc. stipitis has NCBI Gene ID: 4850978, primers HXT2.4-F and HXT2.4-R),
CtCBT1 from
C. tropicalis (NCBI Gene ID: 8298855, primers CtCBT1-F and CtCBT1-R), and
MgCBT2 from
M. guilliermondii (NCBI Gene ID: 5129179, primers MgCBT2-F and MgCBT2-R), introducing restriction sites (for
BamHI,
EcoRI,
HindIII,
SalI or
XhoI enzymes) for cloning into multicopy shuttle vectors containing strong and constitutive promoters and terminators (pGPD-424 and pGPD-426,
Table 1) as well as the
TRP1 and
URA3 genes used as selective marker.
To clone and overexpress modified versions of the
MgCBT2 transporter, we amplified the
MgCBT2 gene from the pGPD-426-MgCBT2 plasmid (
Table 1) using specific primer pairs: MgCBT2ΔC-F and MgCBT2ΔC-R (
Table 2) for generating a version of the gene that encodes a truncated permease in the C-terminal region (pGPD-426-MgCBT2ΔC plasmid,
Table 1), and MgCBT2ΔNΔC-F and MgCBT2ΔC-R (
Table 2) for generating a version that encodes a truncated permease in both N- and C-terminal regions (pGPD-426-MgCBT2ΔNΔC plasmid,
Table 1). These primers ensured the retention of the ATG codon for the initial methionine and the TGA stop codon for protein synthesis termination. The gene encoding the transporter truncated in the C-terminal lacked base pairs 4 to 60, resulting in a protein lacking the first 19 amino acid residues after the initial methionine. The gene encoding the transporter truncated in both N- and C-terminal regions, in addition to having the same modification described above, also lacked the last 36 coding base pairs, resulting in a protein lacking the last 12 amino acid residues in addition to the first 19 after the initial methionine.
The pV1382 plasmid (
Table 1) served as the platform for expressing the CRISPR-Cas9 system in
S. cerevisiae [
58]. The ARS208 and ARS1309 loci were chosen for integrating the overexpression modules of the
SpBGL7 and
MgCBT2 genes, respectively, based on the research by Reider Apel and colleagues [
64]. After sequencing both regions of interest in the MP-C5H1 strain genome (
Table 1), we identified 20 bp segments to serve as the crRNA recognition sites. These segments were required to be followed by a protospacer adjacent motif (PAM) sequence recognized by the CRISPR-Cas9 system ("NGG" in this case). For the ARS208 site, the selected sequence was "GTCCGCTAAACAAAAGATCT", followed by the PAM sequence "TGG", located approximately 325 base pairs upstream of the ARS208 locus. For the ARS1309 site, the chosen sequence was "CCTGTGGTGACTACGTATCC", followed by the PAM sequence "AGG", situated approximately 180 base pairs upstream of the ARS1309 locus.
Each DNA fragment responsible for crRNA transcription, specific to the sequences mentioned above, was cloned into pV1382 as described [
58]. The vector pV1382 was treated with enzyme
BsmBI (New England Biolabs, Leiden, The Netherlands), and the 5' ends of each DNA strand of the linearized plasmid were dephosphorylated using alkaline phosphatase (Quick CIP, New England Biolabs), followed by purification. The digested plasmid and the specific pair of oligonucleotides (sgRNA.ARS1309-F and sgRNA.ARS1309-R,
Table 2) to target the CRISPR-Cas9 system to the ARS1309 locus, and sgRNA.ARS208-F and sgRNA.ARS208-R primers (
Table 2) for the ARS208 locus, were incubated at 15°C for 16 hours in the presence of T4 DNA ligase (Thermo Fisher). The resulting plasmids were sequenced using the seq.p1382.sgRNA-F and seq.p1382.sgRNA-R primers (
Table 2) to verify the correct insertion of the DNA fragments, yielding plasmids pV1382-ARS1309 and pV1382-ARS208 (
Table 1).
To assemble the PCR-amplified DNA fragments for constructing the repair and gene overexpression modules, we utilized Gibson Assembly® with the NEBuilder® HiFi DNA Assembly kit (New England Biolabs). Each PCR reaction employed a pair of primers (primer sequences can be provided upon request) designed with at least 20 base pairs at the 3' end that anneal to the beginning or end of the intended amplification region, and at least 20 nucleotides at the 5' end identical to the adjacent end of the DNA portion in the other DNA molecule intended for joining. For constructing the repair fragment for inserting the gene encoding each version of the
MgCBT2 transporter, three initial fragments were joined: (I) a 515 bp DNA fragment identical to the region upstream of the cleavage site of the ARS1309 locus (5’ARS1309, obtained via colony PCR from the industrial strain MP-C5H1,
Table 1); (II) a DNA fragment containing the P
TDH3 promoter, the desired
MgCBT2 version, and the T
CYC1 terminator (obtained via PCR using pGPD-426-MgCBT2, pGPD-426-MgCBT2ΔC, or pGPD-426-MgCBT2ΔNΔC plasmids as templates,
Table 1); and (III) a 624 bp DNA fragment identical to the region downstream of the cleavage site of the ARS1309 locus (3’ARS1309, obtained via colony PCR from the industrial strain MP-C5H1).
For constructing the repair fragment containing the
SpBGL7 overexpression module, circular construction was chosen due to the low efficiency of Gibson Assembly in forming linear molecules from more than three distinct fragments. Circularization was achieved using the pMV vector (
Table 1). The repair fragment was constructed by joining five distinct initial fragments: (I) a 695 bp DNA fragment identical to the region upstream of the cleavage site of the ARS208 locus (5’ARS208, obtained via colony PCR from strain MP-C5H1); (II) a DNA fragment containing 608 bp immediately upstream of the coding region of the
TEF1 gene, corresponding to its promoter region (P
TEF1, obtained via colony PCR from strain MP-C5H1); (III) a DNA fragment containing the
SpBGL7 gene (obtained via PCR using pGPD-424-SpBGL2 plasmid as template); (IV) a DNA fragment containing the 428 bp immediately downstream of the coding region of the
PGK1 gene, corresponding to its terminator region (T
PGK1, obtained via colony PCR from strain MP-C5H1); and (V) a 742 bp DNA fragment identical to the region downstream of the cleavage site of the ARS208 locus (3’ARS208, obtained via colony PCR from strain MP-C5H1). We changed the constitutive promoter (P
TEF1) and terminator (T
PGK1) controlling the
SpBGL7 gene to avoid any chromosomal instability with the promoter (P
TDH3) and terminator (T
CYC1) used for the
MgCBT2 permeases. Each purified PCR fragment was incubated at 50°C for 60 minutes in the presence of NEBuilder® HiFi DNA Assembly reagent. After incubation, the resulting plasmid (pMV-SpBGL7,
Table 1) was transformed into
E. coli DH5α.
For the insertion of the
SpBGL7 repair/overexpression module into the genome of the MP-C5H1 strain, transformations were performed using 300 ng of the purified pV1382-ARS208 plasmid and 10 µg of the repair/overexpression module amplified using primers ARS208-F and ARS208-R (
Table 2) and the pMV-SpBGL7 plasmid as template (
Table 1). Transformants were selected in YP-20 g/L glucose plates containing 0.1 g/L nourseothricin. Flipout of the pV1382-ARS208 plasmid was performed by overnight growth (twice) in nonselective liquid YP-20 g/L glucose medium. Drug-sensitive colonies, which had lost the plasmid, were identified by plating for single colonies on nonselective media and subsequent identification by replica plating to selective media. The correct insertion of the
SpBGL7 module at ARS208 in the MP-B7 strain (
Table 1) was confirmed by sequencing. The same concentrations of the pV1382-ARS1309 plasmid and the repair/overexpression modules containing the different
MgCBT2 transporters (
MgCBT2,
MgCBT2ΔC or
MgCBT2ΔNΔC), produced by Gibson Assembly, were used to transform strain MP-B7, yielding strains MP-B7-CBT2, MP-B7-CBT2ΔC and MP-B7-CBT2ΔNΔC, respectively (
Table 1). All insertions at the ARS1309 locus were confirmed by sequencing.
2.3. Enzymatic and Transport Activity Assays
The hydrolysis of
p-nitrophenyl-β-D-glucopyranoside (
pNPβG), cellobiose, or
p-nitrophenyl-β-D-xylopyranoside (
pNPβX) was determined using permeabilized yeast cells [
65]. Approximately 50 μL of permeabilized cell suspension (at concentrations ranging from approximately 0.1 to 0.4 g/L) were added to 450 μL of 100 mM MOPS-NaOH, pH 6.8 buffer containing the desired amount of substrate, and incubated at 30°C for 10 minutes. The reaction was stopped by placing the tubes at 100°C for 3 minutes. Pre-boiled cells for 3 minutes were used as controls. We used final concentrations ranging from 0.05 to 10 mM of
pNPβG and
pNPβX, or 1 to 80 mM cellobiose as substrates. After the reaction, cells were centrifuged at 2600x
g for 5 minutes, and the supernatant from assays using
pNPβG and
pNPβX were used to determine the enzymatic activity by measuring the concentration of
p-nitrophenol released by substrate hydrolysis, at an absorbance of 400 nm. To determine cellobiose hydrolysis, the supernatant from the assays was used to measure the concentration of glucose formed using a commercial glucose oxidase-peroxidase kit (Glicose Pap Liquiform Labtest, Centerlab, Belo Horizonte, MG, Brazil). Activities are expressed as nmol of
p-nitrophenol or glucose produced by (mg dry cell weight)
-1 min
-1. The values of
Km and
Vmax were determined through nonlinear regression applied to the Michaelis-Menten kinetic model using the GraphPad Prism v. 8.0 software (GraphPad Software, Boston, MA, USA).
The transport assays followed a colorimetric method originally developed for determination of α-glucoside (
p-nitrophenyl-α-D-glucopyranoside) transport by yeast maltose permeases [
66]. Cells were harvested from liquid culture, washed twice with chilled (4°C) sterile distilled water, and resuspended in 50 mM succinate-Tris pH 5.0 buffer to achieve a cell concentration of approximately 30 g/L. Aliquots of 50 μL of this cell suspension were transferred to Eppendorf tubes, and a volume of 50 μL of 10 mM
pNPβG or
pNPβX in the same buffer was added. The cells were incubated at 30°C for 10 minutes, during which the internalized substrate underwent hydrolysis due to the activity of the intracellular β-glucosidase. The reaction was stopped by incubating the tubes at 100°C for 3 minutes. Subsequently, 200 μL of 2 M NaHCO
3 was added. Pre-boiled cells for 3 minutes were used as negative controls. Cells were centrifuged at 2600x
g for 5 minutes, and the supernatant was used to determine the concentration of
p-nitrophenol produced, measured by absorbance at 400 nm. The transport activities are expressed as nmol of
pNPβG or
pNPβX transported (
p-nitrophenol produced) by (mg dry cell weight)
-1 min
-1.