2.1. Molecular Identification and GABA Production
Molecular identification of 119 isolates from the PCA plates incubated at 32ºC proved lactococcal species to be the majority populations in raw milk, accounting for circa 35% of the isolates, including L. lactis (26 isolates), Lactococcus raffinolactis (8 isolates), and Lactococcus garvieae (7 isolates). Similarly, among 117 colonies selected from the GLM17 plates incubated at 42ºC, a majority of isolates were identified as S. thermophilus, accompanied by a few isolates of Staphylococcus epidermidis, Lactobacillus delbrueckii, Streptococcus gallolyticus subsp. macedonicus, and Streptococcus equinus. These isolates accounted for the species withstanding pasteurization and high incubation temperature. Thus, under the specific isolation conditions of this study, L. lactis (21.8%) and S. thermophilus (41.8%) proved to be, respectively, the majority mesophilic and thermophilic LAB species in milk.
Since
L. lactis and
S. thermophilus isolates were recovered from every milk sample, an isolate of each of the species per sample was tested for GABA production in GM17 (for
L. lactis) or GLM17 (for
S. thermophilus) broth supplemented with 5 mM MSG. GABA was detected in the supernatant of a majority of cultures, although tiny amounts were measured in some isolates. Isolates producing <0.64 mM of GABA in broth were considered non-producers. With this threshold, six
L. lactis and eight
S. thermophilus isolates converted most of the MSG precursor into GABA (>4.0 mM; >80% conversion). To avoid the presence of replicates, these 14 isolates were subjected to RAPD and rep-PCR typing. Taking into account the repeatability of the method (93%), the combined fingerprinting results suggested the presence of six different strains for each species (
Figure S1). Among those, four unrelated strains each of
L. lactis (Lc 5.5, Lc 14.4, Lc 19.3, and Lc 21.1) and
S. thermophilus (St 8.1, St 9.1, St 18.1, and St 21.1) were selected for the characterization of relevant technological and safety properties, as well as for genome sequencing and analysis.
2.2. Biochemical and Technological Characterization
The carbohydrate fermentation profile of the strains, as determined by the API-50 CHL system, matched those of their respective species (
Table S1). The
L. lactis profile ranged from 13 to 21 carbohydrates. Lc 21.1 utilized the smallest number of compounds, which is typical of well-adapted-to-milk (“domesticated”) lactococcal strains. The profile of the
S. thermophilus strains included the fermentation of 3-5 carbohydrates, a short but typical sugar-utilization profile for this species.
Species-specific profiles for enzyme activities, assayed using the API-ZYM system, were also scored for the
L. lactis and
S. thermophilus strains (
Table S2). Strains of the two species showed strong leucine aminopeptidase and phosphohydrolase (against naphthol-AS-BI) activities.
L. lactis strains showed strong acid phosphatase activity and moderate cysteine aminopeptidase activity. In general, the
S. thermophilus strains showed strong β-galactosidase activity, moderate valine aminopeptidase and esterase activities, and weak lipase activity. Although with variable enzymatic levels, all
S. thermophilus strains returned a positive urease test.
Two strains of each of the species (Lc 5.5 and Lc 14.4, and St 8.1 and St 18.1) grew well and coagulated the milk by 12 h of incubation (pH 4.43-4.70), forming a firm coagulum of good appearance and pleasant acidic flavor and taste. In contrast, the pH of the milk at this time was ≥6.0 for all other strains, and none of these clotted the milk even after 24 h of incubation. Minor differences in acidification were observed when the milk was supplemented with 10 mM MSG (
Figure 1).
GABA production in supplemented milk was associated with growth (
Figure 1). However, GABA synthesis in
S. thermophilus was retarded compared to that recorded for
L. lactis. At 12 h of incubation, only
L. lactis Lc 5.5. and Lc 14.4 produced significant amounts of GABA, while at 24 h three
L. lactis (Lc 5.5, Lc 14.4, and Lc 19.3) and two
S. thermophilus (St 8.1 and St 18.1) strains did so. In laboratory media, the presence of MSG caused a small increased in the final pH of the cultures (0.23±0.06). However, this effect was not observed in milk. Finally, under the conditions of this study, none of the strains produced GABA in MSG-supplemented fecal homogenates.
The strains produced and consumed different amounts of organic acids and sugars. However, no clear-cut species-specific profiles were observed (
Table 1). In
L. lactis, major differences between the strains were associated with the utilization of citrate. Strains using this compound (Lc 14.4 and Lc 21.1) produced large amounts of acetic acid. In addition, strains Lc 5.5 and Lc 14.4 produced moderate amounts of formic and succinic acids. In
S. thermophilus, profiles for sugar fermentation and the production of organic compounds seemed to be mostly influenced by growth in milk. Strains St 8.1 and St 18.1 grew well and generated large amounts of lactic acid, releasing large quantities of galactose. Strains St 9.1 and St 21.1 did not grow well in milk and, in general, their metabolic activity was rather poor. At the same time, they both released notable quantities of glucose, suggesting premature lysis of the cells.
After growing the strains in milk at either 32ºC (
L. lactis strains) or 42ºC (
S. thermophilus strains), 29 volatile compounds were detected (
Table 2), 15 of which were not present in the starting milk. For the
L. lactis strains, no species-specific profile was observed due to the different behaviour shown by the Lc 21.1 strain; unlike its counterparts it did not produce isovaleric acid, 4-octanone, or phenylacetaldehyde. Further, Lc 5.5 and Lc 14.4 produced 2-phenyl ethanol, isoamyl alcohol, and 5-methyl-2-phenyl-2-hexenal. Lc 14.4 produced acetoin (diacetyl cannot be detected with the solid-phase microextraction (SPME)-gas chromatography method utilized in this study), and large amounts of acetic acid. Lc 5.5 and Lc 19.3 both produced notable amounts of benzoic acid, while Lc 14.4 and Lc 19.3 produced 2,5-dimethyl-3-hexanone and 2,3-heptanodione. In contrast, with only minor differences between the strains, species-specific profiles were seen for the four
S. thermophilus strains. Two species-specific volatile compounds were associated with
S. thermophilus, namely tetradecanoic acid and 2-heptanone.
Under the conditions of the assay, none of the strains produced bacteriocin-like inhibitory substances active against L. sakei CECT906T.
2.4. Genome Sequence and Analysis
To understand the genetic basis of GABA production -and to help ensure their safe use- all eight strains were subjected to genome sequencing and analysis. The acquired genomic data served to confirm the taxonomic identification of the strains via in silico dDDH and orhtoANI analysis. The examined strains were compared with type strains of related species, including
L. lactis subsp.
lactis,
L. lactis subsp.
lactis biovar. diacetylactis, and
L. cremoris subsp.
cremoris, or the
S. thermophilus type strain and the type strains of related species (
Table S4).
These analyses confirmed the previous identification of the strains as either L. lactis or S. thermophilus. Further, L. lactis Lc 21.1. was identified as belonging to biovar. diacetylactis. Although lower than those of Lc 21.1, the dDDH and orhtoANI values indicated Lc 14.4 to be slightly closer to the L. lactis subsp. lactis biovar. diacetylactis type strain (GL2T) than to the L. lactis subsp. lactis type strain (ATCC 19435T).
Table 3 shows the general features of the genome of all the sequenced strains. Genomic analysis of the strains showed they all carried the gene coding for the GAD enzyme (
gadB) and its corresponding associated antiporter (
gadC) organized as an operon-like structure (
Figure 2). The assembly of the genome of the
L. lactis strain Lc 14.4 split the operon into four contigs, impeding the determination of its structure. One contig harboured genes encoding proteins involved in plasmid replication, which strongly suggest that there are two copies of the GABA operon in the genome of this strain, of which one is plasmid-associated. This structure may account for the assembly failure in Lc 14.4 (
Figure 2A). As expected, the order of the gad genes in
L. lactis and
S. thermophilus was reversed, and the open reading frames (ORFs) flanking them were different in the two species (
Figure 2B). The GABA operon in
L. lactis was flanked by chromosomal genes encoding ribonuclease HII (
rnhB) and a potassium efflux system (
kefA). Located at the first position, the operon in
L. lactis also comprised the regulatory gene (
gadR) (
Figure 2A).
The
S. thermophilus operon also appeared to be encoded on the chromosome, and in the same position in all strains. However, the operon in this species lacked the gene encoding the regulator and was flanked by sequences coding for insertion elements and mobilization proteins (
Figure 2B), suggesting a (recent) acquisition by horizontal transfer.
The genetic makeup for lactose and galactose utilization included clusters for lactose utilization via the Leloir pathway in both the
L. lactis and
S. thermophilus strains, and an operon for lactose utilization via the tagatose-6-phosphate pathway in the genome of all lactococcal strains (
Figure S2). These pathways are well characterized in strains of many LAB species, including
L. lactis and
S. thermophilus [
25].
Genome analysis of the citrate-associated genes in the two
L. lactis strains utilizing citrate (Lc 14.4 and Lc 21.1) revealed striking differences (
Figure S3). Three operons, each located in a different contig, were identified in Lc 21.1. Two operons were considered to be located on the bacterial chromosome, encoding the citrate lyase complex and the malolactic enzyme. Each of the operons harbored a gene encoding a citrate/H+ symporter-like permease, which contained mutations that might render the enzymes inactive. The third operon was found in a contig together with genes encoding plasmid-replication proteins, strongly suggesting a plasmid location. This operon also contained an ORF encoding the well-described citrate-specific transporter CitP. In contrast, only the chromosomal operons encoding citrate lyase and malolactic enzyme were identified in the genome of the Lc 14.4 strain. Surprisingly, the genes coding for the corresponding associated permeases were complete and contained no mutations. Genes encoding citrate synthase and isocitrate dehydrogenase were found in the genome of all
L. lactis strains; these enzymes might intervene in the synthesis of citrate.
The analyzed genomes showed a complex repertoire of metabolic genes involved in the growth of the strains in milk and the formation of taste and aroma compounds (
Tables S5 and S6). With only minor differences between strains, gene content species-specific profiles were established. Nonetheless, these minor differences might be critical for rapid development and/or for the synthesis of key flavor compounds in milk. In addition to the genetic machinery involved in diacetyl production from citrate, genes coding for the
L. lactis and
S. thermophilus caseinolytic proteinases (PrtP and PrtS, respectively) were identified in most strains, but not in Lc 21.1, St 9.1, and St 18.1. High cell densities and milk coagulation after overnight culture were only attained by the strains harboring either
prtP or
prtS genes. As expected from the biochemical characterization results, a complete urease operon consisting of three catalytic enzymes and seven accessory genes was found in all four
S. thermophilus strains (
Figure S4).
As expected, no CRISPR-Cas-associated loci were identified in the genome of the
L. lactis strains; these systems are uncommon in this species. In agreement, a large proportion of phage-associated genes was identified in the genome of all
L. lactis strains (
Table 3). The Phaster software associated these genes with either complete (from one to three) or incomplete (from 4 to 13) phages. In contrast, all four
S. thermophilus strains possessed three loci showing both CRISPR repeats and genes coding for Cas proteins. A short series of isolated CRISPR repeats was also found in all strains except in St 8.1. Only two loci showing an incomplete set of genes of phage origin were detected in all
S. thermophilus strains. In addition, a complete phage of 43.9 kbp was identified in strain St 21.1. PlasmidFinder identified two plasmid-derived sequences in St 9.1, while one plasmid-replication protein was detected by BV-BRC in St 8.1, St 9.1, and St 18.1 (
Table 3). PlasmidFinder has been reported not to be reliable for the detection of plasmids in LAB species [
26]. Despite this, the software identified plasmid sequences in all
L. lactis strains, which agrees well with the large number of genes encoding putative plasmid-associated proteins detected by BV-BRC in all lactococcal genomes (
Table 3).
Despite the absence of antimicrobial activity in the agar well-diffusion assay, several loci were identified by BAGEL 4 as putative regions encoding bacteriocin-like peptides (Blp) or ribosomally synthesized and post-translationally modified peptides (RiPPs) (
Table 3). Besides several lactococcins, Lc 14.4 encoded a protein with homology to enterolysin A (53% amino acid identity). Three lactococcal strains showed a locus for the synthesis of sactipeptides (sulfur-to-α-carbon-containing peptides; also classified as Class Ic bacteriocins). Finally, a locus for the synthesis of linaridin (a class of linear, dehydrated peptides) was detected in the genome of Lc 19.3. For the
S. thermophilus strains, loci involved in the synthesis of several Blp antimicrobials (Class II bacteriocins with a double-glycine leader peptide) were identified. In addition, they all contained a locus for the synthesis of streptides (a class of 20-membered cyclic peptides). Finally, St 8.1 showed a locus with the capability to encode a bacteriocin homologous to the BhtR bacteriocin (a Class I two-component lantibiotic) identified in
Streptococcus ratti [
27].
Interrogation of the genomes with the CARD database for antibiotic resistance genes detected two strict hits in the genome of all four
S. thermophilus strains with low homology (<33% identity) to the deduced proteins for the
vanT and
vanY genes in
Enterococcus spp. These genes were annotated by BV-BRC as alanine racemase and D-alanyl-D-alanine carboxypeptidase, respectively. Neither genes nor mutations involved in antibiotic resistance were detected in the
S. thermophilus genomes by ResFinder. The D-alanyl-D-alanine carboxypeptidase gene was also detected as a strict hit by CARD in all
L. lactis strains. In addition, this database identified the
lmrD gene in all strains as a perfect hit (100% identity, 100% length coverage).
lmrD encodes a subunit of the well-characterized LmrCD heterodimeric efflux ABC transporter, which is involved in multidrug resistance in
L. lactis [
28].
Glutamate decarboxylase was the only amino acid carboxylase encoded in the genome of any strain. A few genes encoding housekeeping enzymes, multidrug efflux pumps, and proteins involved in heavy metal homeostasis and resistance, were found in the genomes of the strains on the Rast Server, except for a gene encoding a serine protease with partial similarity to the exfoliative toxin A from Staphylococcus aureus present in all L. lactis and S. thermophilus strains, no genes encoding toxins were identified.