2.2. Functions of uS2 beyond the Ribosome
uS2 is a highly conserved r-protein essential for all organisms from bacteria to human, although its exact functions as a ribosomal component remain incompletely understood. It has been suggested that prokaryotic uS2 might be involved in stabilizing the Shine-Dalgarno (SD) helix docked in a chamber between the head and the platform [
78], as well as in protecting the SD duplex at the early post-initiation step [
79]. However, this does not explain the vital function of uS2 in organisms that do not exploit the SD interactions in translation initiation. uS2 is one of the latest components in the 30S assembly [
80]. In
E. coli and most likely in other Gram-negative bacteria, its association with the 30S particle is indispensable for binding bS1 that accomplishes the assembly of the 30S subunit fully competent in recruiting mRNA [
35,
36]. Within the ribosome, uS2 is located on the back of the 30S subunit at the hinge between the head and body. Possessing an elongated bidomain structure, uS2 forms direct contacts with several 16S rRNA helices,
viz h35-h37 in the head via the coiled-coil α
2 domain and h26 in the body via the large globular domain [
81].
uS2 is encoded by the first gene of the
rpsB-tsf operon that also comprises a gene for the elongation factor Ts (
Figure 1). When synthesized in excess over the 30S ribosome, uS2 acts as a translational autogenous repressor of the
rpsB-
tsf mRNA [
82,
83]. As a repressor, uS2 recognizes and binds the unique structural features within the 5’UTR of the mRNA, inhibiting its own translation directly. This repression interrupts transcription-translation coupling in the operon, thereby decreasing the level of the bicistronic
rpsB-tsf mRNA and hence the level of the essential Ts; that’s why expression of uS2 from a plasmid significantly slows down the growth rate.
The mRNA structural features recognized by uS2 are highly conserved at least across γ-proteobacteria [
83,
84]. The mechanistic details of the autoregulation remain unclear as the ribosome binding site (RBS) including the SD sequence and the start codon is not involved in the operator structure, and a small deletion of the conserved bulge far upstream RBS may eliminate the uS2-mediated regulation [
82,
83]. To act as an autogenous repressor efficiently, uS2 needs a companion, bS1, with which uS2 forms a complex not only on but also outside the ribosome [
82], indicating an intimate relationship between these two r-proteins. Moreover, moderate overexpression of bS1 from a plasmid can suppress the thermosensitive phenotype of one of the
rpsB mutant,
rpsB1ts, allowing its growth at the elevated temperature otherwise lethal to the strain [
85]. Thus, there are several distinct features of the uS2-mediated autogenous regulation: (i) uS2 as an autogenous repressor is not a primary RNA binding protein but binds to the 30S subunit at the late step of the assembly; (ii) to serve as a repressor efficiently, uS2 needs an assistance of bS1; (iii) the
rpsB operator site bears no visible similarity to the regions on 16S rRNA bound by uS2 within the ribosome. However, more sophisticated analysis of the
rpsB mRNA regulatory structure (in-cell PAIR-MaP analysis) has revealed a common architecture of the uS2 binding sites on 16S rRNA and
rpsB mRNA at least in enterobacteria [
86].
Besides its role as an autogenous repressor, uS2 may have other moonlighting activities in pathogenic bacteria, although the underlying mechanisms remain unclear and await further studies. Recently, RpsB (uS2) has been shown to be a surface-exposed protein of rickettsia, representing an important ligand and adhesin of these obligate intracellular microorganisms [
87]. Elevated expression of peptides derived from uS2 has been observed in the highly virulent strains of
Streptococcus suis cultured under the host-simulated conditions, indicating that uS2 or its peptides might serve as specific virulence factors [
88]. The role of uS2 in pathogenesis is obviously not related to its functions in the ribosome. Potential functioning of uS2 beyond the ribosome has also been proposed for a harmful human pathogen
M. tuberculosis (
Mtb).
Mtb RpsB has been identified not only in cytosolic but partially in cell wall fractions, while its counterpart from non-pathogenic
M. smegmatis (
Msm) localizes only in cytoplasm. Moreover,
Mtb RpsB ectopically expressed in
Msm has been also found to associate with cell membrane/wall.
Msm cells expressing
Mtb RpsB in trans show reduced cell wall permeability and increased tolerance to drugs, oxidative stress, SDS and starvation. An ability to impart stress resilience to mycobacteria can be ascribed to the unique C-terminal sequence of
Mtb RpsB which is absent from RpsB of non-tuberculosis mycobacteria, and a deletion of the C-terminal extra-fragment deprives
Mtb S2 of its influence on the cell resistance to stresses [
89].
2.3. uS4, an Essential r-Protein Functioning in Ribosome Biogenesis, Translation, and Transcription
An essential r-protein uS4 is a primary protein in the 30S ribosomal subunit biogenesis, which nucleates assembly by binding to a five-way helix junction in the 16S rRNA 5’ domain. It is believed that early uS4-16S rRNA interactions guide rRNA folding and impact later steps of the 30S assembly [
90,
91,
92]. Surprisingly, additional r-proteins, particularly a late-binding protein uS12, may accelerate proper binding of uS4 during rRNA transcription by acting on the nascent rRNA as an RNA-chaperonin [
93]. Within the ribosome
, uS4 is involved in multiple functions including mRNA decoding, and mutations in uS4 have impact on translation fidelity [
94,
95]. uS4 along with uS3 and uS5 are located at the mRNA entry site between the head and the shoulder of the 30S subunit and endow the ribosome with a helicase activity necessary to disrupt downstream helices in mRNA as the narrow mRNA channel can accommodate only unpaired mRNA segments [
96].
In addition to its role in ribosome biogenesis and formation of the mRNA entry site, uS4 possesses moonlighting activities as a regulator of both translation and transcription. First, it is a regulatory protein in the post-transcriptional control of the α-operon that in
E. coli comprises genes for four r-proteins and the α-subunit of RNA polymerase (uS13, uS11, uS4, RpoA, bL17, in this order, see
Figure 1). Autogenous repression of the
E. coli α-operon mRNA translation by uS4 has been thoroughly studied for years [
97,
98,
99,
100,
101]. Interaction of uS4 with the target site in the α-operon mRNA results in translational repression of not only the proximal three cistrons for uS13, uS11 and uS4, but also of the last cistron encoding bL17, without affecting the intervening α-cistron that is regulated independently. A presumable second binding site for uS4 on the α mRNA in front of
rplQ has been proposed, suggesting that uS4 may repress bL17 translation directly [
102], but strong evidence for this has not been provided.
The operator site for the uS4-repressor on
E. coli α mRNA forms a complex pseudoknot structure comprising the ribosome binding site of the first cistron,
rpsM [
99,
100,
101]. The uS4 interaction with a pseudoknot traps mRNA in a conformation that allows binding of the 30S subunit but prevents formation of the active initiation complex with the initiator tRNA, thus blocking translation. Such a mechanism for the translation inhibition was named “entrapment” to underline the absence of the competition between the repressor and the ribosome for mRNA binding [
100,
101].
In
B. subtilus (Bsu), as well as in other species of the class Bacilli, the
rpsD gene does not belong to the cluster of genes encoding uS13, uS11, α, bL17; it is situated in a separate region of the chromosome, while the other genes are kept in the same order. At the same time,
Bsu S4 binds to the untranslated leader of the
rpsD mRNA and represses its own translation. The regulatory region does not form a pseudoknot hence the regulation is essentially different from that in
E. coli [
103,
104]. How r-protein genes within the α operon in Bacilli are regulated remains unknown. Interestingly, the members of the class Clostridia, in contrast with the class Bacilli, keep the
rpsD gene in the α operon (NCBI, Gene database). Although the
E. coli-like gene order of the α operon containing
rpsD is widely distributed in bacteria, there are many lineages where
rpsD is separated. Thus, β-, γ- and ε-proteobacteria bear
E. coli-like α operons, while in α-proteobacterial species,
rpsD is located distantly (NCBI, Gene database). The reason for this diversity remains an open question.
Another well-known moonlighting activity of uS4 is its role in transcription as a general antitermination factor with properties very similar to NusA [
105]. uS4 associates with RNA polymerase (RNAP) in vivo and inhibits the premature termination on the rRNA operons. The antitermination activity of uS4 is specific for Rho-dependent terminators. Thus, uS4 together with uS10 (NusE, see below) are important components of the
rrn antitermination system involved in ribosome biogenesis. The antitermination complex forms in response to cis-acting elements (boxB, boxA and boxC) in the nascent pre-rRNA. Recent studies [
106,
107] show that Nus-factors (A, B, E, G), SuhB (the inositol mono-phosphatase) and uS4 assemble on RNAP into a capped ring around the RNA-exit channel, where uS4 serves as a flexible lid. Such a bulky protein structure may block an approach of the termination factor Rho to RNAP. Moreover, Nus-factors, SuhB and uS4 together support the co-transcriptional rRNA folding by acting as an RNAP-associated RNA chaperone according to the well-known RNA- chaperone molecular principles [
106,
107]. The structure of the rRNA-specific antitermination complex is represented on
Figure 3.
Finally, it has been recently found that uS4 (along with TufA and GacA) can be cross-linked in vivo in the stationary phase to a “mysterious” PA2504 protein from
Pseudomonas aeruginosa. It was supposed that PA2504 might block the biological functions of these proteins to fine-tune cellular response to stationary phase-dependent nutrient starvation [
108]. It is currently unknown which of the uS4 functions is blocked by PA2504.
2.4. Ribosomal Proteins bS6 and bS18 Act in Tandem
Proteins bS6 and bS18 should be considered in tandem because they function as a heterodimer both in the ribosome assembly and in regulating expression of their own operon
rpsF (S6)-
priB-
rpsR(S18)-
rplI (L9). In β- and γ
-proteobacteria this operon includes non-ribosomal gene
priB that encodes the primosomal n protein necessary for replication restart, while in certain phyla
priB is absent from the operon (α-proteobacteria, Bacteroidetes/Chlorobi). Actinobacteria have
ssb instead of
priB as well as Firmicutes which, in addition, lack
rplI at the end of the operon. Despite these differences, in most taxonomic groups
rpsF and
rpsR are expressed from the same transcription unit and, most likely, are regulated jointly [
109]. bS6 and bS18 are secondary binding proteins in the assembly of the central domain of a 30S subunit, which is nucleated by binding of the primary uS15 protein to a highly conserved 16S rRNA region. However, there is no protein-protein contacts between uS15 and bS6-bS18, and the rRNA site bound by bS18 within a heterodimer is formed due to the uS15-mediated structure remodeling [
110].
Relatively recently, it has been discovered that bS6-bS18 proteins regulate expression of their own operon at the translation level by binding to the 5’ UTR upstream of the
rpsF start [
109,
111,
112]. Initially, a high phylogenetic conservation of a presumable regulatory region has been computationally predicted, and it has been demonstrated that an S6-S18 complex indeed binds to this RNA fragment from
E. coli in vitro. A putative RNA operator bears a conserved CCG sequence in a bulge flanked by a stem and a hairpin, which is analogous to the structural context of the 16S rRNA binding site for bS6-bS18, thus suggesting the molecular basis for the autoregulatory mechanism [
109]. Further, wide distribution across many bacterial phyla of the structural RNA motif in front of
rpsF has been also described by Meyer’s group, and a direct interaction of an S6-S18 complex with the RNA motif from
B. subtilis has been confirmed [
111]. Finally, in vivo reporter experiments in
E. coli have demonstrated that an S6-S18 complex indeed functions as an autogenous repressor to regulate expression of the operon by binding to the regulatory site preceding
rpsF, with the bS18-mRNA interaction being crucial for the translation inhibition [
112].
Interestingly, bS6 in
E. coli is modified by ATP-dependent glutamate ligase RimK that can add up to four glutamate residues to the C-terminus of the protein. Oligoglutamylation of bS6 by RimK occurs only in the stationary phase [
113]. It is yet difficult to ascribe any reasonable role for such a modification in
E. coli. The same modification of bS6 by RimK has been studied in a soil bacterium
Pseudomonas fluorescence [
114,
115] where it is able to change expression of a set of genes encoding surface attachment factors, amino acid transporters and secreted molecules. However, the mechanistic details of the impact of a single r-protein modification on gene expression and proteomic changes are not yet understood.
2.5. A Key Primary Assembly r-Protein uS7 Is Bifunctional
Ribosomal protein uS7 is a key primary protein in the 30S subunit assembly [
80]. Its interaction with 16S rRNA initiates folding of the 3’-major domain and further formation of the 30S head where it faces the decoding center. Like uS4, uS7 initially makes numerous short-lived contacts with the 3′ domain of the 16S rRNA, but the stable incorporation of uS7 is promoted by the secondary r-proteins uS9, uS13, and uS19 which act as chaperones to provide the correct folding of rRNA helices [
92]. Besides the interaction with 16S rRNA, uS7 forms contacts with uS9 and uS11 within the ribosome [
81].
uS7 stably associates with the trigger factor chaperone (TF) in vivo in
E. coli, in
Thermotoga maritima [116] and likely in other bacteria [117]. TF in the TF:S7 complex masks 16S RNA binding sites on uS7, and uS7 within the complex is more stable than free uS7 in solution. It was suggested that by providing correct folding of r-proteins (e.g. uS7) TF might act as a ribosome assembly factor [
117].
The only moonlighting activity of uS7 described so far is its ability to serve as an autogenous translational repressor of the
str operon. If the uS7 synthesis in a cell exceeds that of 16S rRNA, the same RNA-binding determinants which provide uS7 binding to 16S rRNA participate in binding of uS7 to its own str mRNA [
118,
119]. The
str operon encode r-proteins uS12, uS7, and translation elongation factors EF-G (
fus) and EF-Tu (
tufA), in this order (
Figure 1). To inhibit translation of the
str mRNA, uS7 binds to the intercistronic region preceding its own cistron [
118,
119,
120,
121]. Interestingly, uS7 acts as a translational repressor in vivo only in the presence of the intact
rpsL (uS12) cistron but does not repress independent
rpsG (S7) translation, indicating that the coupled
rpsL-rpsG translation is indispensable to achieve autogenous repression [
120]. Both targets, 16S rRNA and
str mRNA, bear similar sites recognized by uS7 [
119,
121]. Overexpression of uS7 from a plasmid inhibits bacterial growth due to translational repression of the essential
fus gene (EF-G) whose translation is coupled with
rpsG [
119]. At the same time, expression of the last gene in the
str operon,
tufA, that encodes EF-Tu, is not noticeably inhibited because of the presence of two additional promoters within the
fus gene (
Figure 1). The first cistron,
rpsL, is regulated by the “retroregulation” mechanism based on destabilization of the corresponding mRNA region by the repressor binding [
120].
A similar mechanism likely regulates the
str operon in cyanobacteria where the intercistronic region separated
rpsL and
rpsG bears structural similarity with the S7-binding region of 16S rRNA [
122]. It is a pity that uS7-mediated regulation has not been studied in other bacterial phyla, as it has been reported that while extended distances between the uS12 and uS7 cistrons exist in many species, the mRNA structure observed in
E. coli is not obviously conserved [
84].
2.6. uS8 Regulates the Longest spc Operon
uS8 is an important rRNA-binding protein that occupies a central position within a 30S subunit. It interacts with 16S rRNA specifically binding the helix h21 and is crucial for the correct folding of the central domain of 16S rRNA [
81]. The binding of uS8 to 16S rRNA has been extensively characterized using a variety of techniques. A minimal 16S rRNA fragment located in helix 21 was shown to be sufficient to confer specificity and high affinity to the S8-rRNA interaction [
123].
Like uS7, uS8 is bifunctional. It serves as an autogenous repressor controlling translation of the
spc mRNA [
124,
125,
126,
127]. The
spc operon in
E. coli is the longest r-protein operon encoding uL14, uL24, uL5, uS14, uS8, uL6, uL18, uS5, uL30 and uL15, and, in addition, comprising the
secY gene encoding a component of the protein export machinery, and
rpmJ, a gene for a small r-protein bL36 (
Figure 1). The regulatory mechanism is analogous to the repression of the
str operon by uS7 (see above). The repressor uS8 binds not upstream of the first cistron but at the initiation region of the third cistron,
rplE, encoding uL5. This binding directly blocks translation of
rplE, while translation of the downstream cistrons appears inhibited due to the interruption of translational coupling [
126]. The first two cistrons,
rplN and
rplX, are subject to “retroregulation” resulting from mRNA destabilization [
127]. Regulation of the last two cistrons,
secY and
rpmJ, remains unclear.
The S8 binding site at the beginning of
rplE (uL5) bears a strong resemblance to the S8 binding site on the 16S rRNA [
123,
125,
128,
129]. The structure of the uS8 complex with its operator site on the
spc mRNA has been resolved at a high resolution, and it has been shown that uS8 uses the same RNA-binding site both for 16S rRNA and mRNA binding [
128,
129]. The high similarity of both RNA targets for uS8 implies the principle of molecular mimicry. The
spc operon of
V. cholerae (γ-proteobacterium) is autogenously regulated by uS8 presumably in a
E. coli-like manner [
130], but how the
spc operon is regulated in
B. subtilis or other species remains unknown [
131]. Given that the
E. coli-like structure involved in uS8 binding with the
spc mRNA is not found in
B. subtilis, the regulatory mechanism seems to be different [
131]. Unfortunately, phylogenetic studies of the
spc mRNA autogenous regulation have not been advanced.
2.7. uS10, an Essential Player in Transcription-Translation Coupling and Transcription Antitermination
uS10 is a tertiary binding protein in the 30S assembly; its addition to the assembly intermediates depends on uS9, a secondary protein, and uS7, a primary assembly protein interacting with 16S rRNA [
132]. A well-studied functional role of uS10 in
E. coli ribosomes is its assistance in transcription-translation coupling where uS10 provides physical contacts between the leading ribosome and RNA polymerase (RNAP) synthesizing the mRNA [
133,
134]. Another factor critical for a direct link between RNAP and a translating ribosome is NusG. NusG contacts with RNAP via its N-terminal domain while through its C-terminal domain it can physically interact with uS10 on the 70S ribosome or with the termination factor Rho to stimulate Rho-dependent termination [
134,
135]. A competition between uS10 and Rho for NusG might explain why Rho cannot terminate translated transcripts. When the translation rate matches that of transcription, an approach of Rho to the transcript is blocked by the NusG interaction with uS10 on the ribosome, but when translation is inhibited, weakened NusG-uS10 contacts lead to uncoupling, resulting in RNAP backtracking and Rho-mediated termination [
136,
137]. Close relations between the leading translating ribosome and transcribing RNAP have led to the suggestion that they may form a physical complex, a so-called ‘expressome’, a molecular super-machine performing both steps of gene expression [
138,
139,
140]. However, the current models structurally describing bacterial transcription-translation coupling are rather controversial [
140].
Surprisingly, in contrast with
E. coli, transcription and translation in
B. subtilis seem to be functionally uncoupled, and an active transcription elongation complex may be independent of the leading ribosome, suggesting that
E. coli and
B. subtilis use divergent regulatory mechanisms [
141]. While translation elongation in these species proceeds with a similar rate, the transcription elongation rate of mRNAs in
B. subtilis is nearly twice as fast as that in
E. coli (runaway transcription). Moreover, factors that mediate translation- transcription coupling, Rho and NusG. are essential in
E. coli but dispensable in
B. subtilis. Phylogenetic analysis predicts that uncoupling of transcription and translation may be widespread in Gram-positive bacteria [
141]. Thus, the idea that transcription and translation are tightly coupled in all bacteria appears misleading: in
E. coli - tightly coupled, in
Bacillus – uncoupled, in other bacteria – we do not know yet.
The most studied moonlighting activity of uS10 is its functioning in antitermination of transcription first discovered in studies of phage λ. Upon transcription of N-utilization (
nut) sites on the λ genome, phage protein λN and a set of host Nus factors (
N-
utilization
substances) A, B, E (uS10) and G associate with RNAP, thus enabling the enzyme to read through intrinsic and Rho-dependent terminators [
142]. uS10 was the first r-protein for which its participation in transcription regulation was shown [
143]. It forms a complex with NusB to bind to a single-stranded boxA motif on λ
nut sites. The same boxA motif is present in
E. coli rRNA operon (
rrn) transcripts, and binding of uS10-NusB to the boxA sequences in nascent rRNA is indispensable for the
rrn antitermination. The bound to boxA uS10-NusB complex interacts with elongating RNAP via uS10-NusG interaction [
144,
145]. It should be noted that in the complex with NusB, uS10 adopts the same fold as in the 30S subunit and is blocked from simultaneous association with the ribosome [
146]. Thus, the functions of uS10 in transcription-translation coupling and in antitermination are very similar: in both processes its interaction with NusG bound to RNAP prevents the Rho-dependent transcription termination.
Further studies have identified an additional member of the
rrn antitermination machinery, SuhB (inositol mono-phosphatase), and revealed that a complex of Nus factors (NusB, NusE, NusA, NusG and SuhB) not only participates in antitermination on the
rrn operons but also provides correct folding and maturation of rRNA [
147]. Moreover, Nus factors may act beyond rRNA and regulate expression of mRNAs as well. Thus, in
E. coli, binding of the NusB-NusE (uS10) complex to the boxA sequence within the
suhB 5’UTR represses translation of the
suhB mRNA. This binding sterically prevents ribosomes from initiating translation, and translation inhibition in turn promotes Rho-dependent termination within the
suhB gene due to transcription-translation uncoupling [
148]. It is believed that the boxA-mediated regulation of Nus factors is highly conserved and widespread.
The regulatory activity of Nus factors has been recently proposed to be involved in the complex regulatory cascade of flagella biosynthesis [
149]. One of the flagella-specific sRNAs implicated in regulation of flagellar operons, MotR, appears to base pair internal to the
rpsJ (uS10) coding sequence and to promote Hfq binding to the
rpsJ leader sequence, which in turn results in increased
rpsJ translation. Due to its elevated concentration in a cell, uS10 in conjunction with NusB may increase transcription antitermination of long flagellar operons, contributing to flagellin protein levels, flagella number and cell motility [
149].
2.8. uS15, a Translational Auto-Repressor in Various Bacterial Phyla
uS15 is a primary protein in the 30S assembly pathway. It interacts with a highly evolutionary conserved central domain of 16S rRNA comprising a three-helix junction (h20, h21, h22), and this interaction is required for the subsequent binding of other proteins (e.g., bS6, bS18) necessary for the formation of the 30S subunit platform [
110]. Surprisingly, despite its primary role in the 30S assembly, uS15 is not essential as the strains with a deleted
rpsO gene are viable albeit reveal a cold-sensitive phenotype [
150]. This means that under appropriate temperature conditions in vivo the 30S assembly may proceed in the absence of uS15.
The only extraribosomal activity of uS15 found so far is its functioning as an autogenous repressor of the
rpsO gene. The uS15-mediated autogenous control represents the most studied case among the regulatory processes involving r-proteins; it has been examined in various bacterial species including
E.coli [
151,
152,
153,
154,
155,
156],
B. stearothermophilus [
157,
158]
, Geobacillus kaustophilus [
159],
Thermus thermophilus [
160],
Rhizobium radiobacter [
161]. In all these cases, the autoregulation operates at the translation initiation level but through divergent mechanisms: e.g. in
E. coli, binding of uS15 to the
rpsO mRNA leads to the ribosome ‘entrapment’ in a non-productive complex [
151], while a direct competition with the ribosome binding takes place in
Th. thermophilus [
160] and
B. stearothermophilus [
157,
158]. In contrast with a high conservation of uS15 and its 16S rRNA targets, the regulatory structures on the
rpsO mRNAs widely vary both at the primary and secondary structure levels, suggesting that there are many ways to allow autogenous regulation [
159,
161,
162].
In
E. coli, the regulatory site (operator) on the
rpsO mRNA folds in a pseudoknot that is stabilized by uS15, allowing the 30S ribosome to bind but preventing the formation of an active initiation complex [
151,
152,
154,
155]. The only common determinant shared by the two uS15 targets on rRNA and mRNA is a U-G/C-G motif that contributes modestly to rRNA binding but is crucial for mRNA recognition [
153,
154,
155]. Unlike
E. coli, the operator structures for uS15 on the
B. stearothermophilus and
Th. thermophilus rpsO mRNAs are organized in three-way junction motifs that mimic the conserved three-way junction of the S15 rRNA-binding site [
157,
158,
160]. Stabilization of the three-helix junction on the mRNA by uS15 may prevent the ribosome binding thus blocking translation initiation. Recently, we have examined the
rpsO regulation in mycobacteria
M. smegmatis (
Msm) and
M. tuberculosis (
Mtb) and provided evidence for the S15-mediated autoregulation at the translation initiation level [
163]. Remarkably, the autogenous regulation of the mycobacterial
rpsO genes appears to strictly require the pseudoknot conformation of the 5′UTR, so that mutations destroying the pseudoknot completely abolish the uS15-mediated translational repression (
Figure 4). As in the case of
E. coli, a U-G/C-G motif in a pseudoknot turns out to be crucial for the autogenous control. Moreover,
E. coli S15 appears capable of acting as an efficient repressor of the
Msm/Mtb rpsO expression, but this ability has been lost after destroying the pseudoknot. Thus, the mechanism for the uS15-mediated autogenous control in mycobacteria bears close resemblance to that described for
E. coli, despite the large phylogenetic distance between these species. At the same time, while the regulatory pseudoknot in
E. coli embraces the
rpsO SD-sequence and the initiator codon (in a loop2 region), the pseudoknots in mycobacteria are situated upstream from the initiation site, implying that the mechanistic details of the autogenous repression may be different, and in mycobacteria, uS15 binding to the pseudoknot might prevent the ribosome binding rather than entrap the ribosome.
2.9. bS20, a Curious Case of a Regulatory Protein
bS20 is one of the six primary r- proteins (along with uS4, uS7, uS8, uS15 and uS17) that bind to 16S rRNA during the 30S subunit assembly. bS20 may interact with at least two regions on 16S rRNA, in the 5’domain and in the 3’minor domain (specifically with helix 44), bringing these very distant regions into proximity [
164,
165]. This is a puzzle since in vivo the 3’domain is transcribed much later than the 5’domain, given that the 30S ribosome assembly proceeds co-transcriptionally. bS20 is encoded by the gene
rpsT that is a monocistronic operon located apart from clusters of most r-protein genes on a bacterial chromosome (
Figure 1). Though bS20 is a primary binding protein, it is not essential, but its absence results in a slow growth due to poor assembly of the 70S initiation complex and defects in the translation initiation. These defects are caused by a significant reduction in the rate of mRNA association rather than impairment in P-site fMet-tRNA
fMet binding [
166]. Curiously enough, being a 30S subunit r-protein, bS20 has been repeatedly co-purified with the 50S subunit and even designated as a protein L26. Moreover, the copy number of bS20/bL26 in the 70S ribosome was evaluated as 1.38, thus implying the possibility of its independent binding to each subunit [
167]. A specific feature of the
rpsT mRNA is the use of the otherwise inefficient UUG as a start codon that is unusual for the mRNA of a highly abundant r-protein. The use of UUG is not widespread among bacterial
rpsT mRNAs, being typical only for several γ-proteobacterial families (e.g., Enterobacteriaceae, Pasteurellaceae, Vibrionaceae) but not for others, e.g., members of Pseudomonadaceae, Legionellaceae and Xanthomonadaceae use GUG, and alpha-, beta- and epsilon-proteobacteria use a normal AUG start codon to initiate translation of the
rpsT mRNAs (NCBI Gene Database). It is believed that like other primary r-proteins in
E. coli bS20 may regulate its own synthesis as an autogenous repressor, and the weak start codon plays a fundamental role in the autoregulation, allowing bS20 to compete with ribosomes for mRNA binding [
168]. However, compelling arguments have not been provided, and attempts to demonstrate any measurable affinity of bS20 for its own mRNA have appeared unsuccessful [
169]. No conserved mRNA secondary structures typical of autogenous operators for r-protein-repressors have been found either [
84]. Thus, until now the belief that bS20 acts as a translational autogenous repressor remains neither proved nor disproved.
2.10. bS21 and Heterogeneity of Ribosome Population
In
E. coli, the
rpsU gene encoding bS21 belongs to the unique operon called macromolecular synthesis operon (MMS) that comprises genes involved in initiation of the major processes in the flow of genetic information [
170]: bS21 (
rpsU) - in initiation of translation, DNA primase (
dnaG) - in initiation of chromosome replication, and the
rpoD-encoded major sigma factor, sigma-70, - in transcription initiation (
Figure 1).
E. coli bS21 is an essential r-protein that participates in translation initiation by providing base pairing of the 3’ terminus of 16S rRNA with the SD sequence on mRNA [
171]. It should be noted,
rpsU belongs to the MMS operon only in Gram-negative relatives of
E. coli, while in Gram-positive Firmicutes
rpsU is situated separately (NCBI Gene database). A striking example is the absence of bS21 in every member of
Actinobacteria. This protein is also missing in all representatives of the phyla
Deinococcus-Thermus,
Fusobacteria, and
Thermotogae [23], Conversely, some bacterial species encode multiple bS21 homologs, e.g. an intracellular bacterial pathogen
Francisella tularensis encodes three distinct homologs of bS21
[172,173].
bS21 is one of the last proteins in the 30S assembly, which is loosely bound to and easily exchangeable among ribosomes [
174]. Its absence in a part of the cellular ribosomal population leads to intrinsic ribosome heterogeneity and hence may provide a regulatory capacity. The presence of several bS21 homologs, as in
F. tularensis, also implies ribosome heterogeneity that can contribute to post-transcriptional regulation of gene expression. Recent data convincingly demonstrate how variations in the bS21 content may affect translation efficiency of certain mRNAs.
Selective translational control mediated by bS21 has been reported for
Flavobacterium johnsoniae [
175,
176]. Representatives of Flavobacteria, as well as other members of the phylum Bacteroidota, do not use the SD interactions for translation initiation. Although the anti-SD sequence is present at the 3’-terminus of 16S rRNA, it is buried in a pocket formed by bS21, bS6 and bS18 on the 30S platform and hence is unavailable for base-pairing with mRNA. The C-terminal region of bS21, highly conserved in Bacteroidota but not in other phyla like γ-Proteobacteria, is responsible for the anti-SD sequence sequestration [
175]. The
rpsU mRNA in Flavobacteria represents an exception in that it bears unusually extended Shine-Dalgarno sequences and therefore can be efficiently translated only by a subpopulation of ribosomes lacking bS21, resulting in replenishing the cellular amount of bS21 [
176]. This kind of the autoregulatory mechanism represents a unique case when the r-protein serves as an autogenous translational regulator not in a free state but as an integral part of the ribosome. It should be mentioned that regulation of
rpsU in
E. coli, unlike in Flavobacteria, remains unknown.
The presence of several bS21 homologs in a cell can play a regulatory role as well. In a human pathogen
F. tularensis, one of the three bS21 homologs, bS21-2, specifically governs translation of virulence genes [
172]. The mRNAs responsive to bS21-2 bear specific features in their 5’UTRs, such as an imperfect SD sequence and a particular six nucleotide sequence, while mRNAs with a classic SD-element do not require bS21-2 for translation [
173]. This raises a possibility that other bS21 homologs, in
F. tularensis or other organisms, may influence translation in a leader sequence-dependent manner.