1. Introduction
“The whole case here rests upon the demonstration that codon-amino acid pairing interactions exist and that the codon assignments in some way reflect these interactions… all-or-none specificities are not required for such interactions to determine the form of the codon catalog, either a general form or one specified down to the very last detail. All that is required here is that a sufficient number of slight preferences be shown.”.
Carl Woese, 1969
The origin of the genetic code and the emergence of biological information is a notoriously elusive question. Even on its discovery, it was clear there are non-random patterns in the code [
1,
2,
3]. These patterns loosely correspond to the biosynthetic precursors of the amino acids encoded [
4,
5,
6,
7], the hydrophobicity of those amino acids [
8,
9], and less clearly, their size [
4,
10], all of which point to some kind of direct biophysical interactions. Nonetheless, as suspected by Woese as early as 1969, ‘all-or-none preferences’ do not exist [
1,
11]. Woese argued that a large number of slight preferences (weak and not highly specific) could still have played a strong role in fixing the codon assignments [
11]. But this prescient argument was displaced by Crick’s adaptor hypothesis, which highlighted the lack of direct interactions between amino acids and either the codon or anticodon in tRNA [
12,
13]. Crick’s position was unequivocal: since neither evidence for such interactions nor a reasonable model for them existed, the interactions themselves did not exist under any circumstances [
9,
14]. While others have continued to pursue the idea that direct stereochemical interactions underpin the code, even the most promising experiments and simulations over the following half century [
15,
16,
17,
18,
19,
20,
21,
22] did little to dispel the prevailing scepticism [
23,
24,
25].
A second theme of Woese’s early ideas on the code has also been largely neglected since the rise of the RNA world hypothesis. Woese noted that the main ideas on the optimization of the code implicitly assumed selection at the level of the cell, whether through minimizing the effects of mutations [
26,
27], constraining ambiguity [
28,
29,
30,
31], expanding an early amino-acid vocabulary [
13], or fine-tuning weak biophysical preferences into the all-or-none assignments seen today [
9,
11,
31]. In contrast, the RNA world hypothesis advocated selection at the level of the gene. Beginning with another foundational paper in 1968, Orgel questioned how far complexity could emerge in a peptide world versus an RNA world [
32]. He concluded that only the simple copying of RNA templates could account for the emergence of natural selection, and so focused attention on the replication of RNA as a unit of selection [
32]. By the early 1980s, the discovery of ribozymes gave the RNA world a sense of concrete reality [
33,
34,
35,
36]. Orgel’s influential scenario held that “the very first replicators were ‘naked genes’ adsorbed on the surface of mineral particles”; and later on “impermeable membrane caps were ‘invented’ by the genetic system as it became metabolically competent [
37].” While these ideas do not rule out the emergence of the genetic code in cells, selection at the level of RNA is proposed to generate a set of functional RNA catalysts that sustain exponential growth in a prebiotic environment [
36]. Metabolism emerged with ‘RNA cofactors’ such as NAD, and the first proteins performed the same reactions as ribozymes, but more effectively, thereby eventually displacing RNA [
36,
38]. Neither the code nor cells are emphasised as critical early steps.
There is no doubt that RNA played a determining role in the early translational system and the emergence of the code [
35,
39]. Yet the idea that RNA ‘invented metabolism’, though dominant, has not been expansively developed. Attention has instead focused on the difficult problem of RNA replication [
38,
40,
41], and escaping the tendency to select for replication speed, often leading to parasitic collapse [
42,
43,
44,
45,
46]. Producing stable longer chain RNAs has been an end in itself [
47], on the assumption that natural selection can then drive the emergence of metabolism, ‘bit by bit’ [
48]. But this scenario overlooks the complexity of metabolic pathways, the known endpoint of evolution. While cofactors probably played an important role, by no means do they catalyse every step. And if RNA encoded ribozymes or enzymes that each catalyzed individual steps, introducing them one at a time would have no benefit if the rest of the pathway was missing [
49,
50,
51]. Building metabolic pathways step by step from one end or the other lowers the combinatorial odds but requires that all intermediates be stable, useful and available [
52,
53,
54], which is certainly not the case for pathways such as purine synthesis. The problem is not simply combinatorial. As observed by Walker and Davies, “biological information has an additional quality which may roughly be called ‘functionality’ – or ‘contextuality’ – that sets it apart from a collection of mere bits as characterized by its Shannon information content.” [
55] The simplest solution to this near-intractable problem is to assume that the core of metabolism is thermodynamically and kinetically favoured in a propitious environment [
56,
57,
58], so the first RNA genes only had to enhance flux through the protometabolic network [
59]. But that presumes a lot from prebiotic chemistry, to the point that Orgel memorably dismissed it as ‘an appeal to magic’ [
60].
Experimental work over the last decade has now weakened Orgel’s position. Starting from CO
2 and H
2 – the autotrophic core of all life – much of intermediary metabolism is not only thermodynamically favoured but occurs spontaneously following the universally conserved biochemical pathways. Mineral catalysts [
61,
62] or proton gradients across inorganic barriers [
63] can drive CO
2 fixation, directly generating the carboxylic-acid intermediates of the acetyl CoA pathway [
64] and reverse Krebs cycle [
65,
66]. From these universal precursors, α-amino acids [
66,
67,
68,
69], acetyl phosphate [
70], sugars [
71,
72,
73,
74], and some nucleobases [
75] have been formed via pathways that prefigure metabolism in the absence of genes and enzymes. While much still needs to be done, the concept of a spontaneous protometabolism is no longer an appeal to magic. Going further, modelling shows that catalytic positive feedbacks from nucleotide cofactors could drive flux through protometabolism, giving rise to autotrophic protocells growing from CO
2 and H
2 [
59]. If RNA polymerization could occur within these replicating protocells, the emergence of the genetic code through weak biophysical interactions between amino acids and cognate bases could solve the RNA-world problem along the lines postulated by Woese [
11].
Reexamining patterns in the code from the standpoint of autotrophic protometabolism is little short of revelatory. Harrison
et al. have shown that the base at the first position of the codon corresponds to the distance from CO
2 fixation, following the universal metabolic map [
59,
76]. Amino acids encoded by a G at the first position of the codon are usually the closest to CO
2 fixation, followed by A, then C and U, which might suggest a purine-rich early metabolism [
76]. Given that most nucleotide cofactors are derived from purines, including NAD, FAD, CoA, ATP, folates and pterins, the idea of a purine-rich early metabolism is consistent with cofactor-catalyzed positive feedbacks. When structured according to the base at the first position of the codon, there is a much stronger relationship between the hydrophobicity of the amino acid and the base at the second position of the anticodon – in other words, the correlation is stronger for earlier amino acids (closer to CO
2 fixation) than for later amino acids [
76]. Finally, the patterns of redundancy across the code are far from random but are governed by rules pertaining partly to the size of the amino acid and adjoining bases [
76]. These patterns predict weak biophysical interactions between amino acids and cognate bases. As noted, while there are tantalizing hints that such interactions do exist [
15,
16,
17,
18,
19,
20,
21,
22], they have not yet been systematically demonstrated across the full code [
23,
24]. Here, we have taken a novel approach, using molecular dynamics simulations to analyze the forces acting between atoms, which has allowed us to revisit the interactions between the 20 standard proteinogenic amino acids and four RNA mono-nucleotides in three distinct charge states. We find that half of the amino acids do bind best with their anticodonic middle base in the
1 charge state common to the backbone of RNA (which is greater than 99% of randomized assignments), while 95% of amino acids interact most strongly with at least one of their cognate codonic or anticodonic bases. We verify a selection of our results using NMR. Our results corroborate Woese’s proposals from more than half a century ago, and offer a compelling framework for the emergence of genetic information in biology.
4. Discussion
In this work, we have explored whether preferential stereochemical interactions between amino acids and nucleotides exist, and if so, whether the interactions match the patterns observed in the genetic code. Our results, using both MD and NMR, strongly support the hypothesis that stereochemical interactions do exist, and we find they often do match the modern codon assignments. The interactions are weak and probabilistic, but at scale they can be identified through the noise. Other notable features of the codon table, such as the frequency and non-redundance of uracil at anticodon base 2, and the patterning related to hydrophobicity, also appear to arise from these biophysical interactions. These patterns even hold true in modern codon reassignments, with 77% of known codon reassignments retaining the same middle base [
88,
89]. Taken together, our results corroborate Woese’s prescient conjectures from more half a century ago, that the genetic code is based on a set of ancient and spontaneous interactions that have not been overwritten since the origin of life [
1,
11].
While this central idea is supported by both MD and NMR, some discrepancies remain. In the NMR, all the nucleotides are likely to be in the
2 charge state at pH 7 [
90] but the MD simulations mainly predicted cognate preferences in the
1 state (which matters because this corresponds to the charge state in the RNA backbone). Exact rank preferences were also not replicated perfectly between the two techniques. More generally, while our results support the hypothesis that hydrophobicity shapes interactions, we have struggled to identify patterns that directly resemble those observed in the codon table. For example, being the most hydrophobic base [
15,
85] we expected AMP to interact most strongly with hydrophobic amino acids. UMP is the least hydrophobic base [
15,
85], so we expected it to interact most strongly with hydrophilic amino acids. We anticipated similar but weaker patterning for CMP and GMP, which are intermediate in their hydrophobicity. But none of these differences were recovered in our simulations. We suspect that this discrepancy may reflect limitations in the fixed-charge forcefields used in our MD simulations, which also struggle to model dynamic charges and subtle changes in electron densities [
91,
92]. If so, then hydrophobic effects were poorly captured in our simulations, which could explain the poor overall binding of amino acids to AMP. This interpretation is supported by
Figure 2, which shows that our simulations correctly predicted the cognate anticodon middle base for most hydrophilic amino acids (to the right end of the Trinquier scale) but fared badly with the hydrophobic amino acids (to the left). The negative charge on the phosphate group might then be an overpowering factor in our simulations, as suggested by the strong negative correlation between hydrophobicity and amino-acid binding in the
2 state across all nucleotides (SI Figure 4). Because this trend reverses as charge is decreased towards zero (SI Figure 3) the charge on the phosphate seems to play a dominant role.
Another puzzle relates to the striking prediction that the anticodonic middle base and first codonic base were both preferred binding partners for cognate amino acids (
Figure 2). This matches the observation that clear patterning is observed for both these bases in the codon table [
76]. It also matches the complementary prediction from
Figure 6, showing an elevated affinity of amino acids to the cognate anticodonic and codonic dinucleotides. These findings suggest the emergence of coding in some sort of binding pocket containing both the codon and anticodon. Nonetheless, it is still surprising that the binding affinities of amino acids to dinucleotides were not greater than those for mononucleotides. One possible explanation might be that very short polynucleotides are a known weak-spot for MD [
93]. Or it could be that our results reflect a sampling bias due to the relatively limited number of amino-acid–dinucleotide pairs simulated. More interesting: if the patterns do reflect binding to a pocket, this would probably require the cognate nucleotides to be positioned opposite one another (as in normal codon-anticodon interactions). Without this multidirectional binding, the extra complexity of dinucleotides might confound preferences rather than strengthen specificity.
If our results do indeed point to some sort of selective RNA binding pocket for amino acids, then the challenge becomes: how could this pocket evolve into to the modern translational and informational system? While much remains ambiguous, we imagine a model in which non-enzymatic chemistry and stereochemical interactions between amino acids and their cognate nucleotides could build incrementally towards the modern translational system. At issue here, once more, is Crick’s adaptor hypothesis [
12,
13], which stresses the absence of any direct correspondence between either the codon or anticodon and amino-acid binding. On the contrary, on tRNA, the amino acid always binds to a CCA acceptor stem at one end of the molecule, while the anticodon-codon interactions take place far away at another end of the molecule. While there may be interactions between anticodons and amino acids in the ribosome [
94], these have nothing to do with amino-acid loading of the tRNA by aminoacyl tRNA synthetases. The question then becomes how, physically, could the interaction between a binding pocket containing the anticodon and its cognate amino acid become separated into an interaction between a CCA acceptor stem and the cognate amino acid, and elsewhere, between the anticodon on tRNA and the codon on mRNA? We sketch a possible model in
Figure 7.
We propose that translation began in autotrophically growing protocells, as outlined in the Introduction [
59,
76]. In this structured setting, the undirected polymerization of nucleotides and amino acids could in theory occur spontaneously, driven by nucleoside triphosphates, notably ATP [
95], and catalysed by metal ions such as Mg
2+ and amino acids such as aspartate (which is conserved in the active sites of modern RNA polymerases [
96]) or lysine (which is conserved in the active site of modern RNA ligase enzymes [
97]). Nucleotide polymerization in turn should form short non-templated RNA aptamers, some of which may resemble a single hairpin loop of tRNA in structure [
18,
98] as depicted in
Figure 7A. We imagine amino acids binding to these RNA pockets by way of the weak biophysical affinities demonstrated here, giving a statistical likelihood of repeatability. This biophysical patterning can in principle explain the emergence of information in biology. Consider: if random RNA sequences bind amino acids in a non-random fashion, and this facilitates the polymerization of those amino acids into short templated peptides with non-random sequences, then biological meaning, linked with function, is introduced in the context of autotrophically growing protocells. Functions in growing protocells could include CO
2 fixation, RNA polymerization and cofactor binding, all of which would facilitate protocell growth and heritability [
59,
76].
For RNA to template functional peptides, the next necessary step would be the transfer of the amino acid from its binding pocket onto the proto-tRNA acceptor stem. The universal acceptor stem is the CCA terminal of tRNA, and this is rigorously enforced in biology [
99]. Curiously, CCA is an anticodon for glycine, the simplest amino acid with only an H for an R group, meaning that CCA is most likely to interact with the amino or carboxyl groups rather than the R group. This general binding affinity means that the CCA could act as a universal ‘fishing rod’ for all amino acids; the presence of a CCA terminus could facilitate the binding of amino acids and may have been what initially began to differentiate proto-tRNA from other sequences. In effect, any short RNA hairpin with a terminal CCA would behave like a proto-tRNA. The stacking of ATP on the terminal AMP of the CCA stem would help colocalise the ATP and amino acid, which is an essential function of amino acyl-tRNA synthetases (aaRS) as depicted in
Figure 7A. Progenitors of aaRS, even very short polypeptides [
18,
100], have been shown to catalyse amino-acid adenylation and even tRNA acylation, by protecting the adenylate from water, and colocalising reactants. We show the adenylation of amino acid in
Figure 7B, followed by transfer of the amino acid onto the CCA acceptor stem in
Figure 7C, which acylates the tRNA. Thus, with no more than a short tRNA, we can picture the binding of an amino acid to a specific pocket, followed by its adenylation and acylation of the tRNA. These simple proto-tRNA molecules could eventually augment their specificity beyond the simple biophysical preferences shown here, for example through the size discrimination of amino acids, which constitutes the deep split between the two classes of aaRS [
101].
Exactly how such a tRNA could facilitate amino acid polymerization is another question. As depicted in
Figure 7D, we imagine that the anticodon was initially positioned at the opposite end of the tRNA hairpin loop to the CCA acceptor stem, on a flexible hinge that could twist around to interact directly with the codon on a proto-rRNA. In
Figure 7D, we depict a small peptide growing from these interactions, but have not specified a mechanism. It is feasible that other short RNAs could catalyse amino-acid polymerization by colocalizing proto-tRNA and mRNA templates. Such an assemblage of RNA would be the first steps to forming the proto-ribosome, and there is some evidence of this process in the structure and sequences of modern ribosomes [
102,
103]. These RNA complexes presumably catalysed peptide bond formation, but later proto-ribosomes had to facilitate alignment with templates and begin to enforce reading frames from looser stereochemical roots.
This model is admittedly based on some rather large extrapolations from the literature, but provides a route to build a full translational system from the simple biophysical interactions observed here. The critical steps to test will be the non-enzymatic chemistry of nucleotide polymerization in water, the specificity of RNA pockets for amino acids, and the templated polymerization of peptides on RNA. We will also address some of the mechanistic puzzles relating to the complexification of key components, notably tRNA, which eventually moves the cognate nucleotides away from the acceptor stem. More mundane but immediate goals include exploring a wider range of experimental conditions for MD simulations. Advances in polarizable forcefields would also improve our results if they are able to simulate non-polar interactions with more sophistication. There is also plenty of scope for further work with the NMR, including improving the precision of the experimentally determined binding constants, investigating more amino acids, and progressing to polynucleotides, though the number of possible combinations when using longer RNAs will present a challenge to comprehensive exploration.
In conclusion, the results presented here support the existence of weak and probabilistic binding preferences between amino acids and nucleotides, as argued by Woese more than 50 years ago. Our results point to the origin of translation in binding pockets in RNA hairpin loops. The fact that these interactions are evident even with mononucleotides suggests that genetic information is based on spontaneous interactions built into the structure of the code from the very origins of polymerization. That these biophysical interactions still shine through the genetic code shows they form a cornerstone that has supported the dazzling complexity of life ever since.
Figure 1.
Figure 1. Volume adjusted proximity probability distributions for a selection different combinations of amino acids and nucleotides. Proximity is the closest atom of the nucleotide to the closest atom of the closest of 10 amino acids in the 40 Å periodic box. Proportion of simulation time is adjusted for the volume of the amino acid and nucleotide, so that different systems are comparable. Most interactions demonstrate multiple binding modes, at ~ 1.9 Å, 2.5 Å and 4 Å. An additional peak at around 5.5 Å is also visible, which is interpreted as the average distance of the closest amino acid when not bound. The vertical red line indicates the 5 Å threshold for binding. (A) proline; (B) arginine; (C) aspartate; (D) phenylalanine; (E) glycine.
Figure 1.
Figure 1. Volume adjusted proximity probability distributions for a selection different combinations of amino acids and nucleotides. Proximity is the closest atom of the nucleotide to the closest atom of the closest of 10 amino acids in the 40 Å periodic box. Proportion of simulation time is adjusted for the volume of the amino acid and nucleotide, so that different systems are comparable. Most interactions demonstrate multiple binding modes, at ~ 1.9 Å, 2.5 Å and 4 Å. An additional peak at around 5.5 Å is also visible, which is interpreted as the average distance of the closest amino acid when not bound. The vertical red line indicates the 5 Å threshold for binding. (A) proline; (B) arginine; (C) aspartate; (D) phenylalanine; (E) glycine.
Figure 2.
(A) Rank preference of the anticodonic middle base nucleotide for each the 20 amino acids as predicted by MD simulations. Three additional cognate assignments are included for amino acids with multiple 1st and 2nd base assignments. Random allocation shows equal chance of each amino acid interacting most strongly with any nucleotide. (B) Number of amino acids predicted to interact most strongly with both, one, or no nucleotides cognate at either the 1st or 2nd position of either the anticodon or codon, compared to random allocations where the probability of interacting with any given nucleotide is equal. (C) Rank preference of each amino acid for each nucleotide, ordered by mean hydrophobicity rank of amino acids 8,76. The true anticodonic middle base for each amino acid is circled. Circular datapoints show pyrimidines; square datapoints are purines. Green is UMP, blue CMP, orange GMP and grey AMP. (D) Sum of rank preferences of the amino acids for their cognate nucleotides compared to randomized preferences. Highlighted bars are scores for: codon base 1 (purple), codon base 2 (green), anticodon base 1 (yellow), anticodon base 2 (red). All nucleotides are in the 1 charge state in all panels and interactions are given with respect to ring nitrogens.
Figure 2.
(A) Rank preference of the anticodonic middle base nucleotide for each the 20 amino acids as predicted by MD simulations. Three additional cognate assignments are included for amino acids with multiple 1st and 2nd base assignments. Random allocation shows equal chance of each amino acid interacting most strongly with any nucleotide. (B) Number of amino acids predicted to interact most strongly with both, one, or no nucleotides cognate at either the 1st or 2nd position of either the anticodon or codon, compared to random allocations where the probability of interacting with any given nucleotide is equal. (C) Rank preference of each amino acid for each nucleotide, ordered by mean hydrophobicity rank of amino acids 8,76. The true anticodonic middle base for each amino acid is circled. Circular datapoints show pyrimidines; square datapoints are purines. Green is UMP, blue CMP, orange GMP and grey AMP. (D) Sum of rank preferences of the amino acids for their cognate nucleotides compared to randomized preferences. Highlighted bars are scores for: codon base 1 (purple), codon base 2 (green), anticodon base 1 (yellow), anticodon base 2 (red). All nucleotides are in the 1 charge state in all panels and interactions are given with respect to ring nitrogens.
Figure 3.
Proportion of simulation time for each amino acid within 5 Å of a nucleotide, adjusted for molecular volume. Amino acids are denoted by their single letter codes, and organised by mean hydrophobicity rank, calculated from Trinquier’s 43 scales 8,76. (A) phosphate charge state = 0; (B) phosphate charge state = 1. (C) phosphate charge state = 2. Dotted red line shows best fit from a linear regression. Distance is measured relative to nucleobase rings.
Figure 3.
Proportion of simulation time for each amino acid within 5 Å of a nucleotide, adjusted for molecular volume. Amino acids are denoted by their single letter codes, and organised by mean hydrophobicity rank, calculated from Trinquier’s 43 scales 8,76. (A) phosphate charge state = 0; (B) phosphate charge state = 1. (C) phosphate charge state = 2. Dotted red line shows best fit from a linear regression. Distance is measured relative to nucleobase rings.
Figure 4.
1H NMR spectra showing characteristic changes in proton chemical shift perturbations (or changes) as the ratio of amino acid to nucleotide increases. The panels show mixtures of phenylalanine with each of the four mononucleotides, at two representative ratios: 1:150 nucleotide to amino acid, and 1:500 nucleotide to amino acid. Numbers next to each NMP refer to the proton probes used for measurement of peak shifts (see SI Figure 1 for numbering).
Figure 4.
1H NMR spectra showing characteristic changes in proton chemical shift perturbations (or changes) as the ratio of amino acid to nucleotide increases. The panels show mixtures of phenylalanine with each of the four mononucleotides, at two representative ratios: 1:150 nucleotide to amino acid, and 1:500 nucleotide to amino acid. Numbers next to each NMP refer to the proton probes used for measurement of peak shifts (see SI Figure 1 for numbering).
Figure 5.
Inferred NMR binding constants (KD) for phenylalanine, aspartate, arginine and glycine, with each of the four RNA bases. Lower KD indicates stronger binding. Numbers next to each NMP refer to the proton probes used for measurement of peak shifts (see SI Figure 1 for numbering). Red boxes highlight the cognate anticodonic middle bases for each amino acid.
Figure 5.
Inferred NMR binding constants (KD) for phenylalanine, aspartate, arginine and glycine, with each of the four RNA bases. Lower KD indicates stronger binding. Numbers next to each NMP refer to the proton probes used for measurement of peak shifts (see SI Figure 1 for numbering). Red boxes highlight the cognate anticodonic middle bases for each amino acid.
Figure 6.
Sum of rank preferences for a selection of amino acids and dinucleotides with respect to proportion of simulation time spent within 5 Å, for (A) whole dinucleotide (B) ring nitrogens, adjusted for molecular volumes. If all amino acids had their cognate nucleotides as their best binding partner, this gives the lowest score. Red = bases 1 and 2, anticodonic dinucleotide. Green = bases 1 and 2, codonic dinucleotide. Left of red line is the top 5% of randomized assignments.
Figure 6.
Sum of rank preferences for a selection of amino acids and dinucleotides with respect to proportion of simulation time spent within 5 Å, for (A) whole dinucleotide (B) ring nitrogens, adjusted for molecular volumes. If all amino acids had their cognate nucleotides as their best binding partner, this gives the lowest score. Red = bases 1 and 2, anticodonic dinucleotide. Green = bases 1 and 2, codonic dinucleotide. Left of red line is the top 5% of randomized assignments.
Figure 7.
Possible mechanism for amino acid binding and adenylation on a proto-tRNA. (A) An amino acid (yellow oblong) binds to its cognate anticodon on a short RNA with a hairpin loop. An ATP (blue) stacks onto the terminal A adjacent to the anticodon. (B) Nucleophilic attack of the carboxylate oxygen on the α-phosphate releases the pyrophosphate tail (blue oblongs), which adenylates the amino acid. (C) Transfer of the amino acid from the adenosine to the 2’ ribose of the terminal A on the ‘acceptor stem’ results in an amino-acylated RNA. (D) Possible primordial mechanism of translation, where a flexible hinge of the proto-tRNA allows binding of the anticodon to a ‘codon’ on an adjacent proto-mRNA (where the reading frame is also determined by stereochemical interactions) enabling synthesis of short peptide sequences specified by the RNA sequence.
Figure 7.
Possible mechanism for amino acid binding and adenylation on a proto-tRNA. (A) An amino acid (yellow oblong) binds to its cognate anticodon on a short RNA with a hairpin loop. An ATP (blue) stacks onto the terminal A adjacent to the anticodon. (B) Nucleophilic attack of the carboxylate oxygen on the α-phosphate releases the pyrophosphate tail (blue oblongs), which adenylates the amino acid. (C) Transfer of the amino acid from the adenosine to the 2’ ribose of the terminal A on the ‘acceptor stem’ results in an amino-acylated RNA. (D) Possible primordial mechanism of translation, where a flexible hinge of the proto-tRNA allows binding of the anticodon to a ‘codon’ on an adjacent proto-mRNA (where the reading frame is also determined by stereochemical interactions) enabling synthesis of short peptide sequences specified by the RNA sequence.