3. The Accumulating Evidence for the Biological Importance of G-Quadruplexes
G-flipon maintenance. Interestingly, the first evidence hinting at a biological role for GQ came from C. elegans. Sequences with the G-quadruplex motif were deleted in strains with
dog-1 (deletions of guanine-rich DNA) LOF variants, but not those with only 3 of the 4 G
3 sequence repeats [
53]. Mutant strains of
dog-1 lacking the trans-lesion polymerases (TLS) polymerases, POL eta and POL kappa had significantly more G-tract deletions than dog-1 by itself [
54]. Interestingly, the combined deletion of
dog-1 and the spindle-checkpoint component
mdf-1 enabled long term survival [
55], even though a high incidence of lethal mutations in this variant was revealed by the use of balancer chromosomes. In total, 126 (13%) of the 954 mono-G/C tracts, larger than 14 bp, were deleted over 470 generations in a strain deficient in both genes. The effects of
dog-1 LOF on the spindle checkpoint suggest an effect of GQ on sister chromatid alignment by the cohesin proteins during mitosis. The results also supported the consensus that GQ has only a limited role in determining phenotype not only in
C. elegans, but also other organisms.
Epigenetic Modification. The
dog-1 homolog in the DT40 chicken lymphoblastoid cell line, the 5′ FANCJ (Fanconi Anemia Complementation Group J) helicase [
56] also was found to prevent deletion of G-tracts that also had the potential to form G-quadruplexes. Effects were enhanced by loss of the REV1 polymerase that localizes TLS to sites of polymerase stalling. Interestingly, REV1 catalytic activity was not necessary to prevent deletion, although the LOF variant did enhance the rate of guanine rich DNA loss. Also, in the FANCJ model, the combined deletion of the superfamily 2 RecQ (SF2) Werner and Bloom Syndrome 3′ helicases [
57] also increased G-repeat loss, likely because of reduced GQ resolution [
56].
Of interest was that the TLS pathway was required to maintain the epigenetic state of dividing cells, as monitored by cell-surface expression of a protein with an intronic G-flipon that regulated gene expression. Whereas in the wildtype cell, the histone modifications associated with this G-flipon were maintained, they were lost following
rev1 deletion. Instead, resolution of the GQ formed during DNA replication was through the gap-fill repair pathway. The subsequent incorporation of unmodified histones led to diminished gene transcription and surface marker expression. This
rev1-dependent phenotype could be reverted by re-expression of human SF2 Fe-S FANCJ helicase [
56]. The opposite effect was observed when a G-flipon was experimentally inserted into a repressed locus. In this case,
rev1 deletion led to depression of the locus, consistent with the replacement of repressive histone with unmodified histones that were permissive to gene expression [
58]. These results support a model where the formation of GQ by G-flipons during periods of cell proliferation help transmit the current epigenetic state to progeny, an important biological outcome.
DNA replication and Sister Chromatid Conformation. The involvement of GQ in cell proliferation is further supported by other evidence. During assembly of the DNA polymerase complex at the origin of replication (OR), the MTBP protein assists in the loading of CDC45 into the replicative helicase. The C-terminal domain of MTBP binds GQ in vitro [
59]. Notably, G-flipons are enriched in OR. Indeed, in chicken DT20 cells a minimal, functional OR consists of a 90 bp fragments that has two G-flipons on the same strand [
60]. These constructs establish the nucleosome depleted region (NDR) bounded by histone H2A.Z that is typical of OR. Collectively, the results suggest a model in which the MTBP binds GQ at the OR to initiate the assembly of the replication complex.
Another potential role for GQ during proliferation and transmission of epigenetic state is to align sister chromatids, as mapping of intra- and inter chromatin interactions between homologous chromosomes reveals a high degree of symmetry in the architecture of topologically associated domains (TADS), and in the loops formed within TADs [
61]. In this regard, a recent report suggests that G-flipons are enriched near sites bound by the CTCF (CCCTC-binding factor), a protein associated with loop formation. Interestingly, the strand orientation of the G-flipons mirrors the inverse orientation of the two CTCF sites that associate with each other to form the base of the loop [
62]. CTCF however is not known to bind GQ [
45].
G-flipons in NER. The REV1 pathway also plays a role in nucleotide excision repair that is triggered by UV irradiation and the formation of DNA crosslinks. In this situation, loading of the repair pathway proteins such as XPCC and RAD23 is triggered by the protein ZRF1 and its yeast homolog Zuo1 that recognizes the lesion and induces GQ formation [
63]. Triggering of this pathway by cytosine deaminases can result in single base substitutions, with a C to G transversion occurring from the preferential insertion of cytidine into the lesion by REV1 [
64]. This mutation (STS13) is prevalent in cancers [
65]. The Cockayne Syndrome B helicase (encoded by ERCC6 that also binds GQ and performs NER in the transcription coupled repair pathway (TCR) [
66]. On sensing a lesion, CSB displaces DSIF from the RNA polymerase 2 ( RNAP2) complex, inducing a conformational switch that halts transcriptional elongation and initiates TCR [
67]. LOF variants of this enzyme are associated with premature aging phenotypes[
66].
G-flipons in BER. APE1 plays a similar role in stabilizing GQ formed by AP DNA, but not unmodified DNA, to initiate the base excision repair (BER) pathway [
51]. The pathway removes oxidized bases, such as 8-oxo-G. It is proposed that regulation of the APE1 by acetylation coordinates the expression of genes involved in pathways that respond to oxidative damage to the cell. Interestingly, the GQ involved are formed from G-flipons with a “spare tire”. The extra runs of G-repeats allow formation of a GQ despite damage to one of the other repeats [
68]. The flavin-dependent LSD1 demethylation of H3K9me2 also produces 8-oxoG through the hydrogen peroxide generated by the enzyme. The reaction is thought to explain the estrogen-induced activation of BCL-2 [
51].
GQ and Hemin. GQ may also play a different role in protecting the genome against oxidative damage caused by intracellular hemin [
69]. The high affinity (Kd ~ 10 nM) interaction of hemin with GQ was initially highlighted for its ability to generate oxidative species [
70], but it appears that this reaction in cells is squelched, presumably by proteins that bind to GQ [
69]. Instead, GQ sequesters hemin to prevent damage. Indeed, when released by small molecules that compete for binding to GQ, hemin induces the expression of genes, such as heme oxidase I, that modulate cell death by ferroptosis [
71].
GQ and telomere protection. The formation of T-loops by telomeres described above does not rule out a role for GQ formation in telomere protection. Indeed, the TRF2/RAP1 complex protects telomeres from homologous recombination by repressing PARP1 localization to telomeres and by inhibiting the SLX4 Holliday junction (HJ) resolvase. Loss of TRF2 and RAP1 in both humans and mouse leads to rapid telomere attrition , with increased rates of telomere deletion and fusion[
72]. TRF2 preferentially docks to GQ formed by RNA, rather than DNA. The protein binds GQ formed by the noncoding Telomeric Repeat-Containing RNA (TERRA) telomere transcript through an RG rich domain [
73].
GQ, the SANT domain, and gene expression. Prior to its role in telomere protection, Rap1 was characterized as a sequence-specific transcription factor that bound to an upstream activating site in yeast [
74]. The base-specific interaction with B-DNA was confirmed by crystallographic study of a telomeric sequence [
75]. Only later did crystal structures show that Rap1 also docked to GQ. Surprisingly, both DNA interactions involved the same helix, but a different face [
76] (
Figure 2). The GQ contacts were hydrophobic, with the helix lying on the planar surface of the terminal tetrad.
Figure 2.
The yeast Rap1 protein binds both B-DNA in a sequence-specific manner (A) [
75] and to GQ in a structure-specific manner (B). [
76] through different faces of the same helix in the SANT/Myb1 domain Images by Daniela Rhodes.
Figure 2.
The yeast Rap1 protein binds both B-DNA in a sequence-specific manner (A) [
75] and to GQ in a structure-specific manner (B). [
76] through different faces of the same helix in the SANT/Myb1 domain Images by Daniela Rhodes.
While this finding might seem anomalous, many subsequent studies have demonstrated the ability of proteins to bind specifically to their cognate B-DNA site, and also to a GQ. This finding is true for binding of the SP1 transcription factor to the c-MYC parallel GQ [
77] and for a range of other proteins that bind GQ with nanomolar affinity [
19]. Interestingly, like Rap1, many of the GQ binding proteins include a SANT/Myb domain such as ZRF1 [
78] and TRF2 [
79,
80]. Interestingly, the yeast Zuo1 protein has replaced the SANT domain with a highly hydrophobic helix [
78]. SANT domain proteins are found in multiple chromatin-modifying and remodeling complexes, although their binding to GQ has not been characterized for each [
81].
Helicases and Resolution of GQ. The DEAH box SF2 helicase DDX36 (RHAU), a highly specific GQ resolvase, also makes helical contacts with the GQ end plate [
82,
83]. However, the helix alone has a Kd of 1 μM. The additional engagement of a 3′ single-stranded DNA tail contributes to the nM affinity for the substrate and the ATP-dependent unwinding activity. Using a ratchet mechanism, the helicase disassembles the GQ, one guanine at a time. The chemical energy derived from ATP is converted into a pulling force by the C-terminal domain rotation that is associated with an opening of the helicase core [
83]. In the absence of nucleotide, or in the ADP bound state,
D. melanogaster DDX36 stabilizes the GQ [
84]. The crystal structure of the SF1
Thermus oshimai 5′-3′ DNA Pif1 helicase shows the enzyme in an unwinding state with engagement of single-stranded thymine repeats [
85]. The related yeast helicases
PiF1 and
Rrm3 cooperate to unfold GQ formed not only by telomeres, but also by centromeres and tRNA repeat sequences that form GQ [
86,
87]. The interaction of the Pif1 with the parallel stranded GQ differs from that of DDX36. The contact is mediated by a cluster of amino acids, including two arginine/lysine cation-π interactions at either end of the GQ, plus ionic contacts with the phosphate backbone. The enzyme unfolds G-quadruplexes in an ATP-dependent manner, unwinding both parallel and antiparallel GQ [
85]. The SF2 RecQ BLM helicase also can unfold a range of GQ folds through a variety of different mechanisms [
88]. Collectively, the helicases play key but distinct roles in flipping GQ back to the B-DNA conformation.
GQ and transcription complexes. Given the enrichment of G-flipons in promoters, a key question was how do GQ stabilizing and resolving proteins impact transcription. One of the surprises of the ENCODE project was the identification of HOT (high occupancy target) loci where upwards of a 100 TF bound, even to sites lacking their sequence-specific binding motif. The findings were initially dismissed as methodological artefacts [
89], but were later shown not to be [
90,
91]. The work focused on the sequence-specificity of TFs, not the GQs that were also formed at promoters. The ability of TF to bind both B-DNA and GQ offered a resolution to this dilemma presented by HOT [
45], especially given the possibility that the helicases engaged by each TF through various domains might recognize a specific GQ fold, a GQ loop of particular length or composition, or display a preference for a 5′ or 3′ single-stranded flanking sequence. Indeed, GQ formation is associated with increased TF binding [
92].
GQ and Promoter Maintenance. One possibility is that the promiscuous binding of TF at HOT is a mechanism to protect the genome from the oxidative damage due to the sequestration of hemin by promoter GQ [
69]. The diversity of associated helicase with TF would allow recognition of a wide variety of GQ folds, including those generated by various replication and repair errors occurring during a cell’s lifespan. Another explanation, one that is not mutually exclusive, is that once transcription is initiated by sequence-specific interactions, it could be maintained by docking of TF to GQ without the need for any further sequence-specific interaction. However, this possibility is not consistent with the observed rapid reset of promoters that occurs each round of transcription [
93,
94]. This disassembly of the transcriptional complexes mirrors the dissolution of promoter condensates that is triggered by the high levels of nascent RNAs produced during transcriptional bursting [
95].
GQ and Promoter resets. An alternative explanation for the engagement of GQ by so many TF accounts for this promoter reset. By increasing the local concentration of helicases through the complexes they localize, the TF promote the rapid resolution of both GQ and R-loops structures. The reset then allows the DNA duplex to reform and B-DNA sequence-specific proteins to seed formation of a new pre-initiation complex and so begin a new cycle.
Earlier experiments based on single molecule FISH suggest that the reset of promoters determine burst frequency but not burst size [
93]. One contribution to burst size is the frequency with which sister chromatids are transcribed. Curiously, only one allele is active at a time rather than both undergoing simultaneous transcription [
93]. The localization of many different helicases to the locus may allow one allele to reload sequence-specific TF to reform an initiation complex while the other one fires. Such coordinated activity is consistent with the symmetrical chromatin architecture observed for sister chromatids, as described above [
61]. Many other factors affect promoter reset. For example, the capture by Z-flipons of the energy released by an elongation RNAP2 is thought to power the re-engagement of the RNAP2 complex [
96,
97]. Of note is that some flipons may experimentally form either GQ or Z-DNA [
98]. For example, a GQ loop may contain an embedded Z-DNA forming sequence [
99]. The alternative structure adopted may depend on which alternative promoter is used, with the formation of Z-DNA downstream driven by transcription, while GQ upstream of the TSS localizes TF to the locus.
GQ and Gene Repression. The promoter reset occurs in competition with complexes that suppress gene expression. These competitors include the PRC2 complex that engages the GQ formed at promoters through the SANT (Swi3, Ada2, N-Cor, and TFIIIB) domain of the EZH2 (enhancer of zeste 2) component. For active genes, binding of PRC2 the GQ formed by a nascent RNA likely prevents engagement of promoter DNA [
100]. However, in other situations, binding of a small RNA
in trans to the coding strand would promote GQ formation by the promoter DNA without the transcription of a GQ RNA competitor. In this situation, proteins, such as PRC2, that are localized to the site by the small RNA, would enhance formation of a repressive complex at the promoter [
99]. Indeed, the small RNAs direct the
piwi mediated repression of endogenous retroelements in early development [
101]. By localizing a different set of proteins to the site, small RNAs acting
in trans could also promote transcriptional activation. Such a role has been proposed for the other
piwi-related agonaute family member complexes [
102,
103].
GQ and enhancers. Other mechanisms exist for the induction of alternative flipon conformations. Sequence-specific pioneering transcription factors, such as HNF4 and GATA4, can dock to their motifs on nucleosome bound DNA. The master regulators then localize complexes that evict histone octamers from the locus, generating a negatively supercoiled NDR at the site [
104,
105]. The energy released by removal of a nucleosome is sufficient to induce a number of different alternative DNA conformations [
106]. The relaxation of these structures to B-DNA is sufficient to power the assembly of the different biological machines that actuate cellular responses (
Figure 3).
Indeed, the formation of GQ at promoters often is detectable before the initiation of transcription [
107]. In such situations, there is no preference for which strand forms a GQ, further suggesting that gene transcription is not required to induce the flip from B-DNA [
77]. The formation GQ formation in promoter regions is not modulated by human topoisomerase I (TOP1), even though the enzyme is enriched at these sites [
108]. Instead, TOP1 is inhibited by GQ, with an IC50 ~ 100 nM [
109]. TOP1 activation is associated with RNAP2 release from the promoter and occurs 3′ to the polymerase [
108].
GQ, RNA splicing and RNAP2
pausing. GQ can also form in the gene body and in the RNA transcript produced. These structures have the potential to alter elongation rate and RNA processing [
35,
36]. Indeed, the small molecule cephaeline and the related compound emetine are both reported to disrupt GQ structure and globally disrupt alternative splicing [
110]. The GQ formed by RNA also affects alternative splicing. For example, the splicing factors U2AF65 and SRSF1 bind to GQ RNA with nanomolar affinity, showing specificity for different GQ substrates [
111]. The role of GQ in the co-transcriptional N6-methyladenosine (m6A) modification is more controversial. Interestingly, the methyltransferase METTL3/METTL14 heterodimer that writes m6A within the consensus DRACH motif (D = A, G, or U; R = A or G; H = A, C, or U) preferentially binds to rG4 structures through its RGG domain [
112,
113]. Also, the RBM15 protein that also binds G4 localizes METl3 to certain transcripts and to H3K36me3 marks [
45,
114,
115]. The mapping of GQ and m6A to splice junctions is dependent on the methods used. Over 81% of GQ that map in HeLa cells are formed from only 2 tetrads that will only form GQ in RNA, but the loops are not enriched for the DRACH motif [
116]. The mapping frequency likely depends on the m6A detection protocol employed and the cell line studied, varying from 14% in HeLa cells to 40% in HEK cells [
116]. More recent methods are even more sensitive than those used in this analysis, but reproducibility across studies remains a problem [
117]. The GQ formed by GGA repeats are destabilized by m6A modification [
118].
The m6A bases are read by heterogeneous ribonucleic acid proteins (hnRNPs) involved in alternative splicing, such as hnRNP C and hnRNP A2B1 {Ye, 2024 #3182} [
119]. However, expression of a hypomorphic METTL3 allele in mouse embryonic stem cells does not appear to change splicing patterns, although turnover of many m6A modified RNAs was slower [
120], In wildtype cells, the distribution of m6A in processed nuclear was similar to that found in cytoplasmic RNAs. Around 70% of the observed m6A sites were in terminal exons, with ~70% in the 3′ UTR. In chromatin, ∼93% of the m6As in the partially spliced CA-RNAs were in exons and only about 10% of m6As are within 50 nucleotides of 5′ or 3′ splice sites. Methylation was mostly performed before splicing [
121]. Rather than genomic knockout, another group examined the immediate effects of acute depletion of METTL3 protein in order to minimize downstream effects. Around 6%–10% of high-confidence m6A regions were mapped to introns, mainly to those of protein coding genes, either around stop-codon regions or at the beginning of the 3′ UTR. The loss of METTL3 disrupted inclusion of alternative introns/exons in the nascent transcriptome, particularly at those 5′ splice sites proximal to m6A peaks, suggesting that the sites were occluded or the isoforms were protected by proteins bound to m6A. Above those genes showing altered splicing were those encoding proteins for m6A modification (
Wtap, Ythdc1, Ythdf1, and Spen), suggesting a negative feedback regulatory mechanism that would be absent in cells with METTL3 deleted from the germline [
115]. Overall, the different results for GQ RNA formation at splice sites and METTL3 deficiency are consistent with a model where GQ folding in introns can promote m6A modification of exons, with rapid turnover of splicing isoforms that retain introns marked by m6A.
How GQ formation by DNA affects splicing is therefore of considerable interest as this may impact the timing and location of the events involved. One particular model proposes that GQ formation alters RNAP2 elongation [
35]. Thus, sites of RNAP2 pausing in the gene body are of considerable interest and have been investigated at nucleotide resolution. Careful in vivo measurement shows dependence of pause sites on the structure of the RNA:DNA hybrid produced, but not with GQ formation [
122]. The outcome may reflect the action of the FACT (Facilitates Chromatin Transcription) complex. FACT plays an important role in removing nucleosomes in front of the RNAP2 and replacing them behind the enzyme, preventing the net accumulation of local DNA supercoiling, while maintaining the existing epigenetic state [
123].
In contrast, a number of
in cellulo studies reveal that delays in RNAP2 transcript elongation occur at the CTCF binding sites involved in chromatin loop formation. CCTF binds to the large subunit of RNAP2 and the interaction is also associated with cohesin recruitment [
124,
125,
126]. Pausing of RNAP2 is associated with alternative splicing (reviewed in [
127]). Indeed, the complement of the degenerate pause motif given by Gajos et al., is consistent with a CTCF motif (i.e., the orientation is inverted relative to those enriched at TSS). Interestingly, the binding of CTCF to DNA is inhibited by methylation. The Increased engagement of CTCF binding with DNA following deletion of the DNA methylase DNMT1 is associated with an increase in alternative splicing of the CD45 transcript. A similar outcome occurs when DNA demethylation by TET1 (tet methylcytosine dioxygenase 1) and TET2 enzymes [
128,
129].
These findings are consistent with a model where CTCF induced stalling of the polymerase produces an R-loop that induces GQ formation at the site. The GQ structure then inhibits DNMT1 to prevent DNA methylation of the locus. The binding affinity of DNMT1 is higher for GQ than to duplex, hemi-methylated or single-stranded DNA [
130]. The resolution of the GQ by helicases then allows redocking of B-DNA binding CTCF to the original site and the reinstatement of the chromatin loop formed with the promoter. The CTCF binding sites facilitate the interaction because they are in reverse orientation to each other. The interaction between CTCF, RNAP2 and cohesin then sets the scene for the splicing of the subsequent transcript.
The loops formed between promoter and the spliceosome are associated with the transfer of various factors that accumulate in promoter regions to the site where they modulate splicing [
131]. Similar outcomes are also reported for the polyadenylation of transcripts [
132,
133]. The CTCF mechanism of connecting promoters with RNA processing condensates is quite flexible. For example, the multiple alternative splices of the protocadherin
Pcdh gene family connect the production of each isoform with a different active promoter [
134,
135]. The outcomes are aided by the formation of GQ that prevent the loss of CTCF binding sites through methylation and that promote the subsequent rebinding of CTCF by localizing helicases to the region.
Modulation of R-loops. Other mechanisms exist to regulate GQ formation by R-loops. For example, helicases such as SETX, and RTEL1 can facilitate the flip of GQ back to B-DNA through the resolution of RNA:DNA hybrids [
136,
137]. Nucleases that digest the RNA strand of hybrids, such as RNaseH1, play an important role in their removal [
138]. Other proteins such as ATRX prevent R-loop formation at telomeres by sequestering RNA to prevent formation of R-loops. Deletion of ATRX leads to increased formation of GQ at telomeres [
139].
GQ and ribosome assembly. GQ appear to play an important role in ribosome structure and maturation, with ribosomal RNAs enriched for G-flipons [
140]. A number of ribosomal proteins have been identified as GQ ligands in different screens [
111,
141]. Further, GQ binding and resolving proteins such as nucleolin and nucleophosmin help structure the nucleolar condensates that guide ribosome assembly [
48,
142,
143,
144].
GQ and translation GQ formation by mRNA is the subject of much interest, especially in the untranslated regions that regulate translation. These exons contain alternative translation initiation sites and microRNA (miR) binding sites that affect the production of different protein isoforms. The complexities involved are described in a number of recent reviews [
145,
146,
147,
148]. Analysis of G-flipons in 5′ and 3′ untranslated regions provides evidence of positive selection. These regions also undergo alternative splicing and have single nucleotide variants that are associated with quantitative trait loci [
149].
By modulating mRNA translation RNAs, GQ contribute in many ways to phenotypic pliability [
26]. Here helicases such as DHX36 and CCHC-type zinc finger nucleic acid-binding protein (CNBP/ZNF9 play a central role in promoting mRNA translation by resolving GQ [
150,
151]. The m6A modifications of RNA that are associated with GQ formation during transcription (as described above) also impact translation. The removal of these marks from the 5′ UTR near the start codon by the m6A erasers AlkB homolog H5 (
ALKBH5) and fat mass and obesity (
FTO) decreases ribosome translational pausing, increasing protein synthesis [
152]. Such m6A modifications also dynamically regulate heat shock responses by enhancing N7-methylguanosine cap-independent translation [
153]. Further, the class I cytoplasmic m6A readers, YTHDF1 and YTHDF3, promote the degradation of target transcripts [
154], potentially eliminating partially processed transcripts with retained introns. The endogenous repeat elements present in these introns, such as ALU SINE inverted repeats, might otherwise activate dsRNA and Z-RNA dependent immune responses [
97]. The potential of GQ to enhance m6A modifications provides additional mechanistic insight into how G-flipons increase phenotype pliability by regulating RNA dependent epigenetic outcomes.
GQ and development. GQ are able to facilitate a number of different processes in the cell that are directed by sequence-specific TF. Small RNAs, such as those used in the
piwi system to regulate endogenous retroelements [
155], provide another means by which GQ formation can be regulated in a sequence-specific manner. In both cases, the alternative flipon conformations engage the same structure-specific cellular machinery. The question arises as to two these two different systems for sequence-specific regulation of gene expression and RNA translation are used to coordinate development, especially during early embryogenesis. To explore the role of small RNAs in this process, the sequence-specific match between experimentally confirmed flipons and miR highly conserved in eutherian mammals was explored. Intriguingly, promoters with miR matches to G- and Z-flipons were highly enriched in developmental genes (FDR > 10
-100), consistent with a role in early development [
99].
Indeed, GQ are enriched in human embryonic stem cells (hESC). About 18,000 GQ were mapped to NDR as defined by ATAC seq. Following differentiation into neural stem cells and cranial neural crest cells, the number of detectable GQ was reduced by 25-50%, differing by lineage [
156]. In hESC, GQ were mapped to ~50% of bivalent promoters that contain both active H3K4me1 and repressive H3K27me3 marks and are lowly transcribed. The GQ in hESC overlapped sites bound by CTCF (~36 %), the cohesin component RAD51 (~50%) and RING1B that mediates repression by recruiting PRC1 to R-loops (~55%) [
157]. Differentiation was associated with the loss of bivalent promoters reflecting the potential of GQ to localize either activating or repressive protein complexes during lineage specification. Collectively the results are consistent with a model where small RNAs bootstrap development, much in the same way a computer loads an initial program to specify the inputs and outputs that are necessary for an operating system to run. Here, the programming of flipon conformation by small RNAs would establish epigenetic marks to template tissue differentiation by sequence-specific B-DNA binding proteins. The bootstrapping by small RNAs that occurs after the erasure of existing parental epigenetic marks early in development could potentially involve miR transmitted by either maternal or paternal gametes [
158,
159,
160,
161]. Further research is needed to address such mechanisms.