So far, we have provided a comprehensive compilation of putative genomic regulatory regions bound by OSKM during different stages of cellular reprogramming by performing genome-wide occupation analytical studies integrated with gene expression and open chromatin signatures (
Figure 1E,
Figure 2). Previous studies have identified a large number of biochemical and functional interactions between O/S/K/M, thus implying that they collaboratively initiate cellular reprogramming [
17,
22]. This unprecedented and carefully-coordinated OSKM dynamic binding along with the binding of other TFs, including the 9TR network,
lead to a sequential occupation of markedly different genomic regions between early and later stages of reprogramming, thus driving cells to abandon their mesenchymal phenotype and induce their transition towards pluripotency. For example, only the small percentage of cells in which the 9TR network is successfully reconstructed and activated by the direct action of OSKM will be reprogrammed into iPSCs [
10] (
Figure S3A-I). For an initial assessment of the existence and the biological relevance of putative Reprogramming Inducible Enhancers (RIEs), we searched the genome for such enhancers with
in vivo reprogramming activity. Furthermore, the identification of RIEs will allow the isolation and study of the precious rare cell populations undergoing successful transition from the somatic to the pluripotent phenotype in real time. Our previous analysis identified early-bound ESC genomic sites (
Figure 2A, left panel) implicated in the up-regulation of genes related to the transition to pluripotency (
Figure 2C-D,
Figure S1C,F). Importantly, 19,226 of these sites are bound by OSKM only after the initiation of reprogramming (
Figure 3A, right panel
de novo sites), and therefore we assumed that they could be contained within putative RIEs.
Based on the above, we carefully inspected these specific genomic sites to assess the value of these elements in functioning as true RIEs. More specifically, we searched the
de novo acquired OSKM early-bound sites focusing on regions residing up to ±2.5kb from the TSS of up-regulated genes during reprogramming (
Figure 3B). We did not take into account the biological function of the genes close to these elements, or any data regarding genomic regions previously characterized as regulatory elements. Thus, our RIEs identification approach was based on an unbiased logic. The resulting 702 sites were further filtered to remove loci bearing 10 or less Oct4, Sox2, Klf4 or Myc peaks in total. Based on this distribution, we defined 66 co-bound regions of adjacent OSKM peaks that are no more than 800bp apart on average, which we assumed that could function as putative RIEs (
Table S5). Of these, three test enhancers were selected as representative examples for further functional validation analyses: a 600bp-region located ~700 bp upstream of the Lefty1 gene (“Lefty1
700” element,
Figure 3C), a 300bp region located ~1800 bp upstream of the Pou5f1 gene (“Pou5f1
1800” element,
Figure 3D) and a 300bp region residing ~800 bp upstream of the Upp1 gene (“Upp1
800” element,
Figure 3E). Lefty1 encodes for a protein belonging to the TGF-β family of ligands which plays a role in the determination of the left-right symmetry in the developing embryo and it is a stemness marker [
43,
44]. Pou5f1 encodes for the Oct4 transcription factor protein. Oct4 has a vital role in regulation of pluripotency by forming the core of the pluripotency transcription factor network along with Sox2 and Nanog [
45]. Finally, Upp1 encodes for a uridine phosphorylase, an enzyme catalyzing the reversible phosphorolysis of uridine, often found up-regulated in rapidly dividing malignant cells [
46,
47]. All three elements are stably-bound by OSK from day 1 throughout reprogramming to ESCs, with the exception of Upp1
800, where Klf4 binding occurs from day 3 and afterwards. Importantly, all three elements are bound by Nanog in ESCs (
Figure 3C-E). Furthermore, OSK early binding (day 1) correlates with an increase of the corresponding gene expression within the first 24 hours (
Figure 3F-H). Taken together, these observations support the idea that the above genomic elements could indeed function as RIEs. In addition, ATAC-seq analysis revealed that these sites are occupied by nucleosomes in MEFs, but become accessible early in reprogramming (
Figure S4A-C). More specifically, OSK binding at Lefty1
700 correlates with an immediate opening of the local chromatin, while the Pou5f1
1800 and Upp1
800 become accessible after day 3. Accordingly, although at the beginning of reprogramming all three putative RIEs are marked by the repressive histone modification H3K27me3, during reprogramming there is a gradual replacement of the repressive mark by the H3K27ac modification, a transcriptionally active promoter/enhancer marker, a finding consistent with their function as RIEs (
Figure S4D-F) [
17]. Interestingly, all three RIEs coincide with ESC-related Multiple Transcription-factor binding Loci (MTLs) - regions short in size, bound by multiple TFs, which are believed to act as sites of enhanceosome assembly in ESCs (
Figure 3C-E, bottom). The elements have been also annotated as ES-specific Super-enhancer regions (
Figure 3C-E, bottom) [
42,
48]. Taken together, the Lefty1
700, Pou5f1
1800 and Upp1
800 identified by our unbiased approach (
Figure 3B), share many structural characteristics with functional enhancer elements and they are part of previously known regulatory elements in ESCs and therefore, they could indeed function as true RIEs.