1. Background
The cell nucleus is pervasively transcribed into a plethora of different RNA molecules with different coding, structural or regulatory roles [
1] for a recent review), among them microRNAs. These are small RNAs (over 20 nucleotides long) that exert a post-transcriptional control of gene expression by promoting mRNA degradation or by repressing translation through their binding to homologous sequences mainly at the 3’ Untranslated Regions (3’UTRs) of mRNAs [
2]. 3’UTRs not only protect mRNAs from exonucleolytic digestión but also provide functional platforms for the miRNA regulation of mRNA function [
3,
4], which furthermore adapt to new regulatory requirements by changing the lengths of their sequences and the arrangement of regulatory motifs they harbour. In this way, sequence changes in 3’UTRs results in a deregulation of gene expression, linked to the loss (or acquisition) of specific miRNA binding sites, that eventually could lead to diseases ([
5] for a review).
3’UTRs are highly polymorphic in length and sequence, and variant 3’UTRs provide mRNAs with different binding sites for miRNAs or RNA binding proteins (RBPs) which would contribute to the establishment of new regulatory environments [
6]. Regulation of 3’UTR length is thus critical for the control of gene expression by its potential to regulate accessibility of miRNAs or RBPs to their cognate sequences in mRNAs. In this way, mRNA isoforms with alternative 3’UTRs has been seen to have differential mRNA stability [
7], microRNA binding potential [
8], or interactions with ceRNAs [
9]. Alternative polyadenylation is one important mechanism for regulating 3’UTR length [
10]. Pre-messenger RNAs (pre-mRNAs) harbour specific polyadenylation signals (AAUAAA in its canonical form) over 30 nts upstream the end of their 3’UTRs, to precisely position the cleavage/polyadenylation complex (CPSF) next the cleavage site where PABPN will extend the poly-A tail [
11]. Over 70% of human genes have more than one polyadenylation site in their 3’UTRs and 50% have three or more [
12], thus making the generation of alternative 3’UTRs by alternative polyadenylation (APA) a widespread mechanism for the generation of transcript variants that are heterogeneous in length and have different regulatory potentials [
13,
14]. On the other hand, alternative splicing has been shown to be also involved in the regulation of 3’UTR lengths, and their regulatory potentials, through different mechanisms such as intron retention, exon skipping, incorporation of one of two mutually-exclusive terminal exons of different length, use of 5’/3’ alternative sites or activation of cryptic splice sites [
3,
15].
The emerging picture on alternative 3’UTRs highlights a complex collection of isoforms whose individual expression switches in a controlled way during cell proliferation and differentiation, but that could be de-regulated under stress conditions. This would impact on the RNA interactome, causing a miscontrol of mRNA function, and facilitating the loss of physiologic homeostasis. In this way, 3’UTR shortening would increase mRNA stability by relaxing protein or miRNA-based mechanisms of mRNA degradation, while 3'UTR lengthening would strengthen accessibility to miRNAs [
16].
On the other hand, 5’UTRs are also heterogeneous in length and play a key role in regulating translation efficiency and mRNA stability [
17] since they contain a number of regulatory elements and miRNA binding sites [
18], piRNA binding sites [
19], translation-regulating secondary structures [
20] and protein binding sites [
21]. See [
18] for a recent review on the impact of 5’UTR length variations in cell physiology.
Our group is interested in studying the disease-dependent alterations in the regulatory circuitry controlling mRNA stability and function, more specifically in the interactions among miRNAs/sponging ceRNAs and mRNAs. In a previous work, we described the association of adverse cardiovascular events in hemodialysis patients to SNPs in the gene of the lncRNA ANRIL [
22]. Furthermore, we took advantage of the atherosclerosis-prone ApoE
-/- mouse model to investigate the role of CD40 signaling on atherosclerosis (ATS) progression. Mice treated with an anti-CD40 siRNA for 16 weeks showed a significantly reduced extension and severity of atherosclerotic lesions, as well as a diminished infiltration of macrophages (F4/80
+ cells; galectin-3
+ cells) in the intima of atherosclerotic plaques [
23]. Next, a genome-wide miRNA/mRNA microarray profiling showed that miR-125b [together with Taf3, Xpr1 and Ikkβ] was significantly upregulated during ATS progression, effect that was reversed upon CD40 silencing [
23]. Further work showed that inclusion of alternative 3’UTRs in the murine
Cd34 or in the scavenger receptor
SCARB1 transcripts had the potential to alter the pattern of interacting microRNAs, including miR-125 [
24,
25]. These results prompted us to perform a global study of length variation (including 5’UTR, CDS and 3’UTR domains) in transcripts downregulated during ATS progression and regression in the above ApoE
-/- model of ATS, and here we present the first analysis of the results obtained. These suggest that de-regulation of alternative splicing is an important mechanism to generate length variability in the 5’UTR, CDS and 3’UTR of transcripts in ATS progression. The consequences of this length variability on the regulatory circuitry controlling mRNA stability and function are discussed.
4. Discussion
The sequencing revolution has led to the discovery of new families of regulatory RNAs (lncRNAs, miRNAs, piRNAs, etc.) that establish complex regulatory networks with mRNAs. Alterations in these networks contribute to the disease process and could be also considered as targets of therapeutic intervention [
28]. Among these new RNAs, miRNAs have an important role by regulating mRNA stability and function through the targetting of complementary sequences at the 3’UTR of mRNAs. Nevertheless, recent reports are showing that the relationship miRNA/mRNA is much more complex than previously described, since mRNAs avoid miRNA targetting by regulating the length and sequence of their 3’UTRs by alternative polyadenylation or through the regulated use of alternative 3’ terminal exons [
3,
27]. In this context, we aimed to study the variability in the 3’UTRs of DEGs during ATS progression, or regression after treatment with an anti-CD40 specific siRNA, in aortas from ApoE-deficient mice. Our original microarray expression data identified mRNAs up- or downregulated in our experimental conditions (see
Figure 1 and [
23]), and here we report the results of the analysis of the downregulated transcripts, because the complexity of the data from the upregulated ones makes it advisable to make a sepparate publication. This work has benefited of the double identification of DEGs in the microarray output, that included the “GeneSymbol”, a gene identifier common to all isoforms, and the “Seqname” an individual isoform identifier.
Our results draw a complex picture of sequence length alterations in ATS progression, since these were not restricted to 3’UTRs but also affected 5’UTRs and the Coding Regions of mRNAs. In this sense, we found a clear lengthening of 3’UTRs during disease progression (C10 to C24,
Figure 2A) that was partially reversed by the siRNA treatment (C10 vs. T10 and C24 vs. T24,
Figure 2). Variations in 3’UTR length cause the acquisition of new regulatory potentials with regard of stability, translational or subcelular localization by controling accessibility of miRNAs or of RBPs to their target sequences [
3,
29] and have been mostly linked to Alternative PolyAdenylation (APA) events regulated by general or type-specific factors [
30]. In this way, 3’UTR lengthening would increase the regulatory potential by including miRNA or RBP-responsive elements that in tumour suppressor genes have been associated to cáncer resistence by increasing density of mRNA stability regulating RBPs [
31], while shortening of 3’UTRs has been linked to tumour progression [
32,
33] by releasing oncogene transcripts from miRNA control.
On the other hand, we also describe a small but significant lengthening of 5’UTRs from week 10 to week 24 of ATS progression (
Figure 3A) not reversed by the antiCD40 treatment (
Figure 4A). 5’UTR length variations have been linked to the activation of alternative or upstream start-codons [
34], of alternative promoters [
35] or to alternative splicing events [
36]. Furthermore, 5’UTRs have been shown to be targeted by miRNAs [
37,
38] in a way depending on their secondary structure [
39] (see
Figure 7 for a summary of the findings decribed in this manuscript).
We next addressed the mechanisms promoting sequence lengthening by studying transcript isoform switching in ATS progression or regression as a way to describe evolution of gene expression. We were able to detect 9 different genes that were simultaneously expressed in two conditions (6 genes in C10 and C24, other 2 in T10 and T24 and one common to both experiments), which furthermore showed isoform switching with a total of 19 transcript variants expressed in the different conditions (Tables 1 and 2). In this group of transcript isoforms, we identified alternative splicing of 5’ and 3’UTRs as a frequent mechanism contributing to the sequence variability. Furthermore, among these 19 transcripts we found two, Mta3 and Ly6e, expressing alternative 3’ or 5’UTRs (respectively) conformed by the activation of alternative donor/acceptor splice sites. In the case of Mta3 this involved the activation of a cryptic/latent donor splice site [
40] at the middle of one exon (
Figure 6). This is interesting because alternative splicing has been considered as a minor contributor to the generation of transcript variability since analysis of the superfamily of odorant receptor (OR) genes showed that over 80% of OR mRNAs were submitted to alternative polyadenylation while only a few of these used alternative splicing to generate length variants of 3’UTRs [
41].
Regulation of alternative splicing is very complex and involves multiple regulatory sites at the pre-mRNAs (splicing donor and acceptor sites, splicing enhancers and silencers, etc.) recognized by mRNA binding proteins and U-snRNPs ([
42] for review). The first step in the maturation of a mRNA is the recognition of the 5’splice site through the functional integration of cis-acting splice signals and splicing regulatory elements (SREs) in the mRNA with the activity of U1-snRNPs and a number of trans-acting splicing factors, all of them working in the context of the secondary structure of the mRNA which regulates accessibility of the splicing machinery to the splice site [
43]. Nevertheless, in the genome there are many potential exonic 5’ splicing sites that are not used under physiological conditions, raising the interesting question of the nature of the mechanisms restricting 5’SS selection in normal cells and how are these mechanisms altered in disease to allow the recognition of cryptic or latent 5’SS. This is especially important because improper activation of latent splice sites could result in the incorporation of intronic sequences with potential premature in-frame stop codons to mature mRNAs [
44]. In this sense, a recent model of 5’SS selection by Brillen and cols. demonstrated that SREs and SRE-binding proteins were able to block “weak” cryptic 5’splice sites to facilitate recognition of the “strong” actual sites following a sequential, iterative, and position-dependent process [
45] while Boehm and cols. proposed the Exon Junction Complexes to suppress cryptic 5’splice sites through the recruitment of the splicing regulator RNPS1 [
46]. Furthermore, spliceosomes with noncanonical U1-snRNAs or changes in the stoichiometry of spliceosome components could also contribute to the recognition of variant 5’ss to generate cell/tissue specific patterns of alternative splicing [
47,
48], and Arafat and Sperling have proposed a quality control mechanism (SoS, suppression of splicing) that would distinguish among normal and latent 5’ splice sites while suppressing these last, through the recognition of a functional ORF (see [
49] for a recent review). Lastly, an interesting mechanism has been proposed by Movassat and cols. that linked recognition of the 5’SS in the last exon with the cleavage and polyadenylation site, and identified PA factor CstF64 as a potential regulator of alternative splicing [
50]. In this context it is clear that changes on the expression of splicing factors during ATS progression could have an impact on the regulation of the alternative splicing events here described, and work is currently on progress in our group to probe this hypothesis.
Our results should be interpretated with caution and taking into account that we used microarray and not RNA.seq to obtain our expression data [
23]. Microarrays represent a quite static view of gene expression, being limited by probe design and by the selection of the arrayed mRNA isoforms. In this sense, we could not discard the presence of additional changes in the patterns of mRNA alternative polyadenylation in our mRNAs, not represented among the microarray probes, yet this apparent limitation has allowed us to highlight isoform switching and alternative splicing as contributors to the generation of transcript length variability in disease progression. In this sense, one interesting possibility is that splicing alterations were pervasive in ATS progression and that by using microarray data we had detected just a reduced part of these alternative/cryptic splicing events. Clarifying this interesting point will require the use of RNA.seq data to identify actual isoform variants. Lastly, splicing is becoming a subject of intense research to develop therapies aimed to normalize splice site usage, anomalous alternative splicing events and to correct splicing factor irregularities (see [
51] for a recent review). Characterizing the splicing alterations in ATS will surely provide new targets of pharmacological intervention.
In conclussion, here we have characterized the length variability of transcript variants dowregulated during ATS progression. We have uncovererd general length variations, not only at the 3’UTRs of these transcripts as expected, but also at their 5’UTR and CDS, and have given evidences for the involvement of alternative splicing as a frequent mechanism for the generation of length variability. To the best of our knowledge, this is the first report linking ATS development with changes in the length of ATS-DEGs, and highlights splicing factors as possible targets of therapeutical intervention in ATS progression.