#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

Ultra-deep sequencing reveals pre-mRNA splicing as a sequence driven high-fidelity process


Authors: Derrick J. Reynolds aff001;  Klemens J. Hertel aff001
Authors place of work: Department of Microbiology and Molecular Genetics, University of California, Irvine, California, United States of America aff001
Published in the journal: PLoS ONE 14(10)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0223132

Summary

Alternative splicing diversifies mRNA transcripts in human cells. While the spliceosome pairs exons with a high degree of accuracy, the rates of rare aberrant and non-canonical pre-mRNA splicing have not been evaluated at the nucleotide level to determine the quantity and identity of these events across splice junctions. Using ultra-deep sequencing the frequency of aberrant and non-canonical splicing events for three splice junctions flanking exon 7 of SMN1 were determined at single nucleotide resolution. After correction for background noise introduced by PCR amplification and sequencing steps, pre-mRNA splicing was shown to maintain a low overall rate of aberrant and non-canonically spliced events. Several previously unannotated splicing events across 3 exon|intron junctions in SMN1 were identified. Mutations within SMN exon 7 were shown to affect splicing fidelity by modulating RNA secondary structures, by altering the binding site of regulatory proteins and by changing the 5’ splice site strength. Mutations also create a truncated SMN1 exon 7 through the introduction of a de novo non-canonical 5’ splice site. The results from the ultra-deep sequencing approach highlight the impressive fidelity of pre-mRNA splicing and demonstrate that the immediate sequence context around splice sites is the main driving force behind non-canonical splice site pairing.

Keywords:

Sequence analysis – Mutation – Messenger RNA – Introns – RNA sequencing – Nucleotide sequencing – RNA splicing – Site selection

Introduction

Splicing is a complex process requiring hundreds of proteins to work in concert with proper regulation [1]. A pre-mRNA transcript from a single gene can be alternatively spliced to generate many mRNA variants. Differential pre-mRNA processing contributes significantly to genetic variability. It is estimated that transcripts from ~86–88% of multi-exon genes undergo alternative splicing [24]. Many mRNA isoforms are generated from a single gene as a result of splicing regulation, which may be caused by systemic feedback or tissue-specific expression of splicing regulators [5,6]. Other alternative mRNA isoforms may be the result of erroneous splice site pairing, also referred to as non-canonical splice sites, which may result in the generation of aberrant mRNA isoforms [7]. It has been shown that the most common form of non-canonical splice site activation occurs near the canonical splice site, mainly due to the U1 snRNP binding consensus sequence for 5’ splice sites or duplicate YAG trinucleotides near 3’ splice sites [8,9].

To avoid these errors, there are several safeguards to ensure splicing fidelity. Like transcription and translation, splicing has an active proofreading mechanism, while additionally relying on sequence information to guide the spliceosome through the process. Prp16 [10] and Prp22 [11,12] provide proofreading mechanisms for the first and second sequential transesterification reactions of splicing, and may even remodel the pre-mRNA to activate alternative splice sites [13]. Splicing regulatory element binding sites and the base-pairing of snRNPs to the pre-mRNA substrate lead to the selection of the correct splice sites based on optimal adjacent sequence contexts [14]. Even with these safeguards, splicing fidelity can be compromised when the sequence context for splice sites is suboptimal. All mRNA isoforms are subject to a number of quality control mechanisms, such as nonsense-mediated decay (NMD) [15,16], nonstop decay (NSD) [17], or no-go decay (NGD) [18,19], however, not all aberrant mRNA isoforms are removed through these processes and could be translated.

Owing to the importance of splicing regulation, a large number of mis-splicing or splicing errors can result in different diseases [20]. According to the Human Gene Mutation Database (HGMD release 2014.4), mutations that disrupt normal splicing have been estimated to account for up to a third of all disease-causing mutations [21]. It has been demonstrated that the spliceosome can pair constitutive exons with high fidelity with rates of rare and non-canonical splicing events as low as one in 20,000 [22,23]. Based on these studies, it was suggested that splicing accuracy is limited by Pol II transcription error rates [22,23]. These RT-qPCR based studies are inherently limited to resolution at the exonic level, investigating only single exon skipping events based on genome annotation. Using genome-wide RNA sequencing, similar rates of rare and non-canonical splicing events were observed [24], but it is still unclear whether these aberrant splicing events are the result of transcription errors, poor exon recognition mediated by weak splice sites and splicing regulatory elements, or whether these events are merely stochastic in nature. Additionally, the extent of aberrant mRNA splicing at the nucleotide level remains unknown. Using ultra-deep sequencing we determined the rates of rare and non-canonical splicing events for three splice junctions flanking exon 7 of SMN1 at single nucleotide resolution. We identified previously unannotated splice sites, a potential microexon, potential transcription error-mediated splicing events and the rate at which 5’ splice sites with their inherently susceptible U1snRNP binding site splice at positions 4 nucleotides upstream or downstream of the canonical splice site. Furthermore, we evaluated the effects that mutations in SMN exon 7 have on splicing fidelity.

Results

Dataset for ultra-deep analysis of splicing fidelity

To determine the rates of rare aberrant and non-canonical splicing events we used a recently published dataset [25] of SMN1 exon 7 inclusion rates based on a synonymous position mutation library in the well-studied SMN1 mini-gene, which spans exons 6–8 [2628] where exon 7 is included or excluded depending on splicing signals in the pre-mRNA (Fig 1). Neighboring codons in SMN exon 7 were mutated to every possible combination of silent mutations within the context of a sliding hexamer window, a minimal binding site for splicing regulatory proteins [25,29]. The resulting library of plasmids was transfected into HeLa cells and plasmid-specific mRNAs were analyzed by deep sequencing. The data obtained from these library transfections were previously used to determine if synonymous mutations in exon 7 influence splicing. This study also resulted in the several million-fold sequencing of three exon|intron splice site junctions, SMN1 exon6|exon7, exon7|exon8, and exon6|exon8. This extensive sequencing data allowed for an ultra-deep detection of low abundance local isoforms, including rare stochastic and non-stochastic splicing outcomes described below. Our observations and calculations of splicing fidelity are based on the wild-type SMN1 mini-gene.

Mutation scheme for SMN1 exon 7 library.
Fig. 1. Mutation scheme for SMN1 exon 7 library.
The SMN1 mini-gene construct consists of exon 6, exon 7, and exon 8 with shortened introns 6 and 7. All possible silent mutations in exon 7 were created within the context of a sliding hexamer window. For example, the first two codons depicted are GGT TTC. All three mutations were made in GGT resulting in GGN and combined with all silent mutations in TTC (TTT), resulting in eight combinations including the wild-type sequence. A transition 6C→T results in SMN1 exon 7 skipping.

Total rate of aberrant splicing events

There are several explanations why splicing can occur in a non-canonical way. Here, deviations from the expected canonical exon 7 inclusion splicing pathway (referred to as non-canonical splicing) could be the consequence of imperfections in the generation of the SMN1 mini-gene mutation library, pre-splicing transcription errors, sequencing errors, or the activation of rarely used splice sites, such as de novo splice sites or the selection of microexons. The ultra-deep sequencing of the SMN1 mini-gene highlights several clear-cut examples of non-canonical splice site selection, albeit at a very low rate. Out of a total of 6,469,446 wild-type SMN1 exon 7 reads 20,505 contained an unexpected alternatively spliced event at either the exon6|exon7 or the exon7|exon8 junction, for a raw aberrant splicing event rate of 3.2E-03 or 1 aberrantly spliced event for every 315 splicing events. At first glance this is a higher rate than that of other gene expression steps, transcription and translation, each of which are characterized by error rates as low as 1.0E-05 [3032]. Further examination of the dataset revealed that not every observed deviation from canonical splicing could be counted as a result of aberrant splicing fidelity.

Control for sequencing errors

In addition to sequencing the RNA generated from the SMN mini-gene mutation library, the transfected SMN mini-gene DNA constructs were sequenced themselves [25]. The DNA library sequencing served as a control to demonstrate that sequence differences detected in the mRNA reads are due to RNA generation and processing [25]. The most common sequence deviation from the RNA pool was the deletion of a single guanosine from a GGG triplet at the exon6|exon7 junction at a frequency of 1.1E-03, accounting for nearly 1/3 of the total identified aberrant splicing events. However, the same deletion occurred within a GGG triplet at a nearly identical frequency at the intron6|exon 7 junction in their DNA counterparts (1.0E-03) (Fig 2). Similarly, a single guanosine insertion at this same site, producing a GGGG motif, occurs in 247 DNA reads and 320 RNA reads at rates 8.4E-05 and 5.0E-05, respectively (Fig 2). These observations strongly suggest that these single guanosine insertions and deletions derive from errors independent of splicing. Importantly, there were no errors in the DNA input reads that resulted in insertions or deletions of multiple consecutive nucleotides. We conclude that any RNA output reads with 2 or more nucleotides consecutively inserted or deleted are attributable to pre-mRNA processing errors or pre-existing sequence variations introduced in the library during its construction.

Analysis of aberrant splicing rates in DNA and RNA reads by error length.
Fig. 2. Analysis of aberrant splicing rates in DNA and RNA reads by error length.
To detect possible reading frame preservation bias in aberrant splicing events, an analysis of splicing aberrant splicing rates in DNA and RNA by insertion or deletion length in wild-type reads was performed. For example, the insertion of 4 nucleotides GTAA at the 5’SS of exon 7 and the insertion of 4 nucleotides ACAG at the 3’SS of exon 8 are combined as a total aberrant splicing rate for the insertion of 4 nucleotides. There is no consensus error length.

Control for plasmid generated errors

The sequencing of the transfected SMN mini-gene library also serves as a control to demonstrate that errors and aberrantly spliced events identified in the processed RNA reads are due to the generation and processing of the RNA, namely transcription and splicing, and were not already present in the DNA template. Due to the size constraints of the sequencing protocol used (100 nucleotide read length) and the location of the input DNA primers [25], we were only able to estimate the plasmid error rate for the region that was flanked and amplified by the DNA primers used. This region consists of exon7 and the adjacent 6 upstream and 10 downstream nucleotides (Fig 3). While erroneous mutants that arose from errors in the generation of SMN mini-gene DNA construct do exist, they occur at a low rate, averaging 3.0E-04 (Fig 3 and Table 1). Although infrequent, these library construction imperfections limit the sensitivity of aberrant splicing detection.

Mutation rate by position in DNA input reads.
Fig. 3. Mutation rate by position in DNA input reads.
A heatmap representing positional error rates in DNA input reads that lie within the 15 nucleotide primers on either side of the amplified section that was sequenced (100 nucleotides total). Positions within exon 7 that were purposely mutated in our library construction were omitted and set to zero.
Tab. 1. Average error rate—Input DNA.
Average error rate—Input DNA.

Non-canonical 3’-splice site usage

An example of abundant non-canonical splicing observed is the selection of an unannotated 3’ splice site 27 nucleotides upstream of the canonical intron7|exon8 3’ splice site (AG/CCTCTGGN10…CAG|GA…; where the non-canonical splice site is designated by a “/” and the canonical splice site is represented by a “|”) (Fig 4A). This novel splice site is used at a frequency of 1.5E-03 (Table 2) and it is characterized by a canonical AG dinucleotide that defines the 3’ end of nearly every intron in metazoans [33]. However, a poorly defined upstream polypyrimidine tract prevents extensive usage of this non-canonical 3’ splice site (maximum entropy score (MES) = -1.62) [34]. At a splice site usage rate of 1 in 680 transcripts, this non-canonical splicing event is rare enough that it is only readily discovered using ultra-deep sequencing. The upstream location relative to the canonical 3’ splice site polypyrimidine tract suggests that this splice site is acting independent of the canonical 3’ splice site.

Non-canonical splice site usage.
Fig. 4. Non-canonical splice site usage.
A) Non-canonical 3’SS usage between exon 7 and exon 8. The green line represents the unannotated non-canonical 3’ SS event 27 nucleotides upstream of the canonical intron7|exon8 junction. The red dashed line represents the non-canonical splicing observed that is due to transcription or library generation errors resulting in canonical AG dinucleotide sequences. B) Non-canonical 3’ SS usage between exon 6 and exon 7. The green line represents the unannotated non-canonical 5’ SS event at position 8 in exon 7. The red dashed line represents the non-canonical splicing observed that is due to transcription or library generation errors. C) Non-canonical 5’ SS usage between exon 7 and exon 8. The green line represents the usage of the intrinsic non-canonical 5’ SS. D) Non-canonical 5’ SS usage between exon 6 and exon 7. The green line represents the usage of the intrinsic non-canonical 5’ SS.
Tab. 2. Non-canonical 3’ splice site usage.
Non-canonical 3’ splice site usage.

At the same intron7|exon8 3’ splice site two additional lower frequency insertions were observed. In 376 cases (aberrant splicing event rate = 5.9E-05, Fig 4A) the ligation of exon 7 and exon 8 took place 6 nucleotides upstream of the canonical 3’ splice site (AT/TTGCAG|GAA). The sequence upstream of this non-canonical splice site is an AT dinucleotide, instead of the requisite AG. Thus, the low observed frequency of this non-canonical 3’ splice site selection could be the consequence of selecting a poorly defined AT/TT junction (MES = -1.72), it could have arisen by rare nucleotide mis-incorporation mediated by elongating pol II to change the junction to AG/TT (MES = 6.87), or it could be the consequence of low-level nucleotide variations intrinsic to the mutation library that would also give rise to pre-mRNAs characterized by the improved AG/TT 3’ splice site junction.

To distinguish between these possibilities, we compared the frequency at which alternative nucleotides were detected at invariable nucleotide positions across exon7 of the SMN1 mutant library (Fig 3). On average, the library generation resulted in T to G nucleotide changes at a frequency of 1.1E-04 (Fig 3 and Table 1). Using this frequency as a measure for background noise, it is impossible to assign the (AT/TTGCAG|GAA) non-canonical splicing event to any other cause but library generation defects.

At the same intron7|exon8 junction, we also observe 238 similar occurrences (aberrant splicing event rate = 3.7E-05, Fig 4A and Table 2) where a possible nucleotide change 4 nucleotides upstream of the canonical 3’ splice site (AT/CAG|GAA → AG/CAG|GAA) drastically changes the favorability of the splice site, (MES = -5.33 → 3.04). These nucleotide changes would create non-canonical 3’ splice sites that approximate those observed in annotated EST databases [8]. Using the same arguments as above, it is most likely that the (AT/CAG|GAA) non-canonical splicing event is observed because of library imperfections.

Analysis of 3’ non-canonical splicing at the exon6|exon7 junction highlights two events that are represented with reasonable frequency. With 825 supporting reads (aberrant splicing event rate = 1.3E-04, Fig 4B and Table 2) where the activation of a 3’ splice site 8 nucleotides downstream of the canonical 3’ splice site AG|GGTTTCAG/AC is activated resulting in a truncated exon 7. This non-canonical 3’ splice site has a low MES (0.35), yet its activation does not rely on nucleotide changes at the new spliced junction. Based on these considerations it is likely that non-canonical AG selection intrinsic to the SMN1 wild-type sequence context mediates this non-canonical splicing event.

A lower frequency event is represented by 202 supporting reads (aberrant splicing event rate = 3.1E-05, Fig 4B and Table 2) where a non-canonical 3’ splice site (AT/TTTCCTTACAG|GGT) was selected for the intron6|exon7 junction 11 nucleotides upstream of the canonical 3’ splice site. The wild-type sequence upstream of this non-canonical splice site is an AT dinucleotide, instead of the requisite AG, again arguing that the selection of this sequence as a non-canonical splice site is likely a consequence of library imperfections.

Intrinsic 5’-splice site fidelity—The U1snRNP binding site conundrum

Previous in silico sequence analyses have shown that 5’ splice sites are often subject to non-canonical splice site activation 4 nucleotides upstream or downstream from the canonical splice site due to the presence of the U1snRNP binding sequence (AG|GURAGU), which commonly includes a GU dinucleotide 4 nucleotides downstream from the canonical splice site [8]. 491 supporting reads reveal the activation of an intrinsic non-canonical exon7 5’ splice site (GA|GTAA/GTCTGC) (aberrant splicing event rate = 7.7E-05, Fig 4C and Table 3). While the canonical 5’ splice site is a reasonably strong splice site (MES = 8.57), it should be highly favored compared to this downstream intrinsic 5’ splice site (MES = -7.82).

Tab. 3. Non-canonical 5’ splice site usage.
Non-canonical 5’ splice site usage.

The selection of the non-canonical 5’ splice site at exon6|intron6 (TG|GTAA/GTAATC) was observed only 32 times (aberrant splicing event rate = 5.0E-06, Fig 4D and Table 3). This lower rate in non-canonical 5’ splice site activation may be explained by a stronger canonical 5’ splice site (MES = 11.01).

Microexon discovery in SMN1

Another type of rare splicing variants observed are microexons. Many microexons, 3 to 30 nucleotides long, have not been annotated because of their rarity and size [35]. The ultra-deep sequencing analysis identified the presence of a microexon contained within intron 6. The microexon AG/ATCTGGG/GTAATGT is located 210 nucleotides upstream of the intron6|exon7 junction, and it was detected in 18 reads (2.8E-06). The microexon is flanked by weak splice sites (3’ splice site, MES = 0.99; 5’ splice site, MES = 4.85) and its usage does not rely on nucleotide changes at the splice junctions. Interestingly, this microexon uses the same 5’ splice site as the recently discovered cryptic exon 7a [36]. Thus, it is possible that the generation of the intron 6 microexon is an alternative splicing pathway in the generation of cryptic exon 7a.

The influence of exon mutations on splicing fidelity

The library analyzed was created with synonymous mutations at all possible positions within a six-nucleotide window throughout SMN exon 7 [25]. Using the splice efficiency results from this mutant library we tested the hypothesis that positional mutants alter canonical splice-site usage by increasing or decreasing non-canonical splice-site usage. To identify mutations that preferentially influence non-canonical splice-site usage, we focused only on high incidence non-canonical splice events and compared the Non-canonical Splicing Value with the published Inclusion Index Value [25]. The result of this comparison is referred to as the Mutant Influence Value.

Non-canonical 3’ splice site activation at the exon7|exon8 junction

A frequently used non-canonical splice site is located upstream of the canonical intron7|exon8 3’ splice site (Fig 4A). Multiple mutations within exon 7 result in altered non-canonical splice site usage, either increasing or decreasing its selection (Table 4). A significant decrease in non-canonical 3’ splice site usage occurs with the mutants 54A→C or 54A→G, which reside within the exon 7 5’ splice site. Combinations of these mutations with 50A→G result in similar effects. Other combinatorial mutations in the region 39 through 45 generally lead to further decreases in non-canonical splice site usage (Table 4 and Fig 5A). Conversely, the combinatorial mutation 3T→G+6C→T results in greater non-canonical 3’ splice site usage. Interestingly, the most influential mutants identified cluster to either the 5’ or the 3’ end of exon 7.

Tab. 4. Mutant influence on splicing fidelity—Non-canonical 3’ splice site AG/CCTCTGGN10…CAG|GAA.
Mutant influence on splicing fidelity—Non-canonical 3’ splice site AG/CCTCTGGN<sub>10</sub>…CAG|GAA.
Mutant influence on non-canonical splice site usage.
Fig. 5. Mutant influence on non-canonical splice site usage.
A) Mutant influence on non-canonical 3’SS usage between exon 7 and exon 8. The green line represents canonical SS usage, while the red line represents non-canonical 3’ SS usage 27 nucleotides upstream of the canonical intron7|exon8 junction. Mutations at positions in green influence more canonical SS usage, while mutations at position in red influence more non-canonical SS usage. B) Mutant influence on non-canonical 3’ SS usage between exon 6 and exon 7. The green line represents canonical SS usage, while the red line represents non-canonical 5’ SS usage 4 nucleotides downstream of the canonical intron6|exon7 junction. Mutations at positions in green influence more canonical SS usage. C) Mutant influence on non-canonical 5’ SS usage between exon 7 and exon 8. The green line represents canonical SS usage, while the red line represents non-canonical 5’ SS usage 4 nucleotides downstream of the canonical exon7|intron7 junction. Mutations at positions in green influence more canonical SS usage. D) Mutant at position 27 creates 5’SS. The green line represents canonical SS usage, while the red line represents non-canonical 5’SS usage. Mutations at positions in red influence more non-canonical 5’ SS usage.

Non-canonical splicing (GGTTTCAG deletion) at the intron6|exon7 junction

When compared to wild-type non-canonical 3’ splicing at the intron6|exon7 junction AG|GGTTTCAG/AC is significantly reduced by the exon 7 mutations 28A→C, 30G→A, and 45A→G (Table 5 and Fig 5B). As was observed for other non-canonical splicing events, combinations of mutants generally preserve the overall effect single mutants have. By contrast, no mutant significantly increased non-canonical splicing compared to wild-type.

Tab. 5. Mutant influence on splicing fidelity—3’ splice site AG|GGTTTCAG/ACA.
Mutant influence on splicing fidelity—3’ splice site AG|GGTTTCAG/ACA.

Non-canonical splicing (GTAA insertion) at the exon7|intron8 junction

Another abundant non-canonical splice site that is affected by mutation is the retention of GTAA by activation of an intrinsic 5’ non-canonical splice site (GA|GTAA/GTCTGC) for exon 7. A significant decrease in non-canonical 5’ splice site usage occurs with the combinatorial mutant 50A→G + 54A→G (Table 6 and Fig 5C).

Tab. 6. Mutant influence on splicing fidelity—5’ splice site GA|GTAA/GTCTG.
Mutant influence on splicing fidelity—5’ splice site GA|GTAA/GTCTG.

Mutations create a highly efficient de novo cryptic 5’ splice site

By far the most abundant non-canonical splicing example in the dataset analyzed is the truncation of exon 7 to the first 25 nucleotides. This 5’ non-canonical splicing event (AAG/GAAGGT) is represented by 309,793 reads containing a truncated exon 7 with no other mutations (Table 7 and Fig 5D). However, at a splice site usage rate of 1 in ~21 wild-type transcripts, this non-canonical splicing event is common enough that in the absence of mutation, it should have been readily discovered and annotated without using ultra-deep sequencing. This non-canonical splicing event is most likely the result of one or more mutations downstream of position 25. An analysis of the effects of other mutations on this truncated exon 7 was performed. The difficulty in this analysis is that any mutation that occurs at the 26th through the 54th positions of exon 7 cannot be accurately assessed, as the mutation will be omitted from the read as a result of the exon 7 truncation. The second obstacle in the analysis is that the wild-type reads cannot be used as the comparative baseline. To circumvent these limitations read counts were normalized to the splicing neutral 9A→G [37]. Normalization to splicing neutral 15A→C provided similar results (not shown). The mutation 24A→G occurred in far more truncated exon 7 reads (9,624) than any other mutant (Table 7 and Fig 5D). But the combinatorial mutation 21A→G+24A→G did not occur at an increased rate in truncated exon 7 reads, when compared to splicing neutral mutations 9A→G and 15T→C. These observations suggest that the truncated exon 7 reads containing the mutation 24A→G are actually the result of the 24A→G+27A→T combinatorial mutant (and probably to a lesser extent 24A→G+27A→C), as these mutations exhibit the same exon 7 inclusion behavior as the single mutations 27A→T and 27A→C in a previous study [25].

Tab. 7. Mutation-derived non-canonical 5’ splice site at position 27 results in truncated exon 7.
Mutation-derived non-canonical 5’ splice site at position 27 results in truncated exon 7.

Non-canonical splicing during skipping of exon 7

The mini-gene used in our analysis also generates mRNA transcripts with skipped exon 7. We recovered ~4 million reads that include exon 6 and exon 8 but skip exon 7 in the dataset analyzed. While we are unable to directly validate the junctions observed in these skipped exon 7 reads, they can be compared to the exon 7 inclusion reads. In general, exon 7 exclusion events contain the same rare and non-canonical splicing events at similar rates. The non-canonical 3’ splice site upstream of exon 8 (AG/CCTCTGGN10…CAG|GA…) is observed at a frequency identical to the usage rate seen for exon 7 inclusion events (1.5E-03 vs 1.5E-03) (Tables 2 and 8, Fig 6A). Other non-canonical splice site selection events at the exon 6|intron 6 or the intron7|exon8 junctions are also observed at similar frequencies (Tables 2, 3 and 8, Fig 6A and 6B). A surprising result in the analysis of exon 7 exclusion transcripts was the discovery of a GA dinucleotide frequently inserted between exon 6 and exon 8 (1257 occurrences, aberrant splicing event rate = 3.0E-04, Table 5). As this microexon event occurs at a higher rate than our established library construction error rate, it is highly likely that this event is the result of non-canonical splice site usage (Fig 6C).

Non-canonical splice site usage with excluded exon 7.
Fig. 6. Non-canonical splice site usage with excluded exon 7.
A) Non-canonical 3’SS usage between exon 6 and exon 8. The green line represents the unannotated non-canonical 3’ SS event 27 nucleotides upstream of the canonical intron7|exon8 junction. The red dashed line represents the non-canonical splicing observed that is due to transcription or library generation errors resulting in canonical AG dinucleotide sequences. B) Non-canonical 5’ SS usage between exon 6 and exon 8. The green line represents the usage of the intrinsic non-canonical 5’ SS. C) Recursive splicing resulting in GA dinucleotide insertion between exon 6 and exon 8.
Tab. 8. Skipped exon 7 cryptic splicing.
Skipped exon 7 cryptic splicing.

Discussion

Non-canonical splicing detection is limited by sequence context and sequencing accuracy

The spliceosome has evolved several mechanisms to ensure that the intron/exon junction is selected efficiently and accurately. Base pairing between U1 snRNA and the 5’ splice sites ensures proper selection of the exon/intron junction. In this process LUC7, a component of yeast U1 snRNP, stabilizes its interaction with the 5’ splice site [38]. During spliceosomal assembly active proofreading mechanisms through the activities of Prp16 [10] and Prp22 [11,12] ensure that the first transesterification reaction of splicing is carried out faithfully. At the 3’ splice site interactions of U2AF with the polypyrimidine tract and U2 snRNP with the branch site are essential for the recognition of the intron/exon junction [39]. While the small subunit of U2AF can initially interact with the AG dinucleotide at the 3’ splice site, hSlu7 and SPF45 select the AG dinucleotide prior to the second step of splicing [40,41] while Prp8 conformational changes align the upstream exon with the AG dinucleotide in the active site [42]. Together, these selection mechanisms ensure the two-step excision of introns and ligation of exons is carried out accurately.

The most common spliced sequence deviation identified in the dataset analyzed was the deletion of a single guanosine from a GGG triplet at the exon6|exon7 junction. However, this G insertion likely reflects sequencing errors because G insertions observed in control DNA reads occurred at an identical frequency. The GGG sequence context is more prone to Illumina HiSeq sequencing errors when compared to genome-wide observations [43], thus explaining the abundance of errors observed. These considerations support the idea that the numerous guanosine insertion/deletion events found at the intron6|exon7 junction are mainly attributable to sequencing errors inherently associated with GGG motifs.

Within the context of the mini-gene used the most abundant non-canonical splice sites used are located close to 3’ splice sites. This may be due to the two-step process of 3’ splice site selection described above. It is possible that these consecutive selection steps at the 3’ splice site are more prone to spurious splice site selection when compared to the base pairing guided selection of the 5’ splice site.

Despite the sequencing depth across the three junctions evaluated, we did not find evidence for multiple non-canonical splice events within a single transcript. This observation suggests that upstream non-canonical splicing does not increase the likelihood of downstream non-canonical splicing, at least as far as our mini-gene approach can decipher. Considering the locations of non-canonical splicing, the results of the ultra-deep sequencing analysis demonstrate that non-canonical splicing usage is constrained by the sequence surrounding putative splice sites and that non-canonical splicing is more likely to occur at 3’ splice sites.

The reading frame is not preserved by non-canonical 5’ splice site selection

It was argued that the SMN1 mini-gene used for generating the ultra-deep sequencing dataset is not subject to mRNA surveillance methods such as NMD, as it lacks the features of a full-length coding transcript [16,25]. This independence from of frameshift considerations allowed the generation and detection of all possible non-canonical splicing events. As expected from the independence of NMD, we do not detect a direct reading frame preference for the aberrant splicing events observed. Thus, the splicing events observed, non-canonical or canonical, are all a result of a contextual sequence dependent process not beholden to frame preservation.

Previous work postulated that the activation of downstream non-canonical 5’ splice sites at the +4 position are subjected to less selective pressures because they are not part of the coding region. The results from our analysis are in agreement with this notion. Only non-canonical 5’ splice sites located within the canonical intronic region were selected at a statistically significant level. Within the native context of SMN1 exon 7, an intronic +4 non-canonical 5’splice site will be positioned downstream of the designated stop codon, which is located in exon 7. Thus, the lengthening of exon 7 by 4 nucleotides through non-canonical splicing would not disrupt the coding region of the gene, nor would the non-canonical transcript isoform be subject to NMD. These considerations suggest that evolutionary pressures to prevent non-canonical +4 splicing at the exon 7 5’ splice site are less stringent. Indeed, our findings support this hypothesis because the number of non-canonical 5’ spliced events at exon 6, which is entirely coding, is 10-fold less frequent than those detected for exon 7.

Skipped exon 7

While the preferential usage of non-canonical splicing is the result of several different mutant influences, the fact that they occur at similar frequencies in exon 7 inclusion and exclusion types suggests that splice site selection occurs at each splice site independently. The most surprising result in the analysis of exon 7 exclusion transcripts was the discovery of a GA dinucleotide frequently inserted between exon 6 and exon 8 (Table 8 and Fig 6C). While there are 66 instances of GA dinucleotides in the intervening sequence between exon 6 and exon 8, only four are flanked by a GT dinucleotide (GA/GT) synonymous with 5’ splice sites. Additionally, an AG dinucleotide, generally associated with 3’ splice sites (AG/GA), positioned next to the GA dinucleotide occurs five times. Only one of these prospective GA dinucleotides is flanked by both an AG at the potential 3’ splice site and a GT at the presumed 5’ splice site (AG/GA/GT). This single prospective GA dinucleotide uses the canonical 5’ splice site for exon 7, while the hypothetical 3’ splice site generates a respectable splice site strength score (MES = 4.28). Thus, while this GA dinucleotide insertion event cannot be unambiguously mapped, the hypothetical splice site evidence flanking the singular AG/GA/GT site strongly suggests that this microexon is a result of the non-canonical splicing of exon 7. This unexpected isoform may be generated through a form of recursive splicing, where all but the last two nucleotides of exon 7 are lost to a non-canonical 3’ splice site (AAG/GA|GTAAGT) contained at position 52. It is possible that this non-canonical 3’ splice site is selected and once intron 6 and the first 52 nucleotides of exon 7 are excised, the last two nucleotides of exon 7 are ligated to exon 6 and redefined as part of a 5’ splice site prior to removal of intron 7.

Splicing fidelity is a sequence driven process influenced by splicing efficiency

Three abundant non-canonical splice sites altered usage rates when tested in mutant contexts. Mutational effects on non-canonical 3’ splicing at the exon7|exon8 junction (Table 4 and Fig 5A) can be explained by three separate factors that are expected to alter the splicing efficiency. The first factor is the manipulation of splicing regulatory elements within exon 7. The exon 7 mutant 3T→G+6C→T was implicated as a mutation that affects non-canonical splice site usage and greatly reduces exon 7 inclusion [25]. It is well known that the mutation 6C→T in SMN1 results in decreased inclusion of exon 7 [26,44] by modulating the binding affinity of splicing regulators [45,46]. Combinatorial mutations at position 3T→A or G synergize with 6T to further decrease exon 7 inclusion levels [25,47]. What is interesting is that this mutation, which is located near the 5’ end of exon 7, has such a marked effect on downstream non-canonical 3’ splice site usage. These observations suggest that elevated non-canonical splicing during intron 7 removal is triggered by inefficient exon 7 removal, a demonstrated consequence of the 3T→G+6C→T mutation.

The second factor affecting non-canonical splicing at the exon7|exon8 junction is RNA secondary structure. All significant combinatorial mutations located between 39 and 45 lie within the reported exon 7 inhibitory terminal stem-loop 2 (TSL2) [28]. All mutations are predicted to simply disrupt the stability of this inhibitory RNA hairpin [28,48], thereby promoting the canonical splicing pathway (Fig 5A).

The third factor involved in the mutational influence on non-canonical 3’ splicing at the exon7|exon8 junction is the direct altering of splice site strength. The reduced selection of non-canonical splicing for mutations 54A→C and 54A→G (Table 4 and Fig 5A) can be explained by an increase in exon 7 5’ splice site strength (wild-type MES = 8.57 increases to 54A→C (MES = 9.39) and 54A→G (MES = 9.65)). The strengthening of the 5’ splice site on exon 7 provides increased exon definition, thus favoring the canonical pathway.

Non-canonical 3’ splice site selection at the intron6|exon7 junction can be explained by the same three factors, modulation of SREs, changes in RNA secondary structure and changes in splice site strength (Table 5 and Fig 5B). Mutations at nucleotide positions 28–30 reside within a conserved tract of exon 7 [28] that is directly adjacent to a Tra2-β1 binding site [49], a splicing enhancer [50,51]. Disruption of the TSL2 RNA secondary structure by combinatorial mutations between positions 42 and 50 generally decreases non-canonical splicing at the exon6|exon7 junction, presumably because these mutations increase the efficiency of canonical splicing. Similarly, splice site mutations (positions 50–54) increase canonical splicing and decrease non-canonical splicing.

The only mutation that significantly influences non-canonical 5’ splice site usage in exon 7 is 50A→G+54A→G (Table 6 and Fig 5C). Its appearance as a mutation significantly altering non-canonical 5’ splice site usage indicates the importance of splice site strength on splicing efficiency and its effect on splicing fidelity. As was argued above, increasing the strength of the canonical 5’ splice site results in greater inclusion of exon 7, and decreased usage of the non-canonical splice site.

In summary, changes in the relative usage of non-canonical splice sites can be explained by its inverse relationship with the efficiency of canonical splicing. Mutations that increase canonical splice site selection reduce non-canonical splice site activation. By contrast, mutations that reduce canonical splice site recognition increase non-canonical splice site selection by modulation of SREs, changes in RNA secondary structure and changes in splice site strength.

Mutation at position 27 creates a highly efficient non-canonical 5’ splice site

While the influence of the previously mentioned mutations on non-canonical splicing are straightforward, the most abundant non-canonical splicing event is more difficult to decipher. Although we detected 309,793 reads as having a non-canonical splice site at position 25 resulting in a truncated exon 7, this non-canonical splice event was not a previously annotated splice site. We propose that this non-canonical splicing event is the result of the creation of a de novo non-canonical 5’ splice site by the mutation 27A→T (Table 7 and Fig 5D), which creates a strong 5’ splice site AAG/GTAGGT (MES = 10.29). These results demonstrate how splicing efficiencies can be radically changed under the influence of a single mutation.

In summary, the ultra-deep sequencing analysis provides evidence for the notion that the balance between non-canonical and canonical splicing is determined by their relative usage efficiency, a concept that extends beyond the SMN1 model used here. Several factors, including splice site strength, binding sites for splicing regulators and RNA secondary structure play an active role in establishing a preference between competing splicing kinetics. Single mutations within exons can significantly disturb this balance, leading to non-canonical splice site activation within the exon affected or its flanking introns.

Materials and methods

Cell culture, sequencing library preparation

The creation and sequencing of the mutant library was executed by our lab [25], in brief, HeLa cells were used for creation and transfection of the SMN1 Exon 7 mutant library. These were maintained in monolayer at 37°C in Dulbecco’s high glucose modified Eagle’s medium (Invitrogen) supplemented with 10% fetal bovine serum, 4mM L-Glutamate, and 1mM Na-Pyruvate. Cell confluence was maintained at ~80% or less before splitting cells. Cells were transfected according to manufacturer’s specifications for Lipofectamine 2000 (Invitrogen) for plate sizes of 10cm, 15cm, and 6-well plates with 3cm wells.

Bioinformatic analysis of splicing fidelity for SMN1 mini-gene

We obtained 54,780,073 single-end reads of 100 nucleotides from the sequencing run. These reads were aligned to a custom index consisting of genomic SMN1 exons 6 to 8, spliced mRNA sequence consisting of exons 6, 7, and 8, and all exon 7 mutants that were introduced into the library. The reads were classified as either input reads, which would be the unprocessed DNA based reads, and output reads which would comprise all reads that were sequenced from processed mRNAs, which would include undergoing transcription and splicing. We used custom Python scripts to identify reads with wild-type exon 7 and all mutant exon 7 types. Regular expression search functions were employed to search for multiple anchor sequences associated with exon 6, exon 7, exon 7 mutants, and exon 8. In order to determine the existence of aberrant splicing events we checked each read that contained these anchor sequences for either unexpected additional sequence inserted between exon anchor sequences or sequence deleted from the expected anchor sequences that result in partial anchor sequences. Aberrant splicing event rates were calculated as percentages where the total number of normal reads divided by the total number of reads that contained each distinct event. Reads where anchor sequences for exon 6 and exon 8 were found, but no anchor sequences from exon 7 were considered to be non-canonically spliced reads where exon 7 was excised. Reads that did not contain anchor sequences or with multiple quality score based errors resulting in ambiguous nucleotides were discarded.

Mutation position effects on splicing fidelity

There are several examples of mutations that increase non-canonical splicing. However, many of these mutations also increase wild-type exon 7 inclusion levels, confounding the number of increased non-canonical splice site usage reads with an increased total number of exon 7 inclusion reads. For instance, mutations at positions 42C→T+43T→C+45A→G has a Non-canonical Splicing Value of 2.0 for splicing of AG/CCTCTGGN10…CAG|GA at the intron7|exon8 junction. However, this same set of mutations is also responsible for a 2.1-fold increase in exon 7 inclusion according to its Inclusion Index Value [25]. Therefore, the increased number of reads containing non-canonical splicing compared to wild-type is inherently tied to the increase of exon 7 inclusion by this same set of mutations.

To calculate the effect of mutations on splicing fidelity, taking into account the rarity of events, we utilized odds ratios (OR) to determine those mutations that significantly change the ratios of aberrant splicing events compared to the aberrant splicing event rates observed in the wild-type SMN exon 7. We calculated the OR for each SMN1 exon 7 mutant type by taking the rate of each distinct non-canonical splicing event and divided it by the rate that the corresponding non-canonical splicing event occurs in wild-type SMN1 exon 7. This we refer to as the Non-canonical Splicing Value. To normalize the influence of exon 7 inclusion rates on the Non-canonical Splicing Value, we took the difference between the published Inclusion Index Value [25] creating the Mutant Influence Value. We then took the absolute value of the Mutational Influence Value and calculated the standard error. A z-score statistic was calculated and used to determine the p-value for the difference between the Non-canonical Splicing Value and the Inclusion Index Value. To account for multiples testing problems, the Benjamini-Hochberg procedure was used at a level of 0.2, to control the false discovery rate. Additionally, a minimum of 10 non-canonically spliced reads (based on the wild-type non-canonical splicing rate) threshold was imposed to avoid outsized conclusions based on small sample size.


Zdroje

1. Wahl MC, Will CL, Lührmann R. The Spliceosome: Design Principles of a Dynamic RNP Machine. Cell. 2009;136: 701–718. doi: 10.1016/j.cell.2009.02.009 19239890

2. Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet. 2008;40: 1413–5. doi: 10.1038/ng.259 18978789

3. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, et al. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008;456: 470–476. doi: 10.1038/nature07509 18978772

4. Chen L, Bush SJ, Tovar-Corona JM, Castillo-Morales A, Urrutia AO. Correcting for Differential Transcript Coverage Reveals a Strong Relationship between Alternative Splicing and Organism Complexity. Mol Biol Evol. 2014;31: 1402–1413. doi: 10.1093/molbev/msu083 24682283

5. Kornblihtt AR, Schor IE, Alló M, Dujardin G, Petrillo E, Muñoz MJ. Alternative splicing: a pivotal step between eukaryotic transcription and translation. Nat Rev Mol Cell Biol. 2013;14: 153–165. doi: 10.1038/nrm3525 23385723

6. Barbosa-Morais NL, Irimia M, Pan Q, Xiong HY, Gueroussov S, Lee LJ, et al. The evolutionary landscape of alternative splicing in vertebrate species. Science. 2012;338: 1587–93. doi: 10.1126/science.1230612 23258890

7. Buratti E, Chivers M, Královičová J, Romano M, Baralle M, Krainer AR, et al. Aberrant 5’ splice sites in human disease genes: Mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization. Nucleic Acids Res. 2007;35: 4250–4263. doi: 10.1093/nar/gkm402 17576681

8. Dou Y, Fox-Walsh KL, Baldi PF, Hertel KJ. Genomic splice-site analysis reveals frequent alternative splicing close to the dominant splice site. RNA. 2006;12: 2047–2056. doi: 10.1261/rna.151106 17053087

9. Tsai K-W, Chan W-C, Hsu C-N, Lin W-C. Sequence features involved in the mechanism of 3’ splice junction wobbling. BMC Mol Biol. 2010;11: 34. doi: 10.1186/1471-2199-11-34 20459675

10. Koodathingal P, Novak T, Piccirilli JA, Staley JP. The DEAH Box ATPases Prp16 and Prp43 Cooperate to Proofread 5′ Splice Site Cleavage during Pre-mRNA Splicing. Mol Cell. 2010;39: 385–395. doi: 10.1016/j.molcel.2010.07.014 20705241

11. Mayas RM, Maita H, Staley JP. Exon ligation is proofread by the DExD/H-box ATPase Prp22p. Nat Struct Mol Biol. 2006;13: 482–490. doi: 10.1038/nsmb1093 16680161

12. Semlow DR, Staley JP. Staying on message: ensuring fidelity in pre-mRNA splicing. Trends Biochem Sci. 2012;37: 263–273. doi: 10.1016/j.tibs.2012.04.001 22564363

13. Semlow DR, Blanco MR, Walter NG, Staley JP. Spliceosomal DEAH-Box ATPases Remodel Pre-mRNA to Activate Alternative Splice Sites. Cell. 2016;164: 985–998. doi: 10.1016/j.cell.2016.01.025 26919433

14. Nilsen TW, Graveley BR. Expansion of the eukaryotic proteome by alternative splicing. Nature. 2010;463: 457–463. doi: 10.1038/nature08909 20110989

15. Maquat LE. Nonsense-mediated mRNA decay: splicing, translation and mRNP dynamics. Nat Rev Mol Cell Biol. 2004;5: 89–99. doi: 10.1038/nrm1310 15040442

16. Kurosaki T, Maquat LE. Nonsense-mediated mRNA decay in humans at a glance. J Cell Sci. 2016;129: 461–467. doi: 10.1242/jcs.181008 26787741

17. Szádeczky-Kardoss I, Csorba T, Auber A, Schamberger A, Nyikó T, Taller J, et al. The nonstop decay and the RNA silencing systems operate cooperatively in plants. Nucleic Acids Res. 2018;46: 4632–4648. doi: 10.1093/nar/gky279 29672715

18. Doma MK, Parker R. Endonucleolytic cleavage of eukaryotic mRNAs with stalls in translation elongation. Nature. 2006;440: 561–564. doi: 10.1038/nature04530 16554824

19. Inada T. Quality control systems for aberrant mRNAs induced by aberrant translation elongation and termination. Biochim Biophys Acta—Gene Regul Mech. 2013;1829: 634–642. doi: 10.1016/j.bbagrm.2013.02.004 23416749

20. Scotti MM, Swanson MS. RNA mis-splicing in disease. Nat Rev Genet. 2016;17: 19–32. doi: 10.1038/nrg.2015.3 26593421

21. Daguenet E, Dujardin G, Valcarcel J. The pathogenicity of splicing defects: mechanistic insights into pre-mRNA processing inform novel therapeutic approaches. EMBO Rep. 2015;16: 1640–1655. doi: 10.15252/embr.201541116 26566663

22. Fox-Walsh KL, Hertel KJ. Splice-site pairing is an intrinsically high fidelity process. Proc Natl Acad Sci. 2009;106: 1766–1771. doi: 10.1073/pnas.0813128106 19179398

23. Mellert K, Uhl M, Högel J, Lamla M, Kemkemer R, Kaufmann D. Aberrant Single Exon Skipping is not Altered by Age in Exons of NF1, RABAC1, AATF or PCGF2 in Human Blood Cells and Fibroblasts. Genes (Basel). 2011;2: 562–577. doi: 10.3390/genes2030562 24710210

24. Pickrell JK, Pai AA, Gilad Y, Pritchard JK. Noisy Splicing Drives mRNA Isoform Diversity in Human Cells. PLoS Genet. 2010;6: e1001236. doi: 10.1371/journal.pgen.1001236 21151575

25. Mueller WF, Larsen LSZ, Garibaldi A, Hatfield GW, Hertel KJ. The silent sway of splicing by synonymous substitutions. J Biol Chem. 2015;290: 27700–27711. doi: 10.1074/jbc.M115.684035 26424794

26. Lorson CL, Hahnen E, Androphy EJ, Wirth B. A single nucleotide in the SMN gene regulates splicing and is responsible for spinal muscular atrophy. Proc Natl Acad Sci. 1999;96: 6307–6311. doi: 10.1073/pnas.96.11.6307 10339583

27. Lim SR, Hertel KJ. Modulation of Survival Motor Neuron Pre-mRNA Splicing by Inhibition of Alternative 3’ Splice Site Pairing. J Biol Chem. 2001;276: 45476–45483. doi: 10.1074/jbc.M107632200 11584013

28. Singh NN, Singh RN, Androphy EJ. Modulating role of RNA structure in alternative splicing of a critical exon in the spinal muscular atrophy genes. Nucleic Acids Res. 2006;35: 371–389. doi: 10.1093/nar/gkl1050 17170000

29. Fairbrother WG, Yeh R-F, Sharp PA, Burge CB. Predictive identification of exonic splicing enhancers in human genes. Science. 2002;297: 1007–13. doi: 10.1126/science.1073774 12114529

30. Nesser NK, Peterson DO, Hawley DK. RNA polymerase II subunit Rpb9 is important for transcriptional fidelity in vivo. Proc Natl Acad Sci. 2006;103: 3268–3273. doi: 10.1073/pnas.0511330103 16492753

31. Jeon C, Agarwal K. Fidelity of RNA polymerase II transcription controlled by elongation factor TFIIS. Proc Natl Acad Sci. 1996;93: 13677–13682. doi: 10.1073/pnas.93.24.13677 8942993

32. Imashimizu M, Oshima T, Lubkowska L, Kashlev M. Direct assessment of transcription fidelity by high-resolution RNA sequencing. Nucleic Acids Res. 2013;41: 9090–9104. doi: 10.1093/nar/gkt698 23925128

33. Horowitz DS. The mechanism of the second step of pre-mRNA splicing. Wiley Interdiscip Rev RNA. 2012;3: 331–350. doi: 10.1002/wrna.112 22012849

34. Yeo G, Burge CB. Maximum Entropy Modeling of Short Sequence Motifs with Applications to RNA Splicing Signals. J Comput Biol. 2004;11: 377–394. doi: 10.1089/1066527041410418 15285897

35. Ustianenko D, Weyn-Vanhentenryck SM, Zhang C. Microexons: discovery, regulation, and function. Wiley Interdiscip Rev RNA. 2017;8: e1418. doi: 10.1002/wrna.1418 28188674

36. Yoshimoto S, Harahap NIF, Hamamura Y, Ar Rochmah M, Shima A, Morisada N, et al. Alternative splicing of a cryptic exon embedded in intron 6 of SMN1 and SMN2. Hum Genome Var. 2016;3: 16040. doi: 10.1038/hgv.2016.40 27917293

37. Pedrotti S, Bielli P, Paronetto MP, Ciccosanti F, Fimia GM, Stamm S, et al. The splicing regulator Sam68 binds to a novel exonic splicing silencer and functions in SMN2 alternative splicing in spinal muscular atrophy. EMBO J. 2010;29: 1235–47. doi: 10.1038/emboj.2010.19 20186123

38. Agarwal R, Schwer B, Shuman S. Structure-function analysis and genetic interactions of the Luc7 subunit of the Saccharomyces cerevisiae U1 snRNP. RNA. 2016;22: 1302–10. doi: 10.1261/rna.056911.116 27354704

39. Black DL. Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem. 2003;72: 291–336. doi: 10.1146/annurev.biochem.72.121801.161720 12626338

40. Chua K, Reed R. An upstream AG determines whether a downstream AG is selected during catalytic step II of splicing. Mol Cell Biol. 2001;21: 1509–14. doi: 10.1128/MCB.21.5.1509-1514.2001 11238888

41. Lallena MJ, Chalmers KJ, Llamazares S, Lamond AI, Valcárcel J. Splicing Regulation at the Second Catalytic Step by Sex-lethal Involves 3′ Splice Site Recognition by SPF45. Cell. 2002;109: 285–296. doi: 10.1016/s0092-8674(02)00730-4 12015979

42. Konarska MM, Vilardell J, Query CC. Repositioning of the Reaction Intermediate within the Catalytic Center of the Spliceosome. Mol Cell. 2006;21: 543–553. doi: 10.1016/j.molcel.2006.01.017 16483935

43. Schirmer M, D’Amore R, Ijaz UZ, Hall N, Quince C. Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data. BMC Bioinformatics. 2016;17: 125. doi: 10.1186/s12859-016-0976-y 26968756

44. Cartegni L, Hastings ML, Calarco JA, de Stanchina E, Krainer AR. Determinants of Exon 7 Splicing in the Spinal Muscular Atrophy Genes, SMN1 and SMN2. Am J Hum Genet. 2006;78: 63–77. doi: 10.1086/498853 16385450

45. Kashima T, Rao N, Manley JL. An intronic element contributes to splicing repression in spinal muscular atrophy. Proc Natl Acad Sci. 2007;104: 3426–3431. doi: 10.1073/pnas.0700343104 17307868

46. Smith PJ, Zhang C, Wang J, Chew SL, Zhang MQ, Krainer AR. An increased specificity score matrix for the prediction of SF2/ASF-specific exonic splicing enhancers. Hum Mol Genet. 2006;15: 2490–2508. doi: 10.1093/hmg/ddl171 16825284

47. Singh NN, Howell MD, Singh RN. Transcriptional and Splicing Regulation of Spinal Muscular Atrophy Genes. Spinal Muscular Atrophy. Elsevier; 2017. pp. 75–97. doi: 10.1016/B978-0-12-803685-3.00005-7

48. Singh NN, Singh RN. Alternative splicing in spinal muscular atrophy underscores the role of an intron definition model. RNA Biol. 2011;8: 600–606. doi: 10.4161/rna.8.4.16224 21654213

49. Singh NN, Androphy EJ, Singh RN. In vivo selection reveals combinatorial controls that define a critical exon in the spinal muscular atrophy genes. RNA. 2004;10: 1291–1305. doi: 10.1261/rna.7580704 15272122

50. Hofmann Y, Lorson CL, Stamm S, Androphy EJ, Wirth B. Htra2-beta 1 stimulates an exonic splicing enhancer and can restore full-length SMN expression to survival motor neuron 2 (SMN2). Proc Natl Acad Sci. 2000;97: 9618–9623. doi: 10.1073/pnas.160181697 10931943

51. Watermann DO, Tang Y, Zur Hausen A, Jäger M, Stamm S, Stickeler E. Splicing factor Tra2-beta1 is specifically induced in breast cancer and regulates alternative splicing of the CD44 gene. Cancer Res. 2006;66: 4774–80. doi: 10.1158/0008-5472.CAN-04-3294 16651431


Článek vyšel v časopise

PLOS One


2019 Číslo 10
Nejčtenější tento týden
Nejčtenější v tomto čísle
Kurzy

Zvyšte si kvalifikaci online z pohodlí domova

Svět praktické medicíny 1/2024 (znalostní test z časopisu)
nový kurz

Koncepce osteologické péče pro gynekology a praktické lékaře
Autoři: MUDr. František Šenk

Sekvenční léčba schizofrenie
Autoři: MUDr. Jana Hořínková

Hypertenze a hypercholesterolémie – synergický efekt léčby
Autoři: prof. MUDr. Hana Rosolová, DrSc.

Význam metforminu pro „udržitelnou“ terapii diabetu
Autoři: prof. MUDr. Milan Kvapil, CSc., MBA

Všechny kurzy
Kurzy Podcasty Doporučená témata Časopisy
Přihlášení
Zapomenuté heslo

Zadejte e-mailovou adresu, se kterou jste vytvářel(a) účet, budou Vám na ni zaslány informace k nastavení nového hesla.

Přihlášení

Nemáte účet?  Registrujte se

#ADS_BOTTOM_SCRIPTS#