Article Text
Abstract
Background Over 40% of male and ∼16% of female carriers of a premutation FMR1 allele (55–200 CGG repeats) will develop fragile X-associated tremor/ataxia syndrome, an adult onset neurodegenerative disorder, while about 20% of female carriers will develop fragile X-associated primary ovarian insufficiency. Marked elevation in FMR1 mRNA transcript levels has been observed with premutation alleles, and RNA toxicity due to increased mRNA levels is the leading molecular mechanism proposed for these disorders. However, although the FMR1 gene undergoes alternative splicing, it is unknown whether all or only some of the isoforms are overexpressed in premutation carriers and which isoforms may contribute to the premutation pathology.
Methods To address this question, we have applied a long-read sequencing approach using single-molecule real-time (SMRT) sequencing and qRT-PCR.
Results Our SMRT sequencing analysis performed on peripheral blood mononuclear cells, fibroblasts and brain tissue samples derived from premutation carriers and controls revealed the existence of 16 isoforms of 24 predicted variants. Although the relative abundance of all mRNA isoforms was significantly increased in the premutation group, as expected based on the bulk increase in mRNA levels, there was a disproportionate (fourfold to sixfold) increase, relative to the overall increase in mRNA, in the abundance of isoforms spliced at both exons 12 and 14, specifically Iso10 and Iso10b, containing the complete exon 15 and differing only in splicing in exon 17.
Conclusions These findings suggest that RNA toxicity may arise from a relative increase of all FMR1 mRNA isoforms. Interestingly, the Iso10 and Iso10b mRNA isoforms, lacking the C-terminal functional sites for fragile X mental retardation protein function, are the most increased in premutation carriers relative to normal, suggesting a functional relevance in the pathology of FMR1-associated disorders.
- Genetics
Statistics from Altmetric.com
Introduction
Carriers of premutation expansions (55–200 CGG repeats) of the fragile X mental retardation 1 (FMR1) gene can present with a range of clinical phenotypes, from mild cognitive and mood problems during childhood and psychiatric and immune-mediated disorders in adulthood. Moreover, approximately 20% of the women will develop fragile X-associated primary ovarian insufficiency by the age of 40 years (reviewed in ref. 1). Premutation carriers are also at high risk of developing the late-onset neurodegenerative disorder, fragile X-associated tremor/ataxia syndrome (FXTAS).2 With an estimated prevalence of approximately 1:251-813 in men and 1:130–259 in women,3 ,4 understanding the pathology of these disorders is of great societal importance.
Extensive alternative splicing of the FMR1 gene has been demonstrated by qRT-PCR analysis, and several FMR1 mRNA and fragile X mental retardation protein (FMRP) isoforms have been observed in both human and mouse.5–11 The distribution and abundance of these isoforms may regulate the expression and functional properties of FMRP. Indeed, the FMR1 gene, which spans approximately 38 kb of genomic DNA and contains 17 exons, undergoes alternative splicing mainly involving inclusion or exclusion of exons 12 and 14 and the use of alternative splice acceptors in exons 15 and 175 ,9 (figure 1). The phosphorylation sites (Ser-499 in the mouse, Ser 500 in humans) mapping within exon 15, necessary for the translational repressor function of FMRP, are removed by the splicing of the first 36 nucleotides at the first acceptor site on exon 15. The second acceptor site removes an additional 39 nucleotides of exon 15, which results in both the loss of the phosphorylation sites and the loss of the methylation sites, likely impacting the composition and/or abundance of FMRP–messenger ribonucleoprotein complexes and translation of FMRP mRNA targets12 (figure 1). In addition, these splicing events in exon 15, which occur in close proximity to the RGG box domain, modulate the RNA-binding affinity of FMRP as evidenced by fluorescence spectroscopy analysis of the RNA-binding properties of recombinant Iso1, Iso2 and Iso3 FMRP proteins, which differ only in the splicing at exon 15.13 Splicing of the full exon 12 shortens a loop (located from the 3′ end of exon 10 through exon 12), within one of two high-affinity RNA-binding K homology (KH) domains in FMRP.14 ,15 Splicing of exon 14 determines nuclear localisation of FMRP isoforms7 and produces a +1 frame shift, which results in the loss of post-translational modification sites and loss of the RGG RNA-binding domain of FMRP. The frame shift also results in several truncated FMRP variants with novel C-termini5 (figure 1). The significance of a splice acceptor within exon 17 is currently unknown, although it has been recently demonstrated that a 17 amino acid stretch within this exon is important for the localisation of FMRP nuclear isoforms within Cajal bodies.16 Thus, exons 12, 14, 15 and 17 are involved in splicing events that can generate a diversity of FMR1 isoforms and likely with diverse biological properties.
Despite their potential impact on FMRP function, the expression profiles of the different FMR1 isoforms have not been characterised in either normal or premutation carriers. Although elevated FMR1 mRNA levels have been observed for FMR1 mRNA expanded alleles ranging from the premutation to the full mutation range,17–21 it is currently unknown whether all or only specific FMR1 mRNA isoforms are differentially overexpressed or distributed within cellular compartments as a function of CGG repeat number. As alternative protein isoforms are likely to have differing functions, alteration in their relative abundances could have profound biological consequences.
To address these questions, we have used single-molecule real-time (SMRT) sequencing technology and qRT-PCR to determine which of the 24 or more predicted isoforms are actually expressed in humans. Relative expression levels were measured by qRT-PCR in peripheral blood mononuclear cells (PBMCs) and brain tissue derived from premutation carriers and compared with age-matched typically developing controls. The relative distribution and abundance of the isoforms was measured by SMRT sequencing in three different tissues (PBMCs, primary fibroblast cells and postmortem brain cerebellar tissue), derived from premutation carriers and compared with age-matched typically developing controls.
Results
Expression levels of all FMR1 mRNA isoforms are elevated for alleles in the premutation range
The mRNA expression levels of those FMR1 isoforms within Group A (Iso1, Iso2, Iso3, Iso13, Iso14 and Iso15), of those missing exon 14 within Group B (specifically Iso 4 and Iso 4b), of those missing exon 12 within Group C (Iso7, Iso8, Iso9, Iso17, Iso18 and Iso19), of Iso7 and Iso17 combined, as well as of those simultaneously missing exons 12 and 14, containing the full exon 15 and differing in alternative splicing at exon 17 (specifically Iso10 and Iso10b), were quantified by qRT-PCR in total RNA isolated from PBMCs (premutations n=70; controls n=40) and from postmortem cerebellum brain tissue (premutations n=22; controls n=7) of male subjects.
Significant increases in expression levels were observed for all groups of isoforms in both PBMCs and brain tissue. However, while the expression levels in PBMCs of isoforms from Group A (p<0.02), Group B (p<0.023) and Group C (p<0.031) were approximately twofold higher in premutations compared with controls, the expression levels of Iso10 and Iso10b were disproportionately elevated, from fourfold to sixfold, (p<0.001) (figure 2A–D). In brain tissue, the increase in expression levels of the total FMR1 mRNA (p=0.233), of the isoforms from Groups A (p=0.615) and C (p<0.05) and of Iso10 and Iso10b (p<0.05) were no or marginally significant, between premutations and controls, in agreement with previous reports22 ,23 (figure 2E–H). As expected, transcript levels of the various isoforms positively correlated with a longer CCG repeat tract in both tissues examined (figure 2I–L).
Base resolution splicing patterns observed with single-molecule, long-read sequencing
Accurate consensus calls of full-length isoforms were obtained at a single-molecule level.24 The reads were aligned to our reference sequence, the full-length FMR1 or Iso1 (NM_002024.5), by using MUMer, a system for rapidly aligning entire genomes, with a low setting for the break-length variable. The resulting alignment data were then parsed to produce the normalised count breakdown by tissue (PBMCs, cerebellum brain tissue and primary fibroblasts) and category (premutation or control) as well as the isoform breakpoints at single base resolution (figure 3).
For each of the six cases (see study subject section), a SMRTbell sequencing library was constructed from PCR products generated using primers complementary to the start of exon 9 and the 3′-untranslated region (3′ UTR) of the FMR1 mRNA. SMRT sequencing was performed to generate exon count analysis by alignment to the reference exons 9 through the 3′ UTR in either the forward or reverse direction. A total of 108 126 reads was obtained for the six samples (∼18 000 reads per sample), with a bimodal read-length distribution and median read-length of 1053 nucleotides (nt). The main peak of the distribution was centered at ∼1100 nt (the reference length of exons 9 through the 3′ UTR is expected to be 1095 nt) with a smaller peak at about 500 nt. Of the total, 14% of the reads did not align to any exon and belonged to the shorter read-length peak. A filter step, requiring a match to both exon 9 and the 3′ UTR, removed 26% of the sequences. A total of 16 isoforms, comprising 57% of the total sequenced pool, had at least 350 reads in agreement, which verified the existence of 16 isoforms in PBMCs, in brain tissue and in fibroblast cells derived from premutations and controls, and showed complete exon sequence fidelity with >90% circular consensus sequence (CCS) accuracy.
Analysis of the read alignment generated by SMRT sequencing revealed that of all exons only exons 15 and 17 exhibited alternative start sites (figure 4). In addition, analysis of the relative abundance of the various isoforms led to the following findings. First, the isoforms missing only exon 12 regardless of splicing in exons 15 and 17 (Iso7, Iso8, Iso9, Iso17, Iso18 and Iso19) were collectively the most abundant isoforms in both premutations and controls, with similar representation in all three tissues examined (figure 5; see online supplementary figure S1). However, the relative proportion of Iso10 and Iso10b was higher within the premutation samples. Furthermore, the percent of reads for Iso10 and Iso10b, lacking exons 12 and 14 but retaining the entire exon 15 and differing only for the spliced acceptor in exon 17, displayed the greatest difference between the premutation and control PMBCs, and to a lesser degree in the other two tissues, confirming the qRT-PCR data (figure 5; see online supplementary figure S1).
These findings are intriguing in light of the fact that, compared with the other isoforms in Group D (figure 5) (Iso11, Iso11b, Iso12 and Iso20), Iso10 and Iso10b retain the full exon 15. The upstream sequence of exon 15 harbours functionally important phosphorylation sites for FMRP. However, although a frame shift due to splicing of exon 14 eliminates the canonical phosphorylation sites known to be important for FMRP function (figure 1A), it also generates potential new phosphorylation sites on Iso10 and Iso10b (figure 1B). Interestingly, the putative Iso11 was only detected in trace amounts in the three tissues analysed for either premutation or control alleles, under our filtering cut-off (at least 350 conforming reads). Verification by qRT-PCR revealed very low levels of this isoform, in agreement with a very small number of reads by sequencing, below our cut-off, and indicating the presence of Iso11 at <1% expression level in all three tissues.
In addition, we observed that isoforms containing all 17 exons, regardless of the splicing patterns at exons 15 and 17 (Iso1, Iso2, Iso3, Iso13, Iso14 and Iso15) were few in number with no differences in proportion between premutations and controls in any of the three tissues. Lastly, similar to Iso11, isoforms missing only exon 14 and varying in splicing of exons 15 and 17 (Iso4, Iso4b, Iso5, Iso5b, Iso6 and Iso16) were not detected under our cut-off conditions, yielding <0.5% abundance in both premutations and controls, in the three different tissues examined. In support of this observation is the qRT-PCR data showing low abundance of both Iso4 and Iso4b mRNAs (data not shown).
Alternative splicing of exon 3
The first half of the FMR1 transcript, spanning exons 1 through 9, was also investigated for the presence of splicing events using SMRT sequencing libraries constructed from PCR products generated using primers located at the 5′ UTR of FMR1 and in exon 9. Splice variants lacking exon 3 were identified in approximately 1% of the transcript reads in both premutation and control samples. The presence of an FMR1 mRNA lacking exon 3 was confirmed by qRT-PCR and subsequent sequencing of the cDNA in both a normal and a premutation sample (figure 6). qRT-PCR on total RNA derived from cases 1 through 6 confirmed the existence of a splicing event involving exon 3 in both premutation and control samples. Interestingly, the expression levels of the isoforms missing exon 3 were the highest in brain tissue (normal=3.49±0.23; premutation=3.17±0.26) compared with PBMCs (normal=0.28±0.09; premutation=0.19±0.03) with no statistical significant difference observed in a subgroup of premutation subjects (n=30) compared with control subjects (n=15).
Discussion
The full-length FMRP, the Iso1 protein, harbours RNA-binding domains (two KH domains and an RGG box),25 ,26 nuclear export and localisation signals,7 ,26 and phosphorylation and methylation sites within exon 15.7 ,27 As the function of any given protein is dictated by the properties of its domains, the splicing/removal of exon sequences that encode functional motifs in the FMR1 gene will likely impact the functional properties of FMRP. The consequent variations in expression levels and/or in ratios between the expressed isoforms could potentially contribute to the FMR1-associated disorders observed in premutations, including FXTAS. Splicing enhancers and silencers are keys to this process, usually accomplished by RNA-binding proteins, such as Sam68, whose role in alternative splicing has been demonstrated for a number of genes including CD44,28 ,29 Bcl-X30 and SMN2.31 In this regard, we have recently demonstrated that Sam68 is a specific and early component of CGG inclusions in FXTAS and is depleted as a consequence of its sequestration by the expanded CGG repeats, leading in turn to an altered splicing-regulatory function.32
FMR1 isoforms contain different discrete exonic combinations due to alternative splicing at exons 12, 14, 15 and 17. Exon 12 or both exons 12 and 14 are entirely removed in certain isoforms and exons 15 and 17 contain two and one alternative splicing site acceptors, respectively. Earlier studies have identified the existence of these isoforms in humans,7 ,33 and more recently in mouse.10 ,11 Importantly, a number of FMR1 alternative spliced isoforms were shown to co-sediment with polyribosomes in sucrose gradient ultracentrifugation experiments performed in adult mouse brain lysates, suggesting that those isoforms can be properly translated.10 These findings, including our current observations, highlight the importance of understanding the expression of multiple FMR1 isoforms and their functional properties. To our knowledge, this is the first study that has used (SMRT) sequencing to identify which FMR1 isoforms are generated from splicing of the FMR1 transcript and to determine their relative abundance in human controls and premutations. As noted, several studies using a qRT-PCR approach have demonstrated the presence of a number of different FMR1 isoforms7 ,10 ,11 ,33; however, PCR-based methodologies only allow the individual splice sites to be analysed separately and therefore fail to provide the combinations of different splice sites within the same RNA molecule. Our approach, which has used the single-molecule long-read sequencing technology, has allowed us to obtain a transcript map of all of the splice combinations within a single FMR1 transcript, and more importantly in both premutation and control individuals. As premutation carriers express elevated levels of FMR1 transcripts, changes in expression of specific isoforms could be playing a relevant role in the pathogenesis of the premutation-associated disorders.
In this study, we have determined the existence of at least 16 out of 24 predicted alternative spliced FMR1 isoforms (nucleotide and polypeptide alignments are shown in online supplementary figures S2 and S3) in PBMCs, brain tissue and fibroblasts, and we have determined the differences in level between premutations and controls in these tissues.
Our qRT-PCR data and sequencing analysis are consistent with previous findings indicating that the most common spliced isoforms are those missing exon 12,10 ,11 suggesting that these splice variants likely have a more critical gene function,34 or that inclusion of exon 12 might have a negative impact on one or more of those functions. In this study, we have been able to assess the contribution to FMR1 expression of all the single isoforms lacking exon 12, including exon 14 but differing in the other alternative splicing sites. All the isoforms tested and belonging to Groups A, B and C, although expressed at different levels, were increased in premutations. Thus, the relative ratio of the most abundant variant isoforms (Iso7, Iso17, Iso8, Iso18, Iso9 and Iso19) was significantly higher in premutation samples compared with controls as expected based on the increased bulk expression of FMR1 mRNA in the premutation range. Detailed examination of spliced variants within this group, spliced at exon 12, revealed that Iso7 and Iso17 are the two most highly expressed isoforms in both the premutation and the control groups, also showing increased levels in the premutation range. The importance of these splicing events is not completely clear as exon 12 is part of an approximately 76 amino acid long unstructured variable loop encoded from the end of exon 10 through 12. This loop localises between β sheets, β2 and β′ in the eukaryotic FMRP KH2 domain, being outside of the type I KH domain minimal motif.15 It is not known whether alternative splicing resulting in the removal of exon 12 and therefore in the shortening of the variable loop by 21 amino acids (encoded by exon 12) would change the arrangement of the β strands or modify the orientation of the KH2 domain affecting its ligand-binding properties, although the crystal structure of FMRP KH1-KH2 domains with a variable loop truncated to 10 amino acids does not demonstrate a compromised folding of the domain.15 However, shortening of the loop may have some effect on KH2 RNA-binding properties as the direct comparison of in vitro binding of KC1 (kissing complex 1) RNA with FMR1 isoforms including or excluding exon 12 showed greater affinities in the absence of exon 12. No differences were observed with binding of poly(rG) sequences.11 Interestingly, the same authors also reported that isoforms including exon 12 are enriched in P7 and P15 mice synaptoneurosomes, suggesting that these isoforms may play a role in nerve terminals of neuronal processes.11 Finally, it has been shown that although the isoform excluding exon 12 is predominant during mouse development, the expression of isoforms including exon 12 increases from E7 to E17, indicating its developmental regulation.11
In those isoforms simultaneously spliced at exons 12 and 14 in addition to introducing a shorter variable loop in the KH2 domain, the splicing event also eliminates a nuclear export signal identified in exon 14 and introduces a frame shift that creates a novel C-terminus,33 with a nuclear localisation of the truncated protein isoforms as likely consequence.7 ,10 Strikingly, our data show that two isoforms in this group, namely Iso10 and Iso10b, which differ only in their splicing pattern in exon 17, show the highest overexpression in the premutation samples as suggested by their greater representation relative to controls as determined by qRT-PCR and by sequencing analysis. (figures 2D,H and 5).
Thus, whereas there is a general increase in the expression of all the detected FMR1 isoform mRNAs consistent with the elevated FMR1 mRNA levels detected in premutation carriers,20 which is thought to lead to RNA toxicity and ultimately to FMR1-associated disorders,1 it also appears that not all FMRP isoforms may be equally represented in premutation carriers. How altered abundances of the FMR1 isoforms, particularly of Iso10 and Iso10b, may impact FMRP function in the premutations is not currently known. One possibility would involve a gain of function mechanism in which an increased relative abundance of truncated isoforms lacking the function of the C-terminal RGG box and nuclear export signal (NES) can effectively execute specific interactions mediated by the N-terminus (containing NES, Tudor and KH domains) that over time may not only deplete a significant pool of FMRP-binding factors but reduce functions mediated by the cross-talk with the missing C-terminus. The most recent model of FMRP function in translational repression, derived from studies in Drosophila melanogaster, implicates the direct binding of FMRP to ribosomes through the KH domains.35 However, repression of translation was reduced by either an I244N mutation in the KH1 RNA-binding domain or a deletion of the RGG box domain, suggesting that the joint activity of the FMRP N-terminus and C-terminus may be essential for FMRP function in the translational regulation of its targets.
Although both Iso10 and Iso10b isoforms retain the full exon 15 sequence, the frame shift resulting from exon 14 splicing eliminates the canonical sites for phosphorylation implicated in FMRP function as a translational repressor7 and introduces novel putative phosphorylation sites prior to the early stop codon.33 One important interaction eliminated by the deletion of the RGG box is that of the microtubule-associated protein 1B (MAP1B) that is involved in the development of the nervous system and normal physiology in adult brain.36 Indeed, it is intriguing that heterozygous MAP1B KO mice exhibit motor system abnormalities including cerebellar ataxia and tremors,37 both phenotypes of FXTAS.38 Using the most stringent settings of the group-based prediction system, V.2.0,39 Ser460 or Thre455 and Thre456 were identified as potential sites of phosphorylation by the Ser/Thre kinase casein kinase 1 (CK1). CK1 phosphorylation has been associated with nuclear localisation of its targets40 and is required for activation of cdk5 under the regulation of metabotropic glutamate receptors.41 ,42 Whether FMRP is a target of CK1 and the potential biological implications has not been contemplated thus far. Interestingly, a recent report implicates FMRP in the regulation of neurotransmitter release via synaptic vesicle exocytosis by modulating the density of N-type calcium channels through direct interaction with FMRP's C-terminus.43 N-type calcium channels are coincidentally regulated by cdk5, which impacts neurotransmitter release at presynaptic terminals.44 The absence of the C-terminus in a larger pool of truncated isoforms may create an imbalance affecting N-type calcium channel densities and, therefore, neurotransmitter release. Therefore, the question of whether the overabundance of these truncated FMRP isoforms may degrade FMRP function and contribute to pathology is of great interest.
Additionally, the significance of splicing events at exon 17 is not presently understood. It has been speculated that the absence of the extended loop removed by this splicing event could provide FMRP specificity reducing the repertoire of mRNA interactions perhaps serving as a mechanism for selection, or discrimination, of targets for FMRP-mediated translational repression.10 However, direct evidence for this mechanism of action has yet to be reported. As the biological function of these mRNA isoforms and the proteins they encode remain unknown, elucidating the function of the truncated FMRP will be crucial for determining their biological role in premutation disorders. Presumably any of the splicing events could lead to altered FMRP function due to changes in thermodynamic and isoelectric properties affecting FMRP-binding activity. Indeed, fluorescence spectroscopy RNA-binding experiments using recombinant FMRP produced from constructs containing spliced Iso1 (containing all exonic regions), spliced Iso2 (all exonic regions and exon 15 spliced at the first acceptor site) and Iso3 (all exonic regions and exon 15 spliced at the second acceptor site) demonstrated that each splicing event in exon 15 leads to a significant increase in RNA-binding affinity of the RGG box.45 The authors suggested that either conformational changes reducing the distance of the two RNA-binding regions that form the RGG box or perhaps the removal of negatively charged amino acids could be responsible for these affinity changes. It is interesting to note that all isoforms containing the entire exon 15, regardless their splicing at exons 12, 14 and 17, were the most highly represented compared with those isoforms spliced at the first or at the second acceptor site in exon 15 (figure 5).
Finally, the presence of alterative splicing in the first half of the FMR1 gene has not been previously reported. We have detected the absence of exon 3 in approximately 1% of the sequenced transcripts. This finding is especially interesting given the presence of two Tudor (Agenet) motifs,33 implicated in the recently proposed chromatin binding-dependent DNA damage response activity of FMRP during development.46 The tandem Tudor domains are highly conserved in the FMRP paralogs FXR1P and FXR2P,47 and are implicated in RNA metabolism and specific protein–protein interactions.48 One of such interaction is the binding of FMRP to brain cytoplasmic RNA (BC1), a small non-coding RNA involved in the translation of certain targets of FMRP. Recently, it was shown that unmethylated BC1, found at synapses, has a higher affinity for FMRP, within the second FMRP Tudor domain.49 Additionally, a KH domain RNA-binding protein Src-associated in mitosis, 68 kDa (Sam68), has been shown to potentially regulate the alternative splicing of FMR1 at exon 3.50 This regulation suggests that the observed sequestration of Sam68 by the FMR1 CGG repeat locus32 may result in increased FMR1 exon 3 splicing. It is unknown which FMR1 mRNA isoforms harbouring splicing events between exons 12 and 17 also become spliced at exon 3. However, the low expression of splicing in exon 3 suggests that possibly only those isoforms present in low abundance may carry this event. It is also possible that relative representation and expression levels of the various isoforms missing exon 3 and alternatively spliced in other exonic regions of the FMR1 gene could be different during development or in different tissues. Thus, further studies to understand the implications of exon 3 splicing events and to verify the presence of the corresponding translated products will be necessary to define its biological significance, which may involve altered mRNA translational activity at the synapses.
Materials and methods
Study subjects
For qRT-PCR analysis, total RNA was isolated from PBMCs from subjects with a normal allele (n=40 (mean±SD), CGG repeats=30.9±5.4; range 19–47 CGG, age range=8–73 years) and from premutation carriers (n=70; mean CGG repeats=100±25; range 58–180 CGG, age range=17–74 years). Among the premutation carriers, 60 had FXTAS stage 3–5 as defined by ref. 51 and postmortem brain tissue from premutation carriers with FXTAS (n=22, mean CGG=87±16.4; range 57–118 CGG; age range=66–87 years) and controls (n=7; mean CGG=30±7.7; age range 53–88 years).
For sequencing analysis, PBMCs were collected from two different male subjects, one carrying a normal and one a premutation FMR1 allele (case 1, CGG=30, age=35 years; case 2, CGG=170, age=19 years, respectively). Primary fibroblast cultures were established from two different male subjects, one carrying a normal and one a premutation FMR1 allele (case 3, CGG=20; case 4, CGG=107, respectively). Postmortem brain tissue from a control (case 5, CGG=30, age=69 years, obtained from the Harvard Brain Bank Repository) and from a premutation carrier with FXTAS (case 6, CGG=92, age=81 years; obtained from the UC Davis Brain Tissue Repository) were used for SMRT sequencing analysis.
RNA isolation
Total RNA was isolated from 3 mL of blood collected in Tempus tubes according to manufacturer's instructions (Applied Biosystems, Foster City, California, USA). Total RNA from postmortem cerebellar tissues or from 1×106 primary fibroblast cells were isolated using Trizol (Life Technologies, Carlsbad, California, USA) from samples stored at −80°C and pulverised in liquid nitrogen. All RNA isolations were performed in a clean RNA designated area. TotRNA quantification and quality control were carried out using the Agilent 2100 Bioanalyzer system.
mRNA expression levels
qRT-PCR was performed using custom-designed TaqMan primers and probe assays (Applied Biosystems, Foster City, California, USA). Custom-designed probes were used to detect both the full-length FMR1 gene and the β-glucuronidase (GUS) gene that was used for normalisation; details are as previously described.20 Probe and primer assays were also designed to quantify: total FMR1 mRNA (which included the transcripts derived for all possible isoform combinations; forward primer 5′-TGG CTT CAT CAG TTG TAG CAG G-3′, reverse primer 5′-TCT CTC CAA ACG CAA CTG GTC-3′), FMR1 isoforms in Group A (forward primer 5′ TGGCTTCATCAGTTGTAGCAGG 3′; reverse primer 5′ TCTCTCCAAACGCAACTGGTC 3′); FMR1 isoforms within Group B, specifically Iso4 and Iso4b (forward primer 5′ TGGCTTCATCAGTTGTAGCAGG 3′, reverse primer 5′ CAGAATTAGTTCCTTTAAATAGTTCAGG 3′); FMR1 isoforms in Group C (forward primer 5′ TCCAGAGGGGTATGGTACCATT 3′, reverse primer 5′ TCCAAACGCAACTGGTCTACTTC 3′); FMR1 Iso7 and Iso17 (forward primer 5′ TCCAGAGGGGTATGGTACCATT 3′, reverse primer 5′ GCTTCAGAATTAGTTCCTGAAGTATATCC 3′) and FMR1 Iso10 and Iso10b isoforms (forward primer 5′ TCCAGAGGGGTATGGTACCATT 3′, reverse primer 5′ CAGAATTAGTTCCTTTAAATAGTTCAGG 3′). Two sets of primers were also designed to verify the alternative splicing at exon 3 (forward 1 primer 5′-GCA TTT GAA AAC AAG TGT ATT CCA G 3′, reverse 1 primer 5′ TGG CAG GTT TGT TGG GAT TAA CAG A-3′, forward 2 primer-2 5′-AGG CAT TTG TAA AGG ATG TTC ATG-3′ and reverse 2 primer-2 5′-GCA AGG CTC TTT TTC ATT TGC T-3′). mRNA expression levels for each isoform were compared between premutation and control samples using two-sample t tests conducted on log transformed data.
Generation of PCR amplicons for SMRT libraries
Two PCR products, covering the entire FMR1 coding region, were amplified. One amplicon (plus and minus exon 3), ∼847 bp in length, was amplified using primers located in the 5′ UTR (5′ GCA GGG CTG AAG AGA AGA TG 3′) and a reverse primer located at the exon 8/9 boundary (5′ CAC TGC ATC CTG ATC CTC TC 3′). The second amplicon (including al the expressed isoforms) of ∼1119 bp in length was obtained using a forward primer located at exon 8/9 boundary (5′ GAG AGG ATC AGG ATG CAG TG 3′) and a reverse primer located in the 3′ UTR of the FMR1 gene (5′ CCT GTG CCA TCT TGC CTA C 3′). cDNAs were made as described in Tassone et al19 and were derived from total RNA isolated from normal and premutation PBMCs (cases 1 and 2), from normal and premutation primary fibroblasts (cases 3 and 4) and from normal and premutation cerebellum tissue (cases 5 and 6). cDNA synthesis was performed as previously described in ref. 19.with minor modifications; briefly the annealing temperature in the RT reaction was 55°C for 40 min. PCR reactions were performed in 50 μl aliquots containing 1× high fidelity buffer, 0.2 mM each deoxynucleotide triphosphate, primers each at 0.2 μM, 1.25U Platinum Taq DNA Polymerase high fidelity, 2 mM MgSO4 (Life Technology-Invitrogen, Grand Island, New York, USA).
Library preparation and polymerase binding
SMRT bell sequencing libraries were constructed for each PCR amplicon for the six cases using the DNA Template Prep Kit 1.0 (Pacific Biosciences, Menlo Park, California, USA), followed by primer annealing and polymerase binding using the DNA Polymerase Binding Kit (Pacific Biosciences, Menlo Park, California, USA) as described in ref. 52.
SMRT sequencing
Sequencing was performed on the RS I V.1.3.3.1.116585 software and C2 chemistry. Each run consisted of a 45′ movie that interrogated 75 000 zero mode waveguides. Two to three movies were acquired per sample, using diffusion loading yielding an average of 13 463 reads per sample (table 1).
Full-length transcript analysis
We used intramolecular CCS alignment to generate high-accuracy individual sequence reads.53 The reads were aligned to our reference sequence, Iso1, by using MUMmer (NUCmer (NUCleotide MUMmer) V.3.07).54 A cut-off match alignment of >97% was used to keep high-quality agreements. The data were then collated on a per-molecule basis to construct the full transcript using R (http://www.r-project.org). Only molecules that contained both ends of the cDNA strand amplified by RT-PCR (exons 1 and 9 or exons 9 and 17 for the amplicons covering the first and the second half of the FMR1 gene) were selected for further analyses. All samples had rates of 90% or greater for inclusion of exon 17, but varied for inclusion of exon 9 from 75% to 93%. Table 1 shows the number of sequences that were filtered and total number of reads. Subsequent filtering criteria included strand agreement to the mapped reference piece.
Assignment of isoform constraints
FMR1 isoforms contained different discrete exonic combinations due to alternative splicing at exons 12, 14, 15 and 17. Exon 12 or both exons 12 and 14 are entirely removed in certain isoforms and exons 15 and 17 contain two and one alternative splicing start sites, respectively (figure 1). Nomenclature of the identified isoforms is as illustrated in table 2. Limited by the size of transcripts sequenced (from the 5′ UTR to exon 9), transcripts with exon 3 spliced out were not assigned isoform designations as downstream splicing events in exons 12, 14, 15 and 17 were undetermined.
In conclusion, the characterisation of the expression levels of the FMR1 isoforms is fundamental for understanding regulation of the expression of the FMR1 gene as well as to elucidate the mechanism by which ‘toxic gain of function’ of the FMR1 mRNA and intranuclear inclusion formation may take place in FXTAS and/or in the other FMR1-associated conditions. These findings also suggest that in addition to the elevated levels of FMR1 isoforms the altered abundance/ratio of the corresponding FMRP isomers may affect the overall function of FMRP in premutations. Thus, future research should address these questions and whether or not the proteins for which the splicing of exon 14 creates a frame shift that leads to the loss of the canonical phosphorylation sites are translated and determine the impact of FMRP variants and the biogenesis of specific FMR1 isoforms in the neurological and neurodevelopmental clinical presentation in premutation carriers, particularly of FXTAS.
Acknowledgments
This work is dedicated to the memory of Matteo. We thank Elizabeth Berry-Kravis for providing one fibroblast cell line.
References
Supplementary materials
Supplementary Data
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
Files in this Data Supplement:
- Data supplement 1 - Online supplement
- Data supplement 2 - Online supplement
- Data supplement 3 - Online figure
Footnotes
-
Contributors DIP performed experiments, data analysis and wrote the manuscript. JSE, CMY and PJH analysed data and contributed to writing of the manuscript. H-TT, EWL and CR performed experiments and contributed to writing of the manuscript. BD-J performed data analysis and contributed to writing of the manuscript. FT designed the study, analysed data and contributed to writing of the manuscript.
-
Funding This work was supported by the National Institutes of Health (NIH) through the research award HD02274 and HD040661 and by the National Center for Advancing Translational Sciences research grant UL1 TR000002. Brain tissue samples were obtained from the UC Davis Brain Repository, from the Harvard Brain Tissue Resource Center, which was supported in part by PHS grant number R24MH 068855; the NICHD Brain and Tissue Bank for Developmental Disorders at the University of Maryland, Baltimore, Maryland, supported by NICHD contract # NO 1-HD-4-3368 and NO1-HD-4-3383.
-
Competing interests FT has received funds from Roche and had consulted with Novartis and Genentech. FT and PJH hold patents for sizing of the CGG repeat and for quantification of FMRP. PJH and FT are collaborators with Pacific Biosciences on an NIH STTR grant. The other authors declare no conflicts of interest.
-
Patient consent Obtained.
-
Ethics approval UC Davis Institutional Review Board (IRB) protocols.
-
Provenance and peer review Not commissioned; externally peer reviewed.