Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Rare loss-of-function variants in SETD1A are associated with schizophrenia and developmental disorders

Abstract

By analyzing the whole-exome sequences of 4,264 schizophrenia cases, 9,343 controls and 1,077 trios, we identified a genome-wide significant association between rare loss-of-function (LoF) variants in SETD1A and risk for schizophrenia (P = 3.3 × 10−9). We found only two heterozygous LoF variants in 45,376 exomes from individuals without a neuropsychiatric diagnosis, indicating that SETD1A is substantially depleted of LoF variants in the general population. Seven of the ten individuals with schizophrenia carrying SETD1A LoF variants also had learning difficulties. We further identified four SETD1A LoF carriers among 4,281 children with severe developmental disorders and two more carriers in an independent sample of 5,720 Finnish exomes, both with notable neuropsychiatric phenotypes. Together, our observations indicate that LoF variants in SETD1A cause a range of neurodevelopmental disorders, including schizophrenia. Combining these data with previous common variant evidence, we suggest that epigenetic dysregulation, specifically in the histone H3K4 methylation pathway, is an important mechanism in the pathogenesis of schizophrenia.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Study design for the schizophrenia (SCZ) exome meta-analysis.
Figure 2: The genomic position and coding consequences of 16 SETD1A LoF variants observed in the schizophrenia exome meta-analysis, the DDD study and the SiSU project.
Figure 3: A comparison of genome-wide de novo mutation rates in probands with ASD, DD, schizophrenia (SCZ) and controls.

Similar content being viewed by others

References

  1. Perälä, J. et al. Lifetime prevalence of psychotic and bipolar I disorders in a general population. Arch. Gen. Psychiatry 64, 19–28 (2007).

    Article  PubMed  Google Scholar 

  2. van Os, J. & Kapur, S. Schizophrenia. Lancet 374, 635–645 (2009).

    Article  CAS  PubMed  Google Scholar 

  3. Saha, S., Chant, D. & McGrath, J. A systematic review of mortality in schizophrenia: is the differential mortality gap worsening over time? Arch. Gen. Psychiatry. 64, 1123–1131 (2007).

    Article  PubMed  Google Scholar 

  4. Lichtenstein, P. et al. Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet 373, 234–239 (2009).

    Article  CAS  PubMed  Google Scholar 

  5. Sullivan, P.F., Kendler, K.S. & Neale, M.C. Schizophrenia as a complex trait: evidence from a meta-analysis of twin studies. Arch. Gen. Psychiatry 60, 1187–1192 (2003).

    Article  PubMed  Google Scholar 

  6. Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).

    Article  CAS  PubMed Central  Google Scholar 

  7. The International Schizophrenia Consortium. Rare chromosomal deletions and duplications increase risk of schizophrenia. Nature 455, 237–241 (2008).

  8. Malhotra, D. & Sebat, J. CNVs: harbingers of a rare variant revolution in psychiatric genetics. Cell 148, 1223–1241 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Kirov, G. et al. De novo CNV analysis implicates specific abnormalities of postsynaptic signaling complexes in the pathogenesis of schizophrenia. Mol. Psychiatry 17, 142–153 (2012).

    Article  CAS  PubMed  Google Scholar 

  10. Rees, E. et al. Analysis of copy number variations at 15 schizophrenia-associated loci. Br. J. Psychiatry 204, 108–114 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  11. Purcell, S.M. et al. A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506, 185–190 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Takata, A. et al. Loss-of-function variants in schizophrenia risk and SETD1A as a candidate susceptibility gene. Neuron 82, 773–780 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. The Network and Pathway Analysis Subgroup of the Psychiatric Genomics Consortium. Psychiatric genome-wide association study analyses implicate neuronal, immune and histone pathways. Nat. Neurosci. 18, 199–209 (2015).

  15. De Rubeis, S. et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515, 209–215 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Sanders, S. et al. Insights into autism spectrum disorder genomic architecture and biology from 71 risk loci. Neuron 87, 1215–1233 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Girard, S.L. et al. Increased exonic de novo mutation rate in individuals with schizophrenia. Nat. Genet. 43, 860–863 (2011).

    Article  CAS  PubMed  Google Scholar 

  18. Xu, B. et al. De novo gene mutations highlight patterns of genetic and neural complexity in schizophrenia. Nat. Genet. 44, 1365–1369 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Gulsuner, S. et al. Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell 154, 518–529 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Guipponi, M. et al. Exome sequencing in 53 sporadic cases of schizophrenia identifies 18 putative candidate genes. PLoS One 9, e112745 (2014).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. McCarthy, S.E. et al. De novo mutations in schizophrenia implicate chromatin remodeling and support a genetic overlap with autism and intellectual disability. Mol. Psychiatry 19, 652–658 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. He, X. et al. Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes. PLoS Genet. 9, e1003671 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Genovese, G. et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N. Engl. J. Med. 371, 2477–2487 (2014).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  24. Exome Aggregation Consortium. Analysis of protein-coding genetic variation in 60,706 humans. Preprint at http://www.biorxiv.org/content/early/2015/10/30/030338 (2015).

  25. Iossifov, I. et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature 515, 216–221 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. The Deciphering Developmental Disorders Study. Large-scale discovery of novel genetic causes of developmental disorders. Nature 519, 223–228 (2015).

  27. Rauch, A. et al. Range of genetic mutations associated with severe non-syndromic sporadic intellectual disability: an exome sequencing study. Lancet 380, 1674–1682 (2012).

    Article  CAS  PubMed  Google Scholar 

  28. de Ligt, J. et al. Diagnostic exome sequencing in persons with severe intellectual disability. N. Engl. J. Med. 367, 1921–1929 (2012).

    Article  CAS  PubMed  Google Scholar 

  29. Rajji, T.K., Ismail, Z. & Mulsant, B.H. Age at onset and cognition in schizophrenia: meta-analysis. Br. J. Psychiatry 195, 286–293 (2009).

    Article  CAS  PubMed  Google Scholar 

  30. Fahrner, J.A. & Bjornsson, H.T. Mendelian disorders of the epigenetic machinery: tipping the balance of chromatin states. Annu. Rev. Genomics Hum. Genet. 15, 269–293 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Firth, H.V. et al. DECIPHER: database of chromosomal imbalance and phenotype in humans using ensembl resources. Am. J. Hum. Genet. 84, 524–533 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Kirov, G. et al. The penetrance of copy number variations for schizophrenia and developmental delay. Biol. Psychiatry 75, 378–385 (2014).

    Article  CAS  PubMed  Google Scholar 

  33. Bassett, A.S. et al. Clinical features of 78 adults with 22q11 deletion syndrome. Am. J. Med. Genet. 138, 307–313 (2005).

    Article  PubMed  Google Scholar 

  34. Butcher, N.J. et al. Functional outcomes of adults with 22q11.2 deletion syndrome. Genet. Med. 14, 836–843 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Ryan, A.K. et al. Spectrum of clinical features associated with interstitial chromosome 22q11 deletions: a European collaborative study. J. Med. Genet. 34, 798–804 (1997).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Lee, J., Tate, C.M., You, J. & Skalnik, D.G. Identification and characterization of the human Set1B histone H3-Lys4 methyltransferase complex. J. Biol. Chem. 282, 13419–13428 (2007).

    Article  CAS  PubMed  Google Scholar 

  37. Lee, J. & Skalnik, D.G. Wdr82 is a C-terminal domain-binding protein that recruits the Setd1A Histone H3-Lys4 methyltransferase complex to transcription start sites of transcribed human genes. Mol. Cell. Biol. 28, 609–618 (2008).

    Article  CAS  PubMed  Google Scholar 

  38. The Uniprot Consortium. UniProt: a hub for protein information. Nucleic Acids Res. 43, 204–212 (2014).

  39. Kleefstra, T. et al. Further clinical and molecular delineation of the 9q subtelomeric deletion syndrome supports a major contribution of EHMT1 haploinsufficiency to the core phenotype. J. Med. Genet. 46, 598–606 (2009).

    Article  CAS  PubMed  Google Scholar 

  40. Dincer, A. et al. Deciphering H3K4me3 broad domains associated with gene-regulatory networks and conserved epigenomic landscapes in the human brain. Transl. Psychiatry 5, e679 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).

    CAS  PubMed  PubMed Central  Google Scholar 

  42. Picard. http://broadinstitute.github.io/picard/ (accessed 1 March 2011).

  43. McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).

    CAS  PubMed  PubMed Central  Google Scholar 

  44. DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. 43, 491–498 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Van der Auwera, G.A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinformatics 11, 11.10.1–11.10.33 (2013).

    Google Scholar 

  46. Jun, G. et al. Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. Am. J. Hum. Genet. 91, 839–848 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).

    CAS  PubMed  PubMed Central  Google Scholar 

  48. Thornton, T. et al. Estimating kinship in admixed populations. Am. J. Hum. Genet. 91, 122–138 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Li, H. Toward better understanding of artifacts in variant calling from high-coverage samples. Bioinformatics 30, 2843–2851 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. McLaren, W. et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor. Bioinformatics 26, 2069–2070 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Kircher, M. et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat. Genet. 46, 310–315 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Do, R. et al. Exome sequencing identifies rare LDLR and APOA5 alleles conferring risk for myocardial infarction. Nature 518, 102–106 (2014).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  53. Wu, M.C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. PLINK/SEQ version 0.09. http://atgu.mgh.harvard.edu/plinkseq/ (accessed 1 February 2014).

  55. Harrow, J. et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 22, 1760–1774 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Samocha, K.E. et al. A framework for the interpretation of de novo mutation in human disease. Nat. Genet. 46, 944–950 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Köhler, S. et al. The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data. Nucleic Acis Res. 42, D966–D974 (2014).

    Article  CAS  Google Scholar 

  58. Sanders, S.J. et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485, 237–241 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Moore, C. et al. The INTERVAL trial to determine whether intervals between blood donations can be safely and acceptably decreased to optimise blood supply: study protocol for a randomized controlled trial. Trials 15, 363 (2014).

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank the thousands of patients who participated in these studies. We thank H. Firth and D. FitzPatrick for discussions. The UK10K project was funded by Wellcome Trust grant WT091310. The DDD Study is funded by HICF-1009-003. The DDD and the INTERVAL sequencing studies are funded by Wellcome Trust grant WT098051. T.S. is supported by the Williams College Dr. Herchel Smith Fellowship. P.F.S. is supported by NIH R01 MH077139. A.P. is supported by Academy of Finland grants 251704 and 286500, NIMH U01MH105666 and the Sigrid Juselius Foundation. The work at Cardiff University was funded by Medical Research Council (MRC) Centre (G0801418) and Program Grants (G0800509). The key groups of the Sequencing Initiative Suomi (SISu) project are from the Universities of Eastern Finland, Oulu and Helsinki and The Institute for Health and Welfare, Finland, Lund University, The Wellcome Trust Sanger Institute, University of Oxford, The Broad Institute, University of Michigan, Washington University in St. Louis and University of California, Los Angeles (UCLA). The SiSu project is coordinated in the Institute for Molecular Medicine Finland at the University of Helsinki. Participants in INTERVAL were recruited with the active collaboration of NHS Blood and Transplant England, which has supported fieldwork and other elements of the trial. DNA extraction and genotyping was funded by the National Institute of Health Research (NIHR RP-PG-0310-1004), the NIHR BioResource and the NIHR Cambridge Biomedical Research Centre. The academic coordinating center for INTERVAL was supported by core funding from NIHR Blood and Transplant Research Unit in Donor Health and Genomics, UK Medical Research Council (G0800270) and British Heart Foundation (SP/09/002). M.I.K. was supported by Instrumentarium Science Foundation, Finland; Finnish Foundation for Cardiovascular Research; Orion Research Foundation and the University of Eastern Finland, Saastamoinen Foundation.

Author information

Authors and Affiliations

Authors

Consortia

Contributions

T.S., S.S.G., E.L.C., D.G., M.E.H., M.C.O'D. A.P., M.J.O. and J.C.B. conceived and designed the experiments. S.S.G., E.L.C. and E.P. performed the experiments. T.S., D. Curtis, S.M.P., L.C., J.M. and H.C. performed the statistical analysis. T.S., M.I.K., S.M.P., L.C., J.M., H.C., G.B. and E.R. analyzed the data. M.I.K., S.M.P., J. Suvisaari, D.B., G.B., O.P., D. Collier, M.J.D., J.D., N.B.F., M.J., G.K., J.K., O.K., P.H., C.M.H., M.M., S.A.M., P.M., A.M.M., A.M., J.S.M., C.M., W.O., T.P., D.R., J. Sambrook, P.S., D.S.C., J.V., J.T.R.W., H.W. and P.F.S. contributed reagents, materials and analysis tools. T.S., P.F.S., M.E.H., M.J.O. and J.C.B. wrote the paper. J. Suvisaari, D.B., M.A., M.B., T.C., D. Collier, N.C., M.D., A.F., S.J., C.I., J.L., R.M.M., R.N.-E., T.P. and D.S.C. recruited patients.

Corresponding author

Correspondence to Jeffrey C Barrett.

Ethics declarations

Competing interests

P.F.S. consults for Pfizer, Inc. M.E.H. is a shareholder of, and consultant for, Congenica Ltd.

Additional information

A full list of consortium members is available in the Online Methods.

A full list of consortium members is available in the Online Methods.

A full list of consortium members is available in the Online Methods.

A full list of consortium members is available in the Online Methods.

Integrated supplementary information

Supplementary Figure 1 Density plots of sequence coverage in the UK10K, INTERVAL and DDD data sets.

Per-sample sequence coverage was calculated and summarised from exome sequencing data generated in the UK10K (N = 4,734 in batch 0, and N = 562 in batch 1), INTERVAL (N = 4,502), and DDD (N = 1,972) data sets. The UK10K data set was separated into two sequencing batches. Top: sample mean coverage; Middle: percentage of Gencode v19 coding bases covered at 10x or more in each sample; Bottom: percentage of Gencode v19 coding bases covered at 20x or more in each sample.

Supplementary Figure 2 Variant metrics in the UK10K and INTERVAL data sets after each variant filtering step.

Box plots of per-sample heterozygote count (nHets), non-reference homozygote count (nNonRefHom), transition/transversion rate (TiTv), number of singletons (nSiS), and number of indels (nIndels) following each variant QC step. Variant metrics were summarised across all samples in the UK10K and INTERVAL data sets. Raw: no variant QC steps applied; Cov7: restricting to variants with at least 7x mean coverage; VQSR: GATK variant calibration using default parameters; Miss: filter for excess missingness; HWE: filter for deviation from Hardy-Weinberg equilibrium; GTfilt: filter for low alternate allele read depth, and abnormal allelic balance; GQmean: filter for low genotype quality; LCR: exclude variants in low-complexity regions. See Online Methods on more information on each step of variant QC.

Supplementary Figure 3 Variant counts summarized according to variant class and sequencing batch in the UK10K, INTERVAL, Finnish and Swedish data sets.

Box plots of per-sample variant counts in the UK10K, INTERVAL, Finnish, and Swedish data sets. All samples included in our meta-analysis are represented in the figure. The UK10K data sets was sub-divided according to sequencing batches (batch 0 and batch 1), and sample ancestry (UK and Finnish). The Finnish control data sets was separated by study of origin (Metsim, Finrisk, and Sanger controls). The Swedish case-control data set was separated into two sequencing batches. Differences exist in total variant counts between the UK, Finnish, and Swedish collections, likely reflecting differences in sequencing depth, capture reagents, sequencing protocol, read alignment, and variant calling. However, variant counts and population genetics metrics were consistent between cases and controls within each population group.

Supplementary Figure 4 Distributions of TiTv and frameshift/in-frame ratios in the UK10K, INTERVAL, Finnish and Swedish data sets.

Box plot of sample TiTv (left) and violin plot of sample frameshift-to-inframe ratio (right) in the UK10K, INTERVAL, Finnish, and Swedish data sets. All samples included in our meta-analysis are represented in the figure. See Supplementary Figure 3 for the legend, and a description of each batch and sub-study. Following sample and variant QC, the per-sample transition/transversion rate was comparable between all populations (mean ~3.25).

Supplementary Figure 5 Principal component analysis of UK and Finnish samples in our UK10K schizophrenia data set.

Principal components were estimated using 1000 Genomes samples, onto which we projected our cases and controls. We verified if samples had the same population ancestry (UK, Finnish or Swedish) as reported in the sample manifests, and excluded individuals who were of non-European ancestry. Thresholds for sample inclusion and exclusion are shown as dashed lines in each plot. Our analyses incorporated this information to perform permutations within each population (UK, Finnish, and Swedish) to control for ancestry and batch-specific differences. Top left: Population structure of all UK10K samples, with 1000 Genomes populations used as bases. We restricted our analyses to individuals of European ancestry; Bottom left: PCA plot of individuals of non-Finnish European ancestry in the UK10K data set with 1000 Genomes European populations used as bases. Samples not within the UK cluster (center of the plot) were excluded from analysis; Top right: PCA plot of individuals of Finnish ancestry in the UK10K data set. Samples not in the Finnish cluster (right of plot) were excluded from analysis. The three-letter symbols describing each population originate from nomenclature in the 1000 Genomes Project.

Supplementary Figure 6 Manhattan plot of the rare variant association analysis of LoF variants in 4,264 cases and 9,343 controls.

We tested for an excess of LoF variants within 18,271 genes using Fisher’s exact test. – log10P-values were plotted against the chromosomal location (mid-point) of each gene. We showed results from three allele frequency thresholds (singletons, < 0.1% and < 0.5%) for aggregating rare variants. No gene exceeded the exome-wide significant threshold of P = 1.25 x 10-6 (red line).

Supplementary Figure 7 Q-Q plots of the rare variant association analysis of LoF variants in 4,264 cases and 9,343 controls.

We tested for an excess of LoF variants within 18,271 genes using Fisher’s exact test, and plotted the ordered – log10P-values against transformed P-values sampled from the uniform distribution. The Q-Q plots for gene burden tests with minor allele frequency cut-offs of 0.1% and 0.5% followed an expected null distribution. The Q-Q plot for the burden test of singleton variants still showed deflation because the per-gene counts are too low and the data does not meet the asymptotic requirements of the statistical test. We included P-values from informative tests in which genes have at least one case LoF count.

Supplementary Figure 8 The robustness of the SETD1A result across reasonable parameters in the TADA model.

Because the TADA model depended heavily on the specification of its hyperparameters, we calculated the log q-value of SETD1A across different mean relative risk of de novo variants (), mean relative risk of case-control variants (), and numbers of true schizophrenia risk genes (k). Each vertical column is a different value for , and each horizontal facet is a different value for k. Our signal in SETD1A had a q-value < 0.01 across all reasonable parameters. Blue line: P = 0.05; red line: P = 0.01.

Supplementary Figure 9 Manhattan plot of the meta-analysis of de novo mutations and case-control variants in 1,077 trios, 4,264 cases and 9,343 controls.

De novo and case-control burden P-values were meta-analysed using Fisher's combined probability method. – log10P-values were plotted against the chromosomal location (mid-point) of each gene. A total of 18,271 genes were tested. Only SETD1A exceeded exome-wide significance, with P = 3.3 × 10−9. Red line: P = 1.25 × 10−6.

Supplementary Figure 10 Q-Q plot of the meta-analysis of de novo mutations and case-control variants in 1,077 trios, 4,264 cases and 9,343 controls.

De novo and case-control burden P-values were meta-analysed using Fisher's combined probability method, and the log10 P-values plotted against transformed P-values sampled from the uniform distribution. Because only a subset of genes had de novo LoF variants, Fisher’s method deflated the combined P-value of genes without any de novo information.

Supplementary Figure 11 Results from the minigene experiment assessing the impact of the exon 16 splice acceptor site variant.

A. Minigene constructs driving expression of exons 15, 16 (ref and alt), and 17 fused to GFP were transfected into HELA cells. RT-PCR analysis of cell lysates using primer pair 2, spanning exons 15, 16, and the intervening intron reveal a change in size of PCR products suggesting retention of the intervening intron in the construct containing the splice-acceptor deletion (panel A, "Exons 15-16", REF versus ALT). PCRs with primer pair 3, spanning the intron downstream of exon 16 show no change in band sizes (panel A, "Exons 16-17", REF versus ALT), suggesting this intron is correctly spliced out in both reference and alternate forms. B. Cartoon of genomic locus surrounding the exon 16 splice acceptor deletion. The predicted structure of reference (green) and deletion containing (red) transcripts are shown above and below genomic map. The red star indicates a predicted premature stop codon due to intron retention and resulting frame-shifted translation. C. Results from capillary sequencing of PCR products from panel A confirms intron retention in the splice acceptor deletion construct (panel C, "RNA", yellow box). This will result in a predicted frame-shifted translation of exon 16 (panel C, "PEP", red box), and a premature truncation of the protein 28 amino acids into exon 16 (red star). Downstream intron splicing was confirmed by capillary sequencing to be intact in both constructs.

Supplementary Figure 12 De novo microdeletion of a single copy of SETD1A identified in the DDD study.

A proband was identified to have a 650 kb deletion encompassed SETD1A and 29 other genes. The figure showing the deletion was generated using the UCSC Genome Browser (https://genome.ucsc.edu/).

Supplementary Figure 13 Mendelian disorders of epigenetic machinery at histone H3.

Writers (in green) add methyl groups at the specified residue of the histone tail, while erasers (in red) perform targeted demethylation. Disrupting variants in writers and erasers described in the figure result in well-known examples of dominant, highly penetrant disorders characterised by developmental delay and intellectual disability. Only the tail of histone H3 and its four key lysine residues are illustrated here. Alternate nomenclature: EHMT1 (also known as KMT1D), EZH2 (KMT6A), NSD1 (KMT3B), SETD1A (KMT2F).

Supplementary Figure 14 Sample size curves for detecting an increased risk of premorbid cognitive impairment in schizophrenia SETD1A LoF carriers.

We performed power calculations using a simple one-sided t-test to identify sample sizes required to show possible cognitive impairment in SETD1A schizophrenia carriers. Effect sizes d (0.5, 1), and allele frequencies (0.0001, 0.0005, 0.001) are varied to show their influence on statistical power. We assume a Type I error probability of 0.05. For these effect sizes and frequencies, a sample of tens of thousands of cases will be needed.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–14 and Supplementary Tables 1–5 (PDF 2128 kb)

Supplementary Methods Checklist

(PDF 386 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Singh, T., Kurki, M., Curtis, D. et al. Rare loss-of-function variants in SETD1A are associated with schizophrenia and developmental disorders. Nat Neurosci 19, 571–577 (2016). https://doi.org/10.1038/nn.4267

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nn.4267

This article is cited by

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research