Context: Dyslexia is a common disorder with a strong genetic component, but despite significant research effort, the aetiology is still largely unknown.
Objective: To identify loci contributing to dyslexia risk.
Methods: This was a genomewide linkage analysis in a single large family. Dutch families with at least two first degree relatives suffering from dyslexia participated in the study. Participants were recruited through an advertisement campaign in papers and magazines. The main outcome measure was linkage between genetic markers and dyslexia phenotype.
Results: Using parametric linkage analysis, we found strong evidence for a locus influencing dyslexia on Xq27.3 (multipoint lod = 3.68). Recombinations in two family members flanked an 8 cM region, comprising 11 currently confirmed genes. All four males carrying the risk haplotype had very low scores on the reading tests. The presentation in females was more variable, but 8/9 females carrying the risk haplotype were diagnosed dyslexic by our composite score, so we considered the putative risk allele to be dominant with reduced penetrance. Linkage was not found in an additional collection of affected sibling pairs.
Conclusions: A locus influencing dyslexia risk is probably located between markers DXS1227 and DXS8091 on the X chromosome, closely situated to a locus indicated by a published genome scan of English sibling pairs. Although the locus may not be a common cause for dyslexia, the relatively small and gene poor region offers hope to identify the responsible gene.
- QTL, quantitative trait locus
- reading disability
- genome scan
- X chromosome
Statistics from Altmetric.com
Developmental dyslexia is defined as a distinct learning impairment characterised by difficulties in single word decoding. Dyslexics show profound difficulties in learning to read and/or write despite otherwise normal intelligence and cognitive skills. Multiple family and sibling studies, investigating both a global dyslexia phenotype and specific cognitive abilities within the dyslexia spectrum, have consistently demonstrated that genetic factors are of major aetiological significance for development of the condition. Linkage studies have indicated susceptibility loci on a number of chromosomes, including 1,1 2,2,3 3,4 6,5,6 7,7 15,8–10 and 18.11 The genetic studies were not long ago reviewed by Schulte-Körne12 and Fisher & deFries.13 Recently, Taipale et al14 identified a gene on 15q that is associated with dyslexia.
The prevalence of dyslexia has been found to be higher in male than in female schoolchildren. The ratio of boys to girls among dyslexics varies between studies, depending on method of ascertainment and on the applied definition of dyslexia. In a large, presumably unbiased collection of Dutch schoolchildren, an M:F ratio of around 2:1 was found.15 The total prevalence of dyslexia in this survey of Dutch 12 year olds was 3.6%.
A possible cause for sex bias in congenital disorders is the involvement of X linked loci. Segregation studies, however, indicate that the disorder is usually inherited as autosomal dominant.16,17 Recently, in a genomewide scan on a collection of clinic ascertained sibling pairs from the UK, some evidence was found for a quantitative trait locus (QTL) for single word reading on Xq26, although stronger signals were found at other chromosomal locations.11 A recessive allele of an X linked gene increasing the risk for dyslexia could explain why (hemizygous) males are more commonly or more severely18 affected than females. Other explanations for the skewed sex ratio, however, involving a role of male specific hormones during development interacting with an autosomal locus, for example, cannot be ruled out.19
In the current study, we investigated a large Dutch speaking family in which dyslexia appeared to be segregating (fig 1). Diagnosis was based on the combination of scores on a small number of tests, as described below. We carried out a genomewide, 400 marker linkage study including the X chromosome. Parametric linkage analysis was used to identify possible genes predisposing for dyslexia. Regions with suggestive linkage were checked in a collection of affected sibling pairs.
MATERIALS AND METHODS
As part of a large multidisciplinary research effort into different aspects of dyslexia, an advertisement campaign was set up to collect multiplex dyslexia families. Advertisements were published in regional and national papers and magazines. Families were ascertained when at least two first degree relatives had a school history of reading problems. In this way, a panel of predominantly nuclear families was collected. In addition, a few large three generation families displaying apparent autosomal dominant inheritance could be identified, including the family described in this paper. Informed consent was obtained from all participants and the study was approved by the local ethics committee (CWOM) of the University Medical Centre Nijmegen under CWOM-nr 9811-025. The family described in this paper included 29 persons of white descent who were all willing to cooperate (fig 1). Loci with suggestive lod scores were checked in a subset from our collection of affected sibling pairs and their parents, who had been ascertained and tested in the same manner as the large family. As linkage results were found for an X linked locus with seemingly a clearer phenotype in males, we selected 67 pairs of affected brothers and their mothers from our sibling pair collection for this initial check. A genomewide linkage scan was also carried out for two other extended families. As the linkage results did not coincide with those of the family described in this paper, the results will be described elsewhere.
The subjects were administered a small battery of tests, as described below, in individual sessions.20
Single word reading test
This is a standardised test, often used for testing dyslexia in schools.21 The subjects have to read across columns of increasingly difficult, unrelated words within a time constraint of 1 minute. Only correctly pronounced words are scored. This test measures a combination of speed and accuracy, as many adults who have had reading difficulties in school can eventually reach normal levels of accuracy, but are likely to remain significantly slower than normal.
Non-word reading test
Subjects must read across columns of increasingly difficult non-existing, but pronounceable words with a time constraint of 2 minutes.22 Again, the score was the number of words pronounced correctly within the time constraint. In this test, the lexical route of reading, which according to some theories is independent of the phonological route, is eliminated.
This test is part of the Dutch version of the Wechsler Adult Intelligence Test.23 It tests the ability of the subject to express him/herself verbally. The subject is offered two words and is asked to describe as concisely as possible the similarity between them. Examples are car-aeroplane or courage-cowardice. Other tests were administered as well, but they were not used in this study and will therefore not be discussed. Adult subjects were asked to fill out a questionnaire concerning school history and related issues. One of the parents filled out a similar questionnaire for the children within the household. We also obtained a list of presumably affected members from the contact person.
The control group consisted of 560 16–17 year old pupils from four different levels in Dutch secondary education: VBO, MAVO, HAVO, and VWO (roughly equivalent to technical training, O-levels, higher general secondary education, and A-levels, respectively). Prior to further analysis, the scores of these pupils were weighted relative to the actual distribution of pupils over those levels in Dutch schools. Scores from test subjects were expressed as percentile scores relative to this weighted control group.
We used a phenotype definition following the criteria designed by the Dutch Dyslexia Programme. Affected status according to this definition required scoring below the normative 10th percentile on the word or the non-word reading test or scoring below the 25th normative percentile on both of the tests. Affected status could also be assigned based on a discrepancy criterion: it required the word or non-word reading score to be ⩾60 percentage points below the normalised score on the verbal competence test (Kuijpers et al20). The discrepancy criterion was included to identify people who had adequately compensated their reading problems, but still scored below what could be expected based on their general verbal competence, when pressed for time. These criteria classified over 20% of the weighted control group as dyslexic. An almost identical combination of criteria was used by Grigorenko,8 with an almost identical percentage of affected individuals in the control population. The criteria used by the Dutch dyslexia programme to identify dyslexics are intentionally rather wide, so as not to miss compensated adults.24 Criteria designed to select children for remedial teaching will probably differ from criteria designed to identify persons with a predisposition for dyslexia for research purposes.8
For all adults and for schoolchildren having completed their first year of secondary school, the control group was used to assign affectedness status. For younger children, published scoring criteria were used.21,22
EDTA blood was collected from each person and DNA was extracted by a standard extraction method.25 The genome was screened using the set of 400 CA repeat markers (LMS-MD10 v2.5; Applied Biosystems, Foster City, CA, USA). The average heterozygosity of these markers was 0.76. The average distance between markers was 10 cM. Intermarker distances for multipoint analysis were obtained from Marshfield genetic maps (http://research.marshfieldclinic.org/). Three additional markers, DXS8028, DXS8084, and DXS8106 were selected for fine mapping purposes from the public databases on chromosome X. Six markers in the region that showed strongest evidence for linkage were analysed in 67 affected male sibling pairs and their mothers (DXS1227, DXS8106, DXS8084, DXS8043, DXS8028, and DXS8091), which were a subset from our collection of families. The analysis of the markers was according to the protocol provided for the LMS-MD10 v2.526 with some small modifications. PCRs were performed in 10 l reaction volumes containing about 40 ng of genomic DNA, 5 pmol of each primer, 2.5 mmol/l MgCl2, 250 mol/l each dNTP, 10 mmol/l Tris HCl (pH 8.3), 50 mmol/l KCl, and 0.4 U of Taq polymerase (AmpliTaqTM Gold; Applied Biosystems). The PCRs had an initial denaturation stage of 12 minutes at 95°C, followed by 10 cycles of 15 seconds at 94°C, 15 seconds at 55°C and 30 seconds at 72°C, then 22 cycles of 15 seconds at 89°C, 15 seconds at 55°C, and 30 seconds at 72°C, and a final extension step at 72°C for 10 minutes. Reactions for each marker were performed separately, with products being pooled into size specific sets before typing. Markers were typed on an ABI 3100 sequencer (Applied Biosystems) using the GenescanTM and GenotyperTM software (Applied Biosystems). Allele binning was done with the Excel macro Linkage Designer (designed by G van Camp).27 Mendelian inheritance was checked with PedCheck software.28
Parametric linkage analyses were performed using version 5.2 of the LINKAGE program. Two point lod scores were calculated using MLINK of the LINKAGE package version 5.1.29 Multipoint analyses were carried out with the LinkMap program30 in regions that had lod scores >1.5, followed by inspection of haplotypes, which were created in the software program Cyrillic 2.1.
There were two models used in the linkage analysis, with the following characteristics. Model 1: an autosomal dominant model with sex dependent penetrance (0.02, 0.95, and 1.00 for unaffected, heterozygous, and homozygous males; 0.005, 0.85, and 0.95 for unaffected, heterozygous, and homozygous females), and an allele frequency of 2%, assuming a 5% disease prevalence and a 1.5 M:F ratio. Model 2: an autosomal dominant model, assuming high phenocopy numbers (penetrances 0.16, 0.95, and 1.0 for males; 0.16, 0.85, and 0.95 for females). Model 1 is based on published segregation analyses for dyslexia.17 Model 2 assumes 19% prevalence of dyslexia and an almost equal sex ratio, and is based upon the frequency of the phenotype in the control group.20 For X linked loci, identical penetrance values were used (M: 0.02 and 0.95; F: 0.005, 0.85, and 0.95), although these result in slightly lower prevalences and less skewed sex ratios. Because a genetically heterogeneous disorder such as dyslexia makes it difficult to choose the appropriate parameter set, we also conducted a non-parametric linkage analysis in Merlin 0.10.1 (options Minx−NPL), which uses Kong-Cox function31 to convert Z scores into lod scores and associated p values).
Linkage of the locus to the dyslexia phenotype in the affected sibling pairs was assessed by non-parametric linkage using Genehunter.
Mutation analysis of candidate genes in the region identified by high lod scores was performed by forward and reverse sequencing of the coding regions of the genes of at least one male family member carrying the risk haplotype and at least one male family member not carrying the risk haplotype.
The family consisted of 33 people, 29 of whom could be included in this study. Of these 29 subjects, five males and 10 females were classified as dyslexic, based on the tests described above. One of those females was an unrelated spouse. The grandmother of the family (I.2), reportedly, could not read (test scores can be found in table 1).
Linkage results are based on 374 of 400 markers that showed reliable results. The highest lod score by far, and the only one exceeding the threshold lod score of 3, was on the X chromosome at marker DXS8043 with Model 1. The two point lod score was 3.38 (multipoint lod score 3.68) at θ = 0.00 of marker DXS8043 (fig 2). The high phenocopy Model 2 also indicated only this locus, although as expected with a lower two point lod score of 2.61. Neither model indicated other loci with lod scores exceeding 1.5. None of the previously described candidate loci on chromosomes 1, 2, 3, 6, 15, and 18 (see text), showed evidence for linkage. The non-parametric linkage analysis by Merlin resulted in Z = 8.75 at marker DXS8043 with p <0.0001. Converted to lod, this was 1.95, p = 0.0014 (fig 2).
As two key recombinants flanked the region around marker DXS8043, we tested three extra markers between the two breakpoints in the key recombinants and their ancestors, the extra markers being DXS8028, DXS8084, and DXS8106. The chromosomal region was narrowed down to about 8 cM between marker DXS1227 (proximal) and marker DXS8091 (distal) (fig 1). This region encompasses chromosomal band Xq27.3 and small parts of the flanking bands Xq27.2 and Xq28.
Sex dependent expression of the X linked allele
Depending on the dominance of the alternative alleles, X linked genes may cause differences in prevalence among males and females. According to our criteria all four males carrying the risk haplotype and eight of the nine women carrying the risk haplotype were classified as dyslexic. This argues for a dominant effect of the risk allele, yet it is possible that heterozygous females are less severely affected than hemizygous males. To investigate this we returned to the test scores. We added up the test results for single word reading and for non-word reading to create a single score, as these tests are the most important components for the dyslexia criteria. An analysis of covariance was carried out with carrier status and sex as fixed factors, and age as a covariant. Age had no significant effect on test results (p = 0.116). As expected, a significant effect of carrier status on test results was found (p = 0.002). No significant effect of sex (p = 0.563) was found, but there was a significant interaction between sex and carrier status (p = 0.046); the difference in test scores between male carriers and non-carriers was significantly larger than the difference between female carriers and non-carriers. Non-carrier males scored a mean (SD) of 178 (31) (corresponding z score = 0.2) on the combined test, non-carrier females scored 160 (34) (z-score = −0.5). Among the carriers, males scored on average 79 (17) (z score = −3.1) and females 125 (58) (z score = −1.6) (fig 3). The effect of the allele on reading ability, therefore, seemed to be larger in hemizygous men (decline of 3.3 SD) than in heterozygous women (decline of 0.7 SD). The pattern for the males seemed very straightforward: all four carriers scored below the fifth percentile on word and non-word reading, and all were dyslexic according to their own statement. Of the carrier women, II.12, her daughters III.15 and III.16, and niece III.13 scored below the fifth percentile on word reading, and II.12, III.13, and III.15 also on non-word reading, so these women resembled the males. Subjects II.12 and III.13 were dyslexic according to their own report; the other two (8 and 11 years old) reportedly had difficulties in school, but the mother was not sure whether this was due to dyslexia, and the girls had not been tested. Four other females (II.2, II.6, II.10, and III.8) were identified as dyslexic because of the discrepancy between non-word reading and their verbal competence. Of these, II.10 and III.8 reported to have had spelling problems during elementary education, but few difficulties later in life, while II.6 reported only mild problems with reading and spelling during elementary school and with languages during secondary education. One woman (III.5) carried the risk haplotype, but scored far above average (99th and 100th percentile) on word and non-word reading. She reported no real problems at school. Although the allele seemed dominant, its effect appeared more variable in females than in males. Two persons (III.1 and III.9) were found to be dyslexic, but did not carry the risk haplotype. Both had a severe phenotype with word and non-word reading below or equal to the fifth percentile. At least one of those had an affected parent not related to the family. Obviously, other causes for dyslexia played a role in this family (individual test scores can be found in table 1).
Investigation of Xp27 locus in affected sibling pairs
The affected siblings were on average 17 years old (range 8–40 years). Their mean (range) test scores were 61.64 (7–115) for word reading, 39.9 (1–78) for non-word reading, and 101.5 (8–186) for the sum of both. For comparison, in our control population the averages were 87 (47–116) for word reading, 81 (20–116) for non-word reading, and 168 (74–232) for the sum.
The result of the non-parametric linkage analysis (NPL-all) for the dyslexia phenotype was a z-value of −1.64 at marker DXS8043 with associated p value = 0.94, so there was no evidence for linkage to this locus in the general Dutch population.
No mutations were found to segregate with dyslexia in the coding regions of the genes FMR1, Cxorf1, DKFZp574M2010, or KIAA1854.
Candidate locus for dyslexia on Xq27-28
Reading difficulties are in part due to genetic influences. Linkage studies have indicated a number of chromosomal regions, which shows that dyslexia is a genetically heterogeneous trait and many genes can contribute to the phenotype. To date, a gene affecting dyslexia has been identified in only one of those regions. The present study cannot confirm any of the previous reports, with the possible exception of some results from a large genome scan on sibling pairs by Fisher et al.11 In that study, possible linkage for a reading ability QTL was found to the region Xq26 around marker DXS1047 (position 82.07 cM on the Marshfield map) with a broad peak. Our current study found high lod scores around marker DXS8043 at position 94.2 cM on the Marshfield map (Xq27.3). Broad regions and flat peaks are not unusual in studies of neurological disorders,32 so these results may well point to the same gene.
We were able to narrow the chromosomal region in which the putative gene should be found to a size of about 8 cM (sex average map), not including marker DXS1047 found by Fisher et al. The region spans band Xq27.3 and parts of flanking bands Xq27.2 and Xq28. Because of the possibility of phenocopies among the key recombinants, the boundaries should be considered with some caution.
This is a relatively gene poor region: only 11 confirmed genes are reported and an additional eleven predicted (Public and Celera databases, Map View Build 33, October 2003). The first marker outside the risk haplotype at the telomeric side (DXS8091) lies within the first intron of the FMR2 gene, a gene involved in fragile X mental retardation. However, the coding regions of the gene are outside our interval. The other currently identified genes in this region are MAGEC3, MAGEC1, and MAGEE1, genes from a melanoma antigen family; MYCL3, a pseudogene with homology to an avian viral oncogene;33RRM2P4, a ribonucleotide reductase pseudogene; FMR1, a gene involved in fragile X mental retardation; Cxorf1, a gene expressed in the hippocampus;34 and KIAA1854, FLJ25736, and DKFZp547M2010, coding for hypothetical proteins. The MAGE family genes in this region seem to be sperm specific.35 FLJ25736 protein has also been isolated from human testes.36 These genes do not seem to be likely candidates for dyslexia susceptibility. Hypothetical protein KIAA1854 is expressed in brain and is probably identical to the product of putative gene SLITL1, a gene assumed to be involved in formation and maintenance of the nervous system.37,38 Trinucleotide repeat expansions in FMR1, and subsequent silencing of the gene, are associated with mental retardation and learning disabilities. FMR1 contains an mRNA binding region and is involved in neurone development.39 The FMR1, KIAA1854, and Cxorf1 genes can be considered plausible candidates. The function of DKFZp547M2010 is currently unknown, but it is expressed in the hippocampus. It shows high homology with the product of the Mus musculus gene SLITRk4, which controls neurite outgrowth, and is expressed in brain and hippocampus. We therefore considered DKFZp547M2010 to be a candidate as well.
Mutation analysis of the coding regions Cxorf, KIAA1854, FMR1, and DKFZp547M2010 revealed no polymorphisms that segregated with dyslexia.
Degree of dominance of the putative locus
The results of this study suggest that the locus on Xq27.3 contains a risk factor for dyslexia with sex dependent expression. Alhough eight of nine carrier women were designated dyslexic according to our tests, not all had reported themselves as dyslexic. The phenotype seemed more variable in the nine women than in the four males. The sample is of course very small, so the conclusions for sex differences are tentative. A more varied phenotype in women for an X linked gene can, however, easily be explained by the mosaicism of X inactivation,40 but other explanations of the sex differences in expression are also possible. As all except one of the carrying females were found to be dyslexic, we would consider this factor dominant with sex related differences in penetrance and expression.
Importance of current results for the general Dutch population
Alhough segregation analyses and other linkage studies have shown that in most populations X linked loci are not of major importance for dyslexia or its components,17 it cannot be ruled out that dyslexia can be a polygenic trait to which X linked genes contribute. Most probably, it is a genetically heterogeneous trait. We found the factor on Xq27.3 to tend towards dominance, and this cannot explain the consistently reported male bias. However, the degree of dominance may vary with the genetic background in which the allele finds itself. It is also possible that other mutations in the same gene act as recessive alleles, thus contributing to a male bias.
Fisher et al found a locus at Xq26 with a Haseman-Elston analysis of single word reading in their UK sample of affected sibling pairs, but not in their US sample. We also found no indication for a major contribution of this locus to dyslexia no its components in our sample of sibling pairs. Most probably, the mutation is not very common, but not completely unique to our family either.
In summary, there were strong indications from our linkage study that the Dutch speaking family described in this paper carried a risk factor for dyslexia on the X chromosome in the 8 cM region between markers DXS1227 and DXS8091. The factor behaved as a dominant allele with a variable effect in females. Analysis of this locus in an additional collection of 67 male sibling pairs showed that the locus does not appear to be of major importance in our Dutch sample. Findings by Fisher et al, however, suggest it plays a minor role in UK populations.
We are grateful to the families for participating in this study. We thank H Brunner for useful discussions, N Blokdijk for help in collecting the samples, B Franke for help with administering the tests, and C Kuijpers for help with interpreting the test scores. We also thank B Simmelink and M Schijvenaars for help with sequencing. This work was funded by grant 200-62-305 from the Dutch Research Association (NWO), division Geesteswetenschappen.
Conflicts of interest: none declared
The first two authors contributed equally to this work. Dr Sandkuijl is deceased.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.