Statistics from Altmetric.com
Keratoconus (KC; MIM 148300) is a non-inflammatory corneal thinning disorder. Progressive thinning and protrusion of the cornea lead to loss of visual acuity for which the only effective treatment is corneal transplantation. The estimated prevalence of KC is 1–4 per 2000 in the general population.1 The age of onset is often at puberty, and the disorder is progressive until the third or fourth decade of life when it usually arrests. KC is a major cause of corneal transplantation in developed countries.
The cause of KC is still unknown. Histological observations have demonstrated degradation of the corneal epithelium basal membrane, diminution of the number of collagen fibrils, thinning of the corneal stroma, and keratocyte apoptosis.2,3 Biochemical studies describe increased activity of metalloproteinases (MMP-2 and MMP-9) and lower expression of protease inhibitors such as α1-protease inhibitor.4,5
Sporadic KC is the most common presentation; however, a positive familial history has been reported in at least 6–10% of patients.1 Twin studies performed since the advent of modern computerised videokeratoscopy have reported four monozygotic pairs concordant for KC and two monozygotic pairs which were discordant,6–9 suggesting a genetic component for KC. Keratoconus prevalence in first degree relatives of KC patients is significantly higher than in the general population, demonstrating familial aggregation of the trait.10,11 Most published studies have suggested autosomal dominant inheritance of KC with incomplete penetrance or variable expression.1,12 Autosomal recessive inheritance as well as rare cases of X linked inheritance have also been described.10,13 In some rare cases, KC is associated with other genetic disorders such as trisomy 21, atopy, and Leber congenital amaurosis.14,15
KC gene localisation efforts to date have been carried out in rare large families or in population isolates where genetic heterogeneity is minimised. Linkage studies include a genome scan of 20 Finnish families resulting in a multipoint LOD score of 4.1 on chromosome 16q,16 and a directed chromosome 21 scan in one large family with trisomy 21 leading to suggestive linkage.17 Association studies in eight unrelated patients in Tasmania have identified two putative loci on 18p and 20q12.18,19 However, these results have not yet led to the identification of a causative gene for familial KC and, although KC is present in all human populations, no studies have tested for linkage in a mixed outbred population.
In the present article, we identify a new candidate region at 2p24 mapped by linkage and haplotype analysis in a heterogeneous population.
The study population included 253 members of 28 families recruited in France, Spain, and Guadeloupe (West Indies). There were 112 affected and 104 unaffected individuals; the other 37 individuals were of undetermined phenotype. Families were of European, Arab, and Caribbean African descent. All pedigrees had a minimum of two members who met the definition of clinical KC20 and no other recognised genetic disease segregated in these families. We performed ophthalmic examination of all the members of the 28 families using videokeratographic evaluation as described previously.11 KC was defined according to Rabinowitz’s KISA% index20: patients with a KISA% over 300 have clinical KC, patients with a KISA% between 100 and 300 may have forme fruste KC and were considered as being of an undetermined phenotype, and patients with a KISA% under 100 are unaffected. Children under the age of 16 were considered of unknown phenotype unless clinical KC was observed.
Twenty eight pedigrees with isolated familial keratoconus (MIM 148300) were investigated.
A genome-wide search mapped a new locus for keratoconus to chromosome 2p24 with a maximal HLOD score of 5.13. Analysis of key recombination events placed the keratoconus locus in a 1.69 Mb region flanked by markers D2S305 and D2S2373. Linkage to all other known keratoconus loci on chromosomes 3, 16, 20, and 21 was excluded.
This candidate region contains eight known genes and 10 Acembly gene predictions.
This study reports the first genome-wide scan in an outbred population (Caucasian, Arab, and Caribbean African) and is the first step in the positional cloning of a keratoconus gene.
This study was approved by the ethical committee “Toulouse 2” according to French law, and informed consent was obtained from each patient.
Venous blood samples were collected in EDTA for DNA extraction according to standard methods.21 Genotyping was performed in two successive stages. An initial set of seven informative families (88 individuals of Caucasian and Arab origin) composed of both affected and unaffected individuals was genotyped with the ABI PRISM Linkage Mapping Set of microsatellite markers, version 2 (Applied Biosystems, Foster City, CA, USA). While the first analysis was underway, additional families with KC were recruited. The total sample size was thus increased to 28 informative pedigrees, consisting of 112 affected and 104 unaffected members available for genotyping. The second set of 21 families was only genotyped with markers for the short arm of chromosome 2.
All PCR reactions were carried out with 25 ng of genomic DNA as a template in a mixture of 1×Applied Biosystems PCR buffer II, 2.5 mM MgCl2, 200 nM of each dNTP, 5 pmol of each primer, and 0.1 μl of AmpliTaq Gold polymerase (Applied Biosystems) in a final volume of 15 μl. The thermocycling conditions were 94°C for 10 min, followed by 33 cycles at 94°C for 30 s, 53°C for 30 s, and 72°C for 30 s, followed by a final extension step at 72°C for 5 min. PCR products were then pooled with regard to their size range and labelling and were analysed by denaturing electrophoresis on an ABI Prism 3100 DNA Sequencer (Applied Biosystems). The marker alleles were assigned with GENESCAN and GENOTYPER software (Applied Biosystems). A set of 36 additional markers obtained from online information was added for detailed analysis of the short arm of chromosome 2.
Non-parametric multipoint linkage analysis for the genome-wide screen was performed with the Merlin program.22 Non-parametric linkage analysis concentrates on allele sharing between family members with the same phenotype. Merlin outputs both NPLall (Zlr) scores and Kong and Cox LOD scores23 to test for allele sharing among affected individuals, with their asymptotic p values. All Mendelian inheritance and genotype errors were checked using Merlin and resolved. Allele frequencies were calculated in unrelated founders with Merlin. Multipoint parametric linkage analysis, allowing for heterogeneity, was performed with Genehunter, version 2.1.24 The frequency of the putative KC gene was set at 0.0006. No sex difference was assumed. Penetrances were 0.001, 0.5, and 0.8, respectively, for the 1/1, 1/2, and 2/2 genotypes (1, normal allele; 2, mutated allele; a penetrance of 0.001 for the 1/1 genotype corresponds to the prevalence of sporadic KC). The marker mapping information was obtained from the Marshfield map25 and the NCBI database (Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM). We used Genehunter for haplotype reconstruction. The program reports the most likely haplotype, marking crossovers and highlighting double crossovers. It also indicates regions of haplotype uncertainty.
Assignment of the KC gene to 2p24
We undertook a genome-wide search for linkage in an initial set of seven families whose results are displayed in fig 1. Non-parametric analysis by Merlin found suggestive linkage on chromosome 2p with a LOD score of 2.64 (Zlr = 4.91, p<10−5) for marker D2S305. No other LOD scores >1.5 were observed. In particular, no evidence for linkage between KC and markers on the chromosome arms 16q, 18p, or 21q was observed.
The full data set of 28 families was then typed for 36 additional markers across a 80 cM region spanning D2S319 to D2S286. Marker order is displayed in table 1. When all families were analysed we obtained suggestive evidence of linkage with a maximum LOD score at D2S305 of 3.26 (Zlr = 5.02, p<10−5) (fig 2, grey line).
Since segregation of KC in our families seemed in agreement with autosomal dominant inheritance, we performed multipoint parametric analysis and applied a dominant model to our data. We set gene frequency of the familial form of KC at 0.0006 and applied penetrances of 0.001, 0.5, and 0.8 corresponding to the 1/1, 1/2, and 2/2 genotypes. Allowing for genetic heterogeneity, we obtained a maximal HLOD score of 5.13 for α = 0.52 (α = fraction of families used to obtain a maximum LOD score) at marker D2S320 (fig 2, black line). These results demonstrate significant linkage for KC at 2p24. Of note, the 1-LOD-drop support interval around the maximum HLOD score, an approximate 90% confidence interval, included a region spanning D2S2346 to D2S2342 (including D2S305 and D2S320).
Haplotypes were established with all 48 microsatellite markers given in table 1. Since the mode of inheritance and the penetrance of KC have yet to be determined, we first examined the segregation of haplotypes at 2p24 between affected individuals. We show that marker alleles cosegregated with the KC phenotype in 17 out of the 28 families (fig 3). Haplotypes differed between families. Haplotypes extend up to, but do not include, markers at the ends of the bars. Critical meiotic recombinants could be identified in families 3, 5, and 6. Integration of healthy individuals changed haplotypes in families 17 and 30, where a gap was found in the shared haplotype due to a double recombination event in a healthy sibling. Two intervals were therefore shared by all affected members of the17 families (hatched area in fig 3). Detailed genotype data for families 3, 5, 6, 17, 30, and 33 are shown for 12 markers spanning the shared region in figs 4A and 4B. The first is telomeric to marker D2S2233 and centromeric to D2S305, while the second is telomeric to marker D2S2373 and centromeric to D2S2150. The overall interval covers a region of ∼3.78 cM (1.69 Mb). The first interval (D2S2233–D2S305) extends over 0.4 Mb and the second interval (D2S2373–D2S2150) extends over 0.5 Mb.
Search for candidate genes or transcripts
According to data available on the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=OMIM), UCSC Genome Browser (http://www.genome.ucsc.edu/cgi-bin/hgGateway), and Ensembl (http://www.ensembl.org/) databases, eight previously identified genes were contained between markers D2S305 and D2S2373. These genes are OSR1, KIAA1336, MATN3, LAPTM4A, SDC1, PUM1, ARHB, FLJ14249, GDF-7, and FLJ21820 (fig 5). Also, 10 Acembly (http://www.humangenes.org/) spliced gene predictions have also been mapped to this region. When only the two shared intervals are considered, this list is reduced to the known genes OSR1, ARHB, FLJ14249, GDF-7, and FLJ21820, and the Acembly spliced gene predictions keekla, skylu, klodo, kerkla, klardo, plydo, and skeedo. Of this list, only ARHB (RhoB) and GDF-7 have a known cellular function (actin reorganisation and control of cell differentiation, respectively).
KC is the leading cause of corneal transplantation in developed countries. Therefore, geneticists are working to map the gene responsible for familial KC and so greatly increase the little knowledge we have of this pathology. The results of the present genome-wide linkage and refinement studies indicate that a major gene for KC, responsible for 50–60% of familial KC cases studied, is located within a 1.69 Mb region (one of two small regions, both <0.5 Mb) at 2p24.
We performed here a genome-wide scan in seven large KC families and found a suggestive level of linkage at 2p24. The divergence of the values obtained for the LOD score and the Zlr statistic at this locus (LOD = 2.64 is only evidence suggestive of linkage, whereas Zlr = 4.91 is generally accepted as significant evidence) is probably due to a skewed distribution of Zlr because of the small number of families included in this first step of the study (for detailed discussion see Kong and Cox23).
Refinement of the 2p24 region using a larger set of 28 KC families resulted in a non-parametric LOD score curve with two peaks with a LOD score >3. Various hypotheses could explain the existence of these two peaks and are yet to be tested. First, one of the peaks could be artefactual, maybe due to the heterogeneity of the population and/or the phenotype. For example, linkage studies for asthma resulted in two peaks at 20p13 separated by approximately 5 cM with exactly the same LOD score values.26 Refinement of the phenotype used for these linkage studies supported only the region under the second peak and led to identification of mutations in the ADAM33 gene. It is possible that there are different phenotypes for KC that are for the moment clinically undistinguishable. Second, the two LOD peaks could be accounted for by the existence of two causal KC genes at the same locus. This has already been demonstrated in Griscelli syndrome where some patients were found to have no mutations in the MyoVa gene whilst haplotype analysis showed segregation at the same 15q21 locus.27 Finally mutations were discovered in RAB27A, 1.6 cM distant from the MyoVa gene.28
Since the beginning of our project, one group has published the results of their genome-wide scan suggesting the existence of a locus for familial KC on chromosome 16q22.3–q23.1 in a genetically isolated Finnish population.16 Mutations in the VSX1 gene (20p11–q11), coding for a putative homeobox domain transcription factor have also been identified in patients suffering from KC. However these mutations were only found in 4.7% of KC cases which corresponds to approximately 1% of familial KC cases.29 Two additional loci (20q12 and 18p) and a founder effect have also been identified by association studies.18,19 Nevertheless, these results are not in conflict with our data since a high level of genetic heterogeneity is suspected for familial KC, resulting in different loci segregating with the disease in various families or populations.
Despite the use of a fairly large population including several large families, we did not obtain clear evidence (LOD = 3.26) of non-parametric linkage at commonly accepted statistical significance levels (LOD = 3.3). The segregation of KC in the majority of our families seemed largely in agreement with a dominant Mendelian mode of inheritance with incomplete penetrance. In view of this and since genetic heterogeneity is suspected for KC, we applied a dominant model to obtain parametric HLOD scores. After analysis, we obtained a significant HLOD of 5.13 with α = 0.52.
Haplotype analysis also confirmed that a chromosomal region at 2p24 segregated with the disease phenotype in 17 out of the 28 families. This corresponds to 60% of the families and is close to the 52% used by Genehunter to optimise the HLOD score (the difference is not statistically significant). The higher percentage obtained by haplotype analysis could be due to the fact that some of the unaffected individuals used by Genehunter carry the mutated allele but display no KC phenotype because of incomplete penetrance.
In families 3 and 5, single (individual 5.II.2) and double (individual 3.III.4) recombination events allowed us to narrow down the locus to a 1.69 Mb interval. In families 17 and 30, inclusion of an unaffected sibling with a double recombination event in the haplotype analysis divided the candidate region into two smaller regions. This could lead to several different explanations. First, both apparently healthy siblings in families 17 and 30 (both are over the normal KC onset age) could be carriers of the causal mutation but display no phenotype. Second, the natural occurrence of two such double recombinations seems very unlikely and this event could perhaps be explained by spurious marker order. However, we observe a rather high recombination rate in this region in our families (95 recombination events for 97 meioses over the 80 cM region and 40% of these recombination events are restricted to two 5 cM regions surrounding markers D2S2373 and D2S367) and several small double recombinations. Microsatellite markers in the D2S272–D2S2221 region were regenotyped and previously observed double recombination events were validated. This chromosomal region contains numerous repetitive sequences (Alu, LINE SINE) which could perhaps influence the stability of the 2p24 region and should be taken into account for positioning candidate regions.
We conclude that we have identified a causative locus for familial KC at 2p24 which predisposes to the disease in a significant proportion of the families. The fact that we discovered no common haplotype between the various linked families is not surprising since no relationship is known of between these families and the microsatellite markers used for haplotype analysis are probably not dense enough to prove a founder effect.
This new KC locus covers a physical distance of 1.69 Mb which was further reduced to 0.9 Mb following the analysis of informative family 17. Eight known genes and 10 Acembly gene predictions have been mapped within its boundaries. Functional candidate KC genes have been proposed up to now on the basis of histological analysis. These include corneal collagens, proteinases, proteinase inhibitors, and interleukin-1 associated proteins. Collagen 6A1 was a major candidate after linkage; it was found on chromosome 21q in a large pedigree with Down syndrome, but this chromosome has been entirely sequenced without the discovery of any mutation.30 Metalloproteinases are another candidate gene group; MMP-2 and MMP-9 have both been shown to be upregulated in patients with KC.5,12,31 MMP-9 is located near the 20q12 locus identified in Tasmanians and MMP-2 is near the 16q locus in a Finnish population, but both have since been excluded as candidate genes.16,19
Of the five known genes (OSR1, RhoB, GDF-7, FLJ21820, and FLJ14249) in the two regions with shared haplotypes, none are known to be involved in human disease. OSR1 encodes odd-skipped related 1, a putative transcription factor which possesses a homeobox domain. RhoB is a possible candidate gene since it stimulates actin stress fibre formation and focal adhesion points. Mutations in RhoB could explain structural defects observed in KC corneas. However, this gene is expressed in many other tissues where its functions are highly important, so it seems unlikely that mutations would lead to symptoms restricted to the cornea. GDF-7 (growth differentiation factor 7) is a good candidate gene due to the existence of TGF-β domains in its sequence. TGF-β and related proteins have already been investigated for their possible implication in KC because of their importance in the corneal epithelium but to no avail. However, probable roles for GDF-7 have been identified so far in the rhesus monkey neocortex, in seminal vesicle differentiation, and in commissural interneuron growth in the mouse spinal cord where dysfunction of this protein leads to aberrant development of the mouse embryo.32–34 Therefore, it seems improbable that a gene essential to nervous system development could confer only a corneal phenotype when mutated in man. The functions of FLJ21820 and FLJ14249 remain unknown. The Acembly gene predictions correspond to spliced Image clones and indicate the localisation of yet unknown genes. Analysis of the expression of all these candidate genes is underway and genomic sequences are being screened for mutations.
Identification of a causative KC gene will improve our understanding of this disease and should aid discovery of other familial KC genes. Comprehension of the molecular biology and biochemistry that underlie KC will be a first step towards new therapeutic strategies to slow and prevent this blindness disorder in both the familial and sporadic forms.
We would like to thank the families for their participation in this research.
This work was supported by grants from INSERM, l’Association Retina France, l’association Valent Haüy, and the Délégation Régionale à la Recherche Clinique des Hôpitaux de Toulouse (2003–AOL No. 0303502). HH held a Fondation de France fellowship.
Competing interests: none declared
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.