Background: The limb girdle muscular dystrophies (LGMD) are a heterogeneous group of Mendelian disorders highlighted by weakness of the pelvic and shoulder girdle muscles. Seventeen autosomal loci have been so far identified and genetic tests are mandatory to distinguish among the forms. Mutations at the calpain 3 locus (CAPN3) cause LGMD type 2A.
Objective: To obtain unbiased information on the consequences of CAPN3 mutations.
Patients: 530 subjects with different grades of symptoms and 300 controls.
Methods: High throughput denaturing HPLC analysis of DNA pools.
Results: 141 LGMD2A cases were identified, carrying 82 different CAPN3 mutations (45 novel), along with 18 novel polymorphisms/variants. Females had a more favourable course than males. In 94% of the more severely affected patient group, the defect was also discovered in the second allele. This proves the sensitivity of the approach. CAPN3 mutations were found in 35.1% of classical LGMD phenotypes. Mutations were also found in 18.4% of atypical patients and in 12.6% of subjects with high serum creatine kinase levels.
Conclusions: A non-invasive and cost–effective strategy, based on the high throughput denaturing HPLC analysis of DNA pools, was used to obtain unbiased information on the consequences of CAPN3 mutations in the largest genetic study ever undertaken. This broadens the spectrum of LGMD2A phenotypes and sets the carrier frequency at 1:103.
- CK, creatine kinase
- HT-DHPLC, high throughput denaturing high performance liquid chromatography
- LGMD, limb girdle muscular dystrophy
- limb girdle muscular dystrophy
- calpain 3
- DNA pooling
Statistics from Altmetric.com
- CK, creatine kinase
- HT-DHPLC, high throughput denaturing high performance liquid chromatography
- LGMD, limb girdle muscular dystrophy
Limb girdle muscular dystrophies (LGMD) include a broad group of genetically determined progressive disorders with a primary or predominantly symmetrical atrophy of the pelvic and/or shoulder girdle musculature, raised serum creatine kinase, and a necrotic regeneration pattern.1 The clinical course is characterised by great variability, ranging from severe forms with onset in the first decade and rapid progression, to milder forms with later onset and a slower course. The most severe forms result in dramatic physical weakness which often leads to other serious complications and, for many, a shortened life span. Other forms can result in relatively minor physical disabilities or develop late in life, allowing affected people to have normal life expectancy and activity levels.
Seventeen LGMD loci have so far been mapped, seven autosomal dominant and 10 autosomal recessive.2 The autosomal dominant forms (LGMD1) are generally milder and relatively rare, representing less than 10% of all LGMD cases.3–8 The autosomal recessive forms (LGMD2) are much more common, having a prevalence of 1:15 000, with geographical differences.9 The products of 10 LGMD2 loci have been identified. They are: calpain 3 for LGMD2A,10 dysferlin for LGMD2B,11,12 α-sarcoglycan (adhalin) for LGMD2D,13 β-sarcoglycan for LGMD2E,14,15 γ-sarcoglycan for LGMD2C,16 δ-sarcoglycan for LGMD2F,17,18 telethonin for LGMD2G,19 TRIM32 for LGMD2H,20 fukutin related protein for LGMD2I,21 and titin for LGMD2J.22 Linkage analyses indicate that there is further genetic heterogeneity for both LGMD1 and LGMD2.
LGMD2A [MIM 253600] is one of the most prevalent forms of autosomal recessive muscular dystrophy in all countries so far investigated and is caused by loss of function mutations in the CAPN3 gene. This spans 52.8 kb of genomic sequence at chromosome 15q15.1, and the 3.5 kb transcript is composed of 24 exons encoding a 94 kDa proteolytic enzyme, calpain 3 or p94.10 This is a skeletal muscle specific member of the superfamily of the calpain large subunits, or calcium activated neutral proteases (EC 22.214.171.124), which are non-lysosomal intracellular cysteine proteases. The human genome encodes at least 15 large subunit genes that are supposed to process various intracellular kinases, phosphatases, phospholipases, transcription factors, and cytoskeletal proteins to modulate their activities.23 Calpain 3 can be subdivided into four domains. Domain II shows similarity with other cysteine proteases, which share histidine, cysteine, and asparagine residues at their active sites. Domain IV comprises four EF-hand structures that are potential calcium binding sites. In addition, there are three unique sequences, NS, IS1, and IS2. Although it is largely unknown why the absence of this protein leads to muscular dystrophy, it has been suggested that it may play a significant role in intracellular signal transduction systems.24,25 Baghdiguian et al26 claimed that calpain 3 deficiency would cause myonuclear apoptosis and a profound perturbation of the IkBa/NF-kB pathway.
Null mutation of calpain 3 in mice causes abnormal sarcomere formation27: in vitro studies showed that calpain 3 can bind and cleave titin and that some mutations result in reduced affinity for titin. The spectrum of mutations affecting the CAPN3 gene is highly heterogeneous. The Leiden muscular dystrophy web site (http://www.dmd.nl), a specialised mutation database for muscular dystrophies, to date reports 139 pathogenic mutations associated with a LGMD2A phenotype. They are distributed over almost all exons of the CAPN3 gene, with nine exons (1, 4, 5, 7, 10, 11, 13, 21, and 22) involved in about 85% of cases. Most mutations (70%) are private variants, although particular mutations were more frequent in some populations. Novel mutations were identified in Japanese LGMD2A patients, which were not found in non-Japanese populations.28
In a large study based on western blot analysis, 53 of 191 patients (28%) with the LGMD phenotype had partial or total calpain 3 deficiency, while this was observed in 8% patients with hyperCKaemia, 6% with proximal myopathy, and 6% with distal myopathy.25 Recently, Fanin et al29 identified 69 of 208 cases of calpain 3 deficiency (33.2%) by western blot analysis.
The present study was designed with two goals in mind: first, to approach the genetic heterogeneity of a complex disorder such as autosomal recessive LGMD with a non-invasive cost–effective strategy based on the screening of DNA pools; and second, to obtain a comprehensive non-biased understanding of the consequences of CAPN3 mutations in myopathic patients, including isolated cases with different degrees of disease.
Selection of patients
DNA samples from 530 LGMD patients and 300 controls were analysed by high throughput denaturing high performance liquid chromatography (HT-DHPLC) in the current survey. Patients in whom a mutation was afterwards identified in other LGMD genes were included. At least 92% of DNA samples were from Italian families, recruited at the Universities of Naples, Padua, and Rome. Additional patients were from Argentina (3), Croatia (6), Slovenia (5), Germany (6), Greece (1), and Turkey (20). Diagnostic criteria follow the European Neuromuscular Centre guidelines at http://www.enmc.org/nmd/diagnostic.cfm. These include limb girdle onset, progression, autosomal inheritance, no involvement of other systems, raised serum creatine kinase (CK), hypodensity on computed tomographic scanning, dystrophic changes on muscle biopsy and so on. As a spectrum of disease severity has been observed in association with most of the genetically defined types, the exclusion of other disorders is a vital part of the general diagnosis of LGMD. Although there is a continuous spectrum of disease severity, we have grouped subjects into five categories: (A) severe LGMD characterised by serious weakness and wheelchair requirement before the age of 35; (B) intermediate LGMD with weakness and difficulty in deambulation, but the ability to walk alone beyond the age of 35; (C) mild with symmetrical scapular winging and hyperCKaemia; (D) isolated hyperCKaemia; and (P) presymptomatic, including patients too young to be classified by clinical criteria but with suggestive laboratory findings (for example, raised CK, myopathic biopsy, and so on).
In all, 123 were classified as group A, 142 as B, 158 as C, 87 as D, and 20 as P. We included all isolated cases with apparent autosomal recessive inheritance. A preliminary linkage analysis using genetic markers specific for all the previously known limb girdle dystrophy loci was carried out on suitable families.30
Mutation discovery strategy
We used the following strategy (fig 1), structured into six sequential steps:
Accurate quantification of genomic DNA, with a fluorometric assay using a reporter dye such as PicoGreen (Molecular Probes, Eugene, Oregon, USA).
Dilution of each individual sample of genomic DNA and test of polymerase chain reaction (PCR) yield.
Pooling of two suitable DNA samples in the same tube.
Set up of finest PCR conditions, according to DHPLC analysis requirements (that is, special buffer formulations, primer design to mask known polymorphisms and so on).
Mutation screening of the entire cohort of our patients and normal controls.
Direct sequencing of abnormal migrating fragments only present in patients.
PCR primers for amplification of CAPN3 coding regions have been published previously.10 However, some of them produced amplicons showing suboptimal melting profiles when tested by Navigator software (Transgenomic Inc, Santa Clara, California, USA). In addition, the PCR yield was often unsuitable for the analysis. Thus most amplimers were redesigned. Table 1 reports the 26 PCR primer pairs for mutation screening of the coding sequence, together with the untranslated regions and the putative promoter. Amplified fragments were 93–296 bp in length. Here each primer pair was able to amplify an exon together with 25–50 bp of flanking intron sequences on both sides. The mutation studies were undertaken using a combined procedure based on DHPLC and the assembly of normalised DNA pools to speed up the screening. The melting characteristics of each PCR fragment were predicted using Navigator software. The elution was carried out with a two minute gradient (Rapid DNA) with a flow of 1.5 ml/min, a slope of 5%/min, and a drop loading of 3%.
DHPLC strategy and pooling rationale
We analysed the CAPN3 exons and flanking intronic sequences and the putative promoter using high throughput denaturing HPLC (HT-DHPLC).31–36 The high throughput configuration of the instrument requires only a two minute gradient run for the analysis. Table 1 shows the sequence of the primers, PCR product sizes, annealing temperatures, and the conditions of analysis by DHPLC.
DHPLC can detect heterozygous mutations in DNA samples in the form of an additional peak of heteroduplex DNA. When a subject carries a homozygous mutation, it is necessary to anneal the sample with a normal control. Instead of normal samples, we pooled equal amounts of DNA from two unrelated patients before PCR. This was sufficient to reduce the number of DHPLC analyses and PCR reactions by at least 66%. In fact, in both cases of a heterozygous or a homozygous mutation, heteroduplex DNA will be generated within a pool. A false negative on DHPLC can only result when a pool is composed of both DNAs that are homozygous for the same mutation. Nevertheless, this risk is very low, considering the frequency of the homozygous patients. Among the most frequent homozygous mutations were the following: 550delA was found in 12 of the 530 samples, 883-886GATA→CTT and 2212 C→T in four, 139delATC and 2242C→T in three, 802-9G→A and 2184G→A in two, and other 12 mutations only in one. Thus the probability of forming a false negative pool is 1:1312 (a2 + b2 + c2 +.../n2, where n is the total number of samples; 214/280 900): too low to affect our analysis of 265 pools.
Patients under study
We studied a randomly collected group of 530 patients and 300 controls. Most patients are isolated cases with apparent autosomal recessive inheritance. We found at least one mutation in 141 patients and only 42 of these were homozygous (29.8%). Eighty seven of them (61.7%) were isolated cases, while 54 (38.3%) were familial.
They were classified as severe LGMD (group A), intermediate LGMD (group B), mild myopathy (group C), asymptomatic hyperCKaemia (group D), and presymptomatic patients (group P). We found 49/123 mutations (39.8%) in group A, 44/142 (31.0%) in group B, 29/158 (18.4%) in group C, 11/87 (12.6%) in group D, and 8/20 in group P (40.0%) (table 2). This strongly suggests that in typical LGMD with wheelchair confinement before or after age 35 (groups A and B), CAPN3 mutations are the most frequent genetic cause, as no other LGMD form involved 35.1% of patients. In addition, a mutation of the second allele was identified in the large majority of group A and B patients (46/49 = 93.9% and 38/44 = 86.4%, respectively), but only in 15/29 = 51.7% of group C and 3/11 = 27.3% of group D patients. As the sensitivity of DHPLC is identical for all samples, a large proportion of heterozygous patients belonging to group C and D might not have a second mutation. In milder myopathies CAPN3 mutations are no more common than DYSF or FKRP mutations. The mutation detection rate among patients with defects of calpain 3 on western blot analysis was 56%.
The CK level in patients in all groups was 11.1-fold above the upper limit of normal (range threefold to 25-fold). However, there were some young patients who had very high CK levels (up to 100×) and four with normal CK.
In accordance with the observations of Zatz and co-workers,37 we found more rapid progression in LGMD2A affected males than in females. Considering patients with both mutations, females in 34.0% of cases were in group A, 48.9% were in group B, while in the males 52.6% were in group A and 26.3% in group B. As the difference between groups A and B depends on the age of wheelchair confinement, females tend to lose their ability to walk later.
Mutations and polymorphisms
Altogether, 82 different mutated alleles were detected in the CAPN3 gene, of which 45 (54.9%) are novel (table 3). No putative mutations were encountered in 300 controls. The mutations include 66 single base pair substitutions (18 transversions and 48 transitions) which resulted in a change of amino acid in 54 cases (81.8%), in premature stop codons in five cases, and in splicing defects in seven cases. An additional 11 mutations were small deletions, one was an insertion, and two were indels. All but five caused a frame shift, leading to a secondary premature stop codon. We identified 27 novel mutations affecting single amino acids that are conserved. However, this criterion is not fully reliable as the conserved glutamic acid 107 is polymorphic in humans. Some mutations affect domains that are common to all the other members of the calpain superfamily: for example, F779 is part of an EF-hand calcium binding motif DFXXF. Patients with the same mutations have a variable phenotypic spectrum. For example, among 12 cases that are homozygous for the 550delA allele, six belong to group A, five to group B, and one to group C.
When the mutation of the second allele was not found, we tried to identify it by reverse transcriptase polymerase chain reaction (RT-PCR) from muscle RNA, when available. We also carried out long range PCR analysis. No major genomic deletion in the calpain 3 gene was identified among these patients.
We found mutations scattered over 21 of the 24 exons of the CAPN3 gene, with no mutation in exons 9 (78 bp), 12 (12 bp), and 23 (59 bp), all of which are among the smallest exons and in which no definite mutation has been described so far. Interestingly, we found 10 mutations in exon 1, seven of which are novel (70%). Alternatively spliced exons, such as 15 and 16, relatively more often show out of frame mutations.
We identified three novel polymorphisms (498+35G→T, 606T→C, and 1524+81C→T) found both in patients and controls, and 15 novel variants in addition to others that are already known (table 4). This could be a major drawback for the analysis, as polymorphisms generate heteroduplexes that may mask true mutations. We therefore designed additional primers that cover these polymorphisms and rescreened samples. We only sequenced the heteroduplexes that were maintained following this second PCR. Otherwise, we have sequenced all the heteroduplex-forming fragments to avoid missing mutations. We were unable to show an association between six synonymous and non-synonymous variants and LGMD2A: these were found only once in patients and not in controls and therefore could not be classified (table 4).
We present the results of all the sequence variations identified in the CAPN3 gene. Mutations of the CAPN3 gene cause the most prevalent form of limb girdle muscular dystrophy, LGMD2A. The phenotypic consequences of CAPN3 defects are similar to those caused by mutations in many other known and unknown LGMD genes. This immediately indicates first, that direct genetic testing is necessary to discriminate between the forms; and second, that in a large fraction of LGMDs a CAPN3 mutation can be found. Unfortunately, the analysis of CAPN3 gene is complex, as it is composed of 24 exons and has 218 pathogenic and non-pathogenic alleles, including those reported here. There is also no prevalent “hot spot” mutation that can be routinely screened, unlike the cystic fibrosis gene.
We set up a high throughput strategy for the systematic analysis of the CAPN3 gene using DHPLC. All comparative studies between methods for mutation discovery confirmed the greater efficacy of DHPLC, which offers the advantages of sensitivity, automation, and cost–effectiveness.31–36 In addition, HT-DHPLC adds a fourth benefit, speed. Compared with standard DHPLC, it cuts analysis time and reduces human intervention: two 384 well plates (768 samples) are automatically analysed in 54 hours. For further enhancement, we analysed PCR products from DNA pools, as the system is able to distinguish a mutant allele among several wild-type alleles. Sample pooling before PCR cuts the costs of the entire procedure, making it the cheapest of all the mutation discovery strategies. In comparison with direct sequencing, standard DHPLC is at least eight times cheaper,32 but together with high throughput sample pooling the entire procedure becomes up to 20 times cheaper.
With DHPLC, any sequence polymorphism that generates heteroduplex DNA may confound results by masking pathogenic changes. This problem can be solved in two different ways: first, by sequencing all the heteroduplex forming fragments to avoid missing mutations; second, by designing primers that cover or exclude the polymorphic site, when the latter is intronic. We show that our approach to mutation discovery had an optimal sensitivity based on the following data: in more than 90% of patients with typical severe or intermediate LGMD (groups A and B) we were able to discover the second allele mutation. DNA based exon analyses miss deep intronic mutations that activate cryptic splice sites by adding extra sequences. This does not seem to be the case, as RT-PCR, when possible, was unable to detect any defect.
A general problem of human molecular genetics is in the genotype–phenotype correlation for non-truncating mutations. Mutation discovery strategies should discriminate between true pathogenic defects versus rare innocuous variants. Even if the disease is familial and strongly linked to a locus, one must take into account that any sequence variation co-segregates with the true but unidentified mutation. Thus each variation observed only once should be considered a “putative mutation” only when a complete study of the gene and an adequate number of controls has been made. This implies that heterozygous status should not be classified as autosomal recessive LGMD2A in the absence of the second mutation. The number of controls is a critical issue and depends on the heterogeneity of neutral alleles.38 We consider the standard analysis of 100 normal individuals inadequate. Pooled DHPLC analysis can achieve this goal by increasing the number of chromosomes analysed at reasonable cost.
Unlike all the other studies on LGMD2A, we included isolated cases and milder and atypical patients. This is helpful in defining a wider non-biased phenotypic spectrum associated with defects at the CAPN3 locus.
We identified 82 different mutations. The distribution of mutations along the gene somewhat differs from that reported in the literature, suggesting both population differences and methodological bias. The most frequently mutated exons in our study are 1, 4, 5, 6, 7, 10, 11, 13, and 21 accounting for 87% of all LGMD2A alleles. In Brazilian families, exons 2 and 22 should be included in the amplicons first analysed. Unexpectedly, we found 10 exon 1 variants that constitute 70% of all mutations described worldwide. This may be result from a fault in the screening of this exon by most groups, as our detection rate was enhanced when we split exon 1 in two overlapping fragments (table 1). The most frequent allele is del 550 A which was found 45 times. Six of 12 del550 homozygous patients were from north eastern Italy. The fraction is much lower than in Croatia,39 but higher than in southern Italy. This confirms the hypothesis of a common founder.40
We then compared mutation discovery results with available western blot data. Seventy patients with reduced or absent calpain 3 western blot were analysed and we found a causative mutation in 56% of cases. This, together with the data by Fanin et al,29 implies that western blot analysis alone could give both to false positive and false negative results. The conclusive diagnosis of LGMD2A is thus molecular. We show that reliability only derives from very large patient-to-control studies that are feasible using high throughput mutation analysis techniques.
Accordingly, the milder the phenotype, the higher the percentage of patients with one single CAPN3 mutation. Considering group A and B patients in the current definition of LGMD with autosomal recessive inheritance, 93 of 265 subjects (35.1%) can be classified as LGMD2A. Mutations were also found in patients belonging to group C (18.4%) in half of which the second mutation was not found. The results of the screening indicate that 12.6% of patients with no weakness and increased CK levels (group D) do have CAPN3 mutations.
At the end of this study there were patients from groups C and D with a single pathogenic change. We think that these patients may either be true heterozygotes for a pathogenic CAPN3 allele or may carry a yet unrecognised hypomorphic allele variation. In some of these patients, we found polymorphisms or variants, also identified in normal controls: one should assume that they are neutral changes. However, the combination of a polymorphism in one copy of the CAPN3 gene with an LGMD2A mutation in the other copy could even be responsible for very mild phenotypes with low penetrance, similar to the effect of the 5T allele in cystic fibrosis.41,42
The more rapid progression in affected males than females has no explanation. If sex steroids modulate the calpain 3 mechanism, this could have implications for treatment.
When the involvement of CAPN3 gene in disease was reported for the first time,10 six different mutations were found within La Réunion Island patients, although a single founder was expected. A digenic inheritance model and several alternative hypotheses were proposed by Beckmann to explain the so called “La Réunion paradox.” Considering the number of patients recruited by all centres and the population of origin, we estimate that autosomal recessive LGMDs have a cumulative prevalence of 1:15 000, and 35.1% of the cases are LGMD2A. Thus the prevalence of typical LGMD2A should be 1:42 700 and the carrier frequency 1:103. Additional mutations are carried by atypical patients or subjects with high CK, most of which are not recruited for genetic studies. This recalculation of the carrier frequency helps to explain the paradox. La Reunion now has a population of 720 900 people, so we expect to find at least 7000 disease carriers. During the years 1707 to 1804 the population of the island increased 88-fold. This implies that multiple waves of immigration occurred and many different founders were introduced.
To our knowledge, this is the largest molecular study of LGMD2A in a highly heterogeneous population of patients. CAPN3 mutations account for about one third of classical LGMD cases and the other nine genes account for about an additional third of cases; the remaining third will be caused by as yet unknown genetic causes. The question arises as to whether there is a multitude of genes each accounting for a limited number of these cases, or there are a few major disease genes that cause all these forms. The former may be more realistic and similar to the widespread genetic heterogeneity observed in other human diseases (for example, retinitis pigmentosa or mental retardation).43–47
We will make available materials used for the present investigation upon justified request to the corresponding author. Mutation information is being submitted to Leiden public database.
We thank all the LGMD families. We acknowledge Maria Esposito and Manuela Dionisi at the TIGEM Mutation Detection facility. We also thank Francesca D’Amico and Maria Rosaria Righetti for help in mutation discovery and the TIGEM-IGB sequencing core facility. Control samples were from Naples Human Mutation Gene Bank.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.