Background Primary ciliary dyskinesia (PCD) is a rare, genetically heterogeneous ciliopathy disorder affecting cilia and sperm motility. A range of ultrastructural defects of the axoneme underlie the disease, which is characterised by chronic respiratory symptoms and obstructive lung disease, infertility and body axis laterality defects. We applied a next-generation sequencing approach to identify the gene responsible for this phenotype in two consanguineous families.
Methods and results Data from whole-exome sequencing in a consanguineous Turkish family, and whole-genome sequencing in the obligate carrier parents of a consanguineous Pakistani family was combined to identify homozygous loss-of-function mutations in ARMC4, segregating in all five affected individuals from both families. Both families carried nonsense mutations within the highly conserved armadillo repeat region of ARMC4: c.2675C>A; pSer892* and c.1972G>T; p.Glu658*. A deficiency of ARMC4 protein was seen in patient's respiratory cilia accompanied by loss of the distal outer dynein arm motors responsible for generating ciliary beating, giving rise to cilia immotility. ARMC4 gene expression is upregulated during ciliogenesis, and we found a predicted interaction with the outer dynein arm protein DNAI2, mutations in which also cause PCD.
Conclusions We report the first use of whole-genome sequencing to identify gene mutations causing PCD. Loss-of-function mutations in ARMC4 cause PCD with situs inversus and cilia immotility, associated with a loss of the distal outer (but not inner) dynein arms. This addition of ARMC4 to the list of genes associated with ciliary outer dynein arm defects expands our understanding of the complexities of PCD genetics.
- Clinical Genetics
- Molecular Genetics
- Other Respiratory Medicine
This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 3.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/3.0/
Statistics from Altmetric.com
Primary ciliary dyskinesia (PCD; MIM244400) is a heterogeneous genetic disorder arising from ultrastructural defects that cause abnormal function of motile cilia and sperm flagella.1 The disease has an autosomal recessive mode of inheritance and affects one in every 15 000–30 000 births. Motile cilia are hair-like organelles found on the epithelial surface of the respiratory airway tract, the brain ependyma and fallopian tubes. Their axonemal structure consists of nine peripheral outer doublet microtubules surrounding a central microtubular pair (9+2 arrangement), a highly similar structure to that found in sperm tail flagella. During embryogenesis, motile monocilia at the node lack the central pair apparatus (9+0 arrangement). Microtubule-associated protein complexes are attached along the length of the axoneme at regularly intervals, which regulate axonemal stability and ciliary motility. Of these, the inner and outer dynein arms (IDA and ODA) are responsible for beat generation together with radial spoke and nexin-dynein regulatory complexes.
Abnormal motility of respiratory cilia leads to congestion of the body's mucociliary clearance mechanism, causing a number of symptoms in PCD patients which include neonatal respiratory distress, chronic respiratory infections, sinusitis, otitis media and destructive lung disease (bronchiectasis).2 Other features include subfertility in both sexes and left-right organ laterality abnormalities, predominantly situs inversus, and occasional hydrocephalus. In a proportion of patients, severe heterotaxic isomerisms and cardiac malformations can occur.3 So far, genetic studies have identified mutations in over 20 genes leading to various ultrastructural defects, including RPGR which causes a syndromic form of PCD.4 Large-scale transmission electon microscopy (TEM) studies in patients suggest that 65% of PCD arises from defects involving the outer dynein arms.5 ,6 Of the genes which are known to cause these defects (reduction or loss of the ODAs) when mutated, DNAH5, DNAH11, DNAI1, DNAI2, DNAL1 and NME8 (previously TXNDC3) encode subunits of the axonemal outer dynein arm components, and CCDC114 encodes an outer dynein arm-docking complex component. Mutations causing PCD with ODA defects were also recently described by Hjeij et al in ARMC4, encoding a protein involved in assembling outer dynein arms into cilia which is likely involved in their targeting and/or anchoring onto microtubules.7 Mutations involving deficiency of both the outer and inner dynein arms have also been identified in genes that encode a group of cytoplasmic proteins (DNAAF1/LRRC50, DNAAF2/KTU, DNAAF3, CCDC103, HEATR2, LRRC6 and DYX1C1), which are likely to play a role in the preassembly of the dynein arm components and/or in their axonemal transport.4 ,8 Additionally, mutations have been reported in genes that encode protein subunits of the radial spoke heads (RSPH4A, RSPH9), proteins linked to the nexin-dynein regulatory complexes (CCDC39, CCDC40, CCDC164) and the central pair apparatus (HYDIN).4
In order to determine the genetic basis of disease in PCD families, we have employed a next-generation sequencing approach. All patient samples in this study were obtained with informed consent according to the protocols approved by the ethical committees of the Institute of Child Health/Great Ormond Street Hospital (#08/H0713/82) and those of collaborating institutions. First, whole-exome sequencing was performed in one affected individual (II:1) of a consanguineous Turkish family PCD-221 at the Wellcome Trust Sanger Institute (Cambridge, UK) as part of the UK10K project.9 PCD-221 II:1 is one of two affected siblings who present with a classic clinical course, both having laterality defects, respiratory symptoms and recurrent chest infections since a young age, compromised lung function, chronic ear, nose and throat (ENT) symptoms including rhinitis and otitis media. PCD-221 II:1 suffers from frequent pneumonias and bronchiectasis, and II:2 has had surgery for hydronephrosis. Additionally, both siblings have intellectual and developmental delay, and an unusual ocular phenotype of bilateral ptosis, variable divergent strabismus and upgaze paresis with poor horizontal saccades, suggestive of a brain stem or cranial nerve disinnervation syndrome. Exome sequencing and variant calling was performed as previously described, using approximately 3 µg of genomic DNA and the Agilent Technologies Human All Exon 50 Mb kit.4 ,9 Over 3 Gb of sequence was generated, such that >68% of the target exome was present at greater than 20-fold coverage (see online supplementary table S1). Analysis of the exome variant profile was performed using the EVAR software tool V0.2.2 β. Copy number variations (CNV) were analysed from the exome data using ExomeDepth.10
To prioritise candidate genes, we based our analysis on the rare-recessive disease model, with the knowledge that PCD is largely caused by mutations affecting the protein-coding region of genes. Since the PCD-221 II:1 individual is the offspring of a consanguineous marriage, we focused on homozygous variants predicted to cause non-synonymous or splice-site substitutions or indels. We also filtered to prioritise only those that were either novel or present in the 1000 Genomes Project exome database with a frequency <0.01. We next used our in-house internal allele count data, removing variants detected more than 10 times across a database of 500 exomes available from the UK10K_RARE cohort (http://www.uk10k.org/studies/rarediseases.html), because PCD-causing mutations would not be predicted as likely to appear in multiple well-phenotyped non-PCD patients. This filtering strategy revealed eight homozygous variants of interest that met these criteria, which are listed in online supplementary table S2. We proceeded to search for the presence of these genes and their species-conserved homologues in the Cilia Proteome database,11 and this identified three genes EEF1D, MYO1D, ARMC4 (see online supplementary table S2). Of these, EEF1D encoding a translation elongation factor was excluded on gene function grounds12 and also since further analysis showed that the missense change identified was only in a highly truncated transcript of unknown functional significance (ENST00000532400). The MYO1D variant was also excluded based on putative gene function despite the suggested link between MYO1D and left-right asymmetry determination since it is a widely expressed cytoskeleton-associated unconventional myosin.13 ,14 Furthermore, the identified variant was a missense mutation scored as ‘benign’ (score 0.292) in Polyphen-2 and ‘tolerated’ in Sorting Intolerant From Tolerant software (SIFT) for its effect on protein function (c.2585A>T; p.His862Leu). This left a single homozygous protein-truncating nonsense variant (c.2675C>A; pSer892*) in ARMC4. The expected damaging effect of this variant, which was the only predicted null-effect allele in the final filtered set, provides strong support for its likely pathogenic role. Segregation analysis in all available members of the family including the affected sibling confirmed correct recessive inheritance of this variant (figure 1A and see online supplementary figure S1).
In parallel, whole-genome sequencing (WGS) was performed as part of the UK10K project in the two unaffected, obligate carrier parents of a consanguineous Pakistani PCD family PCD-141 (individuals I:1 and I:2 figure 1A), since material from their affected offspring was not sufficient for exome sequencing. In this family, there are three affected siblings, all of whom display situs inversus and classic disease symptoms similar to those described for PCD221, including repeated infections of the chest, chronic nasal discharges and bronchiectasis. Additionally, all three siblings have had surgery to remove nasal polyps. For WGS, approximately 3 µg of genomic DNA was sheared to 100–1000 bp (Covaris) and the sheared DNA subjected to Illumina Paired-end DNA library preparation. Following size selection (300–500 bp insert size), DNA libraries were sequenced as 100 bp paired-end reads on the HiSeq platform (Illumina). For each subject, more than 91% of genomic bases were represented by at least 24 reads (see online supplementary table S3). We filtered per chromosome, to identify protein-altering heterozygous variants shared by both parents with a MAF<0.01 in the 1000 Genomes Project exome database, using the same criteria as for the exome sequencing. 522 heterozygous variants meeting the filtering criteria were shared between the two parents, out of more than nearly three million heterozygous variants per parental sample. Of these, just 16 shared variants were found in genes represented in the Cilia Proteome database, and only one, ARMC4, had any functional annotation suggestive of a role in cilia motility. Thus, this strategy as detailed in online supplementary table S4 revealed a second ARMC4 protein-truncating nonsense variant (c.1972G>T; p.Glu658*). Notably, of the 16 variants that were shared and homozygous, this was the only stop-gained effect allele, providing further support for a disease-causing effect. Segregation analysis by Sanger sequencing in all available family members confirmed recessive inheritance with consistent genotypes (figure 1A and see online supplementary figure S1). The lack of an ocular phenotype in anyone from family PCD-141 suggests that this finding in family PCD-221 is unconnected to ARMC4 mutations, and further analysis of the PCD-221 exome data is ongoing to investigate potential other loci.
Another study of mutations in ARMC4 causing PCD recently reported that the protein is involved in outer dynein arm assembly into the cilia and probably has a role in their correct axonemal docking and targeting.7 Furthermore ARMC4, encoding the 1044 amino acid Armadillo repeat-containing protein 4, has previously been implicated in ciliogenesis.15 We also used the UMCG Groningen Gene Network tool which analyses data from 80 000 Gene Expression Omnibus microarrays to predict gene function in Gene Ontology Consortium terms, and found the top-scoring predictions for ARMC4 were highly significant and all involved in cilia functions: ciliary or flagella motility (p=6.10×10−19), microtubule-based movement (p=1.16×10−9) and cilium assembly (p=1.16×10−9). Interestingly, ARMC4 was previously proposed to be an axonemal protein equivalent to radial spoke protein 8 (RSP8) of Chlamydomonas, however, this was acknowledged to be a low-scoring homology.16 There appears to be no misfunction of the radial spokes in ARMC4-deficient cilia, and this seems to be an erroneous homology, due to the multiple armadillo repeats present in ARMC4 and RSP8 proteins.17
We proceeded to use protein modelling to further investigate ARMC4 function. We could not detect a clear homologue in the PCD model species, Chlamydomonas, but BLAST and SMART18 protein domain homology searches showed that there are 12 predicted ARM repeats at the C-terminus, in addition to three low amino acid complexity regions of unknown significance in the N-terminus (figure 1B). This contrasts with the study of Hjeij et al7 which predicts 10 ARM motifs and one HEAT repeat in ARMC4, as is annotated in the Swissprot database (http://www.ncbi.nlm.nih.gov/protein/74744660). Both models are based on predictions rather than experimental evidence. The N-terminus did not contain sufficient similarity to any known protein domains to allow modelling, however, a structural model could be generated for the C-terminus using I-TASSER 3D-structural model prediction. This predicts that the tandem ARM-repeat domains of ARMC4 fold together as a series of tandem helices forming a superhelix, that creates a surface or groove for protein interaction similar to that of the β-catenin ARM repeat structure (figure 1B).19 ,20 The two nonsense mutations identified in the PCD families are most likely to be non-functional through nonsense-mediated decay (NMD); as shown in figure 1B, they would both remove multiple ARM domains and that by inference would likely influence the protein binding capabilities of ARMC4 if a truncated form was present.
We then performed a STRING search to look for predicted protein–protein interactions for ARMC4. This interactome analysis generated a small network (figure 1C) including a predicted direct interaction between ARMC4 and the dynein intermediate chain protein DNAI2, a known component of the outer dynein arm located at the ODA base, mutations in which are also responsible for causing PCD with outer dynein arm defects and cilia immotility.21 Notably, a direct interaction with FOXJ1 the master regulator of motile ciliogenesis was also predicted in this analysis. FOXJ1 is essential for assembly of motile cilia in vertebrates, through the regulation of genes specific to motile cilia.22 Last, we investigated ARMC4 expression during ciliogenesis of normal human ciliated bronchial epithelial (NHBE) cells by TaqMan qPCR. ARMC4 transcript levels were undetectable in non-ciliated basal NHBE cells, whereas after ciliogenesis, a significant upregulation (over 1000-fold) of ARMC4 expression was observed. Similar results were obtained for DNAI1, a known ciliary gene (figure 1D).
To understand the impact of mutations in ARMC4, we examined transmission electron microscopy (TEM) of respiratory cilia cross-sections from nasal samples collected from the two families carrying ARMC4 mutations. TEM was performed as previously described.6 All affected individuals displayed a loss of the outer dynein arms, as demonstrated in figure 2A for the affected individuals PCD-221 II:1 and PCD-141 II:5. In order to further investigate ARMC4 function, we analysed its subcellular localisation by high-resolution immunofluorescence microscopy in the patient's ciliated epithelial cells as previously described.23 In control cells, ARMC4 protein was observed to localise along the whole length of the cilia axoneme (figure 2B). However, in the PCD patient, PCD-221 II:1, severely reduced levels of ARMC4 along the cilia axoneme were observed in contrast to the control individual, in comparison to the axonemal marker acetylated α-tubulin which was unaffected (figure 2B). There was a notable accumulation of ARMC4 staining in the cell body in the patient's cells, apparently clustered directly underneath the ciliary base. This needs further investigation but could conceivably represent an accumulation of truncated protein not subject to NMD, and it is also seen in the previous study of Hjeij et al7 in patients carrying ARMC4 termination mutations. We next used two well-established diagnostic markers of axoneme integrity, DNAH5 and DNALI1, to examine cilia structure in ARMC4 patients. DNAH5 detects both the different types of outer dynein arms that have previously been described in respiratory cilia: one class at the distal half of the axoneme (DNAH5-positive, DNAH9-positive) and one class at the proximal end (DNAH5-positive, DNAH9-negative).24 DNALI1 stains the inner dynein arms along the entire axoneme length.21 This analysis confirmed the presence of the IDAs in cilia of PCD-221 II:1 (see online supplementary figure S2). However, in contrast there was a complete loss of DNAH5 staining at the distal ends of the cilia, but a retention of a reduced level of DNAH5 staining in the proximal ends of cilia (figure 2B). This indicates the loss of distal DNAH5-positive ODAs but retention of proximal DNAH5-positive ODAs which is in agreement with previous observations by Hjeij et al7 in ARMC4-deficient cilia.
To demonstrate the effect of ARMC4 deficiency on ciliary beat frequency, we also performed high-speed videomicroscopic analysis of patient's nasal cilia as previously described.23 In both the affected siblings PCD-221 II:1 and II:2, the cilia were immotile compared with controls, which is a consistent pattern as seen in many other patients with outer dynein arm defects due to mutations in various different genes. The occasional twitch was seen in some cilia but in the majority they were completely static (see online supplementary videos 1–2). Taken together, these findings suggest that genetic defects in ARMC4 result in loss of selected ODAs.
In summary, here we show that loss-of-function mutations in ARMC4 cause PCD associated with left-right axis defects and a loss of the cilia's distal outer dynein arms. Both the nonsense mutations identified result in a deficiency of the ARMC4 protein along the length of the ciliary axoneme with an accompanying loss of the distal but not the proximal outer dynein arms, as defined by DNAH5 immunostaining, and this is associated with cilia immotility. Both mutations affect the highly conserved ARM repeat superhelix at the protein's C-terminus, likely disrupting its interactions with other protein partners. ARMC4 is, therefore, the eighth gene to be associated with a deficiency of outer dynein arms, and retention of the inner dynein arms. A previous report7 has shown that ARMC4 is not likely to be an integral component of the outer dynein arm, but is more probably involved in ODA targeting, docking or attachment. It is known that armadillo repeat-containing proteins can have more than one function in cells, potentially interacting with different protein partners.19 The putative interaction we detected between ARMC4 and DNAI2 requires further experimental proof, however if true, it would potentially localise ARMC4 close to DNAI2 within the axonemal outer dynein arm structures, towards the base of the outer dynein arm. Notably, Hjeij have shown in ARMC4-deficient cilia that DNAI2 similarly to DNAH5 is present proximally but absent distally from the ciliary axonemes.
Our findings have clinical application since we have demonstrated for the first time that mutations in genes causing PCD can be identified by combining whole-exome and whole-genome sequencing data. Furthermore, we conclude that the downstream analysis of WGS data can enable the identification of mutations in genes causing disease in cases where genetic material of patients is unavailable or hard to obtain. Our analysis of parental sequencing data of PCD patients shows this can be used to assist the genetic diagnosis of the disease. ARMC4 joins an important group of highly conserved ARM repeat-containing proteins associated with the ciliary axoneme that play roles in motility which includes RSP8, RSP14 and PF16.17 ,19 ,25 The exact nature of the essential role of ARMC4 in targeting the outer dynein arms to cilia remains to be fully characterised, however, these results further expand our understanding of the molecular genetic basis of PCD, and facilitate the rapidly growing application of genetics in PCD diagnostics.
1000 Genomes Project, http://www.1000genomes.org
UMCG Groningen Gene Network, http://genenetwork.nl:8080/GeneNetwork/
We would like to thank the PCD families for their participation in the study, and the PCD Family Support Group. We are also grateful to the physicians involved in analysis of the families especially Alison Male and Siobhan Carr. We thank Sarah Ollosson and Andrew Rogers for light and electron microscopy. The Centre for Translational Genomics-GOSgene at the UCL Institute of Child Health is supported by the National Institute for Health Research Biomedical Research Centre at Great Ormond Street Hospital for Children NHS Foundation Trust and UCL Institute of Child Health. We are grateful to the UK10K consortium in particular the Rare Diseases Group for making this study possible; a full list of the UK10K investigators is available at http://www.uk10k.org/publications_and_posters.html.
This web only file has been produced by the BMJ Publishing Group from an electronic file supplied by the author(s) and has not been edited for content.
Files in this Data Supplement:
Contributors Exome and genome sequencing data is from the UK10K project. Analysis of the sequence data was performed by AO, MS, CTJ and CB. Gene expression analysis was by MMM and SLH. Immunofluorescence and other molecular analysis was performed by AO, MS and MP. Clinical studies are from EMR, JED-R, CH, JJS, SH, GP, ATM and EMKC. Electron microscopy and video analysis by AS and CH. AO coordinated and performed molecular studies, he and HMM designed the study and wrote the manuscript.
Funding Wellcome Trust, Milena Carvajal Pro-Kartagener Foundation, Action Medical Research, Newlife Foundation.
Competing interests Funding for UK10K was provided by the Wellcome Trust under award WT091310. JJS, SH and ATM are supported by the Moorfields Eye Hospital Biomedical Research Centre. P.J.S. is supported by the Wellcome Trust and the British Heart Foundation. P.L.B. is a Wellcome Trust Senior Fellow. MS is supported by an Action Medical Research UK Clinical Training Fellowship. MMM, PLB, PJS, SLH and HMM are supported by the Great Ormond Street Hospital Children's Charity and a Child Health Research Appeal Trust funded PhD studentship to MMM (SLH). GP is supported by the Dutch patient organisation PCD Belangengroep, funded by ‘It Krystteam’ (Friesland). EMKC and HMM are supported by grants from the Milena Carvajal Pro-Kartagener Foundation, Action Medical Research (GN1773, GN2101) and Newlife Foundation for Disabled Children UK (10-11/15).
Ethics approval Ethical Committee of the Institute of Child Health/Great Ormond Street Hospital (#08/H0713/82).
Provenance and peer review Not commissioned; externally peer reviewed.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.