Background: Pulmonary fibrosis is a complex disease for which the predisposing genetic variants remain unknown. In a prior study, susceptibility to bleomycin induced pulmonary fibrosis was mapped to loci Blmpf1 and Blmpf2 on chromosomes 17 and 11, respectively, in a C57BL/6J (B6, susceptible) and C3Hf/KAM (C3H, resistant) mouse cross.
Methods: Herein, the genetic basis of bleomycin induced pulmonary fibrosis was investigated in an approach combining gene expression and sequencing data with previously mapped linkage intervals.
Results: In this study, gene expression analysis with microarrays revealed 1892 genes or ESTs (expressed sequence tags) to be differentially expressed between bleomycin treated B6 and C3H mice and 67 of these genetic elements map to Blmpf1 or Blmpf2. This group included genes involved in an oxidative stress response, in apoptosis, and in immune regulation. A comparison of the B6 and C3H sequence, for Blmpf1 and Blmpf2, made using the NCBI database and available C3H sequence, revealed approximately 35% of the genes in these regions contain non-synonymous coding sequence changes. An assessment of genotype/phenotype correlation among other inbred strains revealed 36% of these B6/C3H sequence variations predict for the known bleomycin induced fibrosis susceptibility of the DBA (susceptible) and A/J (resistant) mouse strains.
Conclusions: Combining genomics approaches of differential gene expression and sequence variation potentially identifies approximately 5% the linked genes as fibrosis susceptibility candidate genes in this mouse cross.
- ESTs, expressed sequence tags
- linkage analysis
Statistics from Altmetric.com
Pulmonary fibrosis is a disease in which the lung response to known (such as environmental toxins or radiation therapy) or unknown injuries produces a functionally compromising pathology. This devastating disease has an estimated incidence of 1 case per 10 000 per year and associated mortality is 50–70% at 5 years post diagnosis.1 The disease is characterised by cellular proliferation and progressive accumulation of extracellular matrix constituents resulting in remodelling of the lung interstitium.1 The mechanisms through which this functionally compromising pathology develops are unknown, but inflammatory,2–4 apoptotic,1 and matrix metalloproteinase/tissue inhibitor of metalloproteinase imbalance2,5 pathways have been implicated. Based on reports of familial pulmonary fibrosis, the disease is thought to have a genetic basis but the specific genes involved have not been identified.1,6
To circumvent the clinical limitations of environmental variability, genetic heterogeneity, and small sample sizes, investigations of inbred mouse strains which differ in their susceptibility to the fibrotic phenotype7,8 can be used to identify fibrosis susceptibility genes. Specifically, the anti-neoplastic drug bleomycin, which when used clinically leads to the development of fibrosis in approximately 5% of the exposed clinical population,9 when delivered through a 7 day subcutaneous dose has also been found to produce a fibrotic phenotype in mice which resembles idiopathic pulmonary fibrosis.10
We have previously mapped two loci of murine strain difference in propensity to develop fibrosis after bleomycin treatment in a C57BL/6J (B6, susceptible) and C3H (resistant) mouse model.11 The influence of the loci, named Blmpf1 (bleomycin induced pulmonary fibrosis 1) on chromosome 17 (LOD = 18.4) and Blmpf2 on chromosome 11 (LOD = 5.6) on the fibrotic phenotype, and the interaction of the two loci, were confirmed in additional studies of congenic and chromosome substitution mice.11
Consistent with the initial mapping of many complex traits, the Blmpf1 and Blmpf2 intervals are too large (approximately 13 and 26 Mb, respectively) to permit a practical assessment of individual candidate genes in the search for causative genetic variation. Thus, our approach to identifying candidate genes of fibrosis susceptibility has been to combine the results from additional genomic experiments with the mapping data. Specifically, gene expression analysis with microarrays, parental strain DNA sequence analysis, and genotype/phenotype correlation in additional mouse strains, were used to produce a set of candidate genes involved in the pathogenesis of experimental fibrosis. The resources available for this study included the genomic sequence of B6 mice12 and the partial genomic sequence of the C3H strain,13 and the available genome sequence (Celera) and documented bleomycin induced fibrosis susceptibility of inbred strains DBA/2 (susceptible),14 A/J (resistant),7 and 129 (susceptible).15
In this study, we present evidence for a set of candidate genes underlying the susceptibility loci Blmpf1 and Blmpf2. The fibrosis genes were isolated from among the mapped positional candidates, by identifying the subset of positional candidate genes which are differentially expressed between B6 and C3H mice in the bleomycin treated lung, and for which there is a sequence variation between the strains. These candidate genes were also assessed for genotypic correlation in mouse strains of known fibrosis susceptibility phenotype.
Mice of the C57BL/6J (B6) and C3H/HeJ (C3H) strains were purchased from Jackson Laboratories (Bar Harbor, ME) and housed in the animal facility of the Meakins-Christie Laboratories.
Mice were treated at 8 weeks of age. Lung damage was elicited by administering bleomycin through osmotic minipumps implanted subcutaneously, as described previously.16 Male mice received 100 units of bleomycin per kg body weight (∼2.5 units per mouse) and female mice received 125 units per kg body weight. Male and female mice were treated in separate studies due to the higher drug dose required for female mice and due to the sex specificity of the previously mapped loci. The mice were sacrificed (n = 4–8 mice per group) when moribund or at 3 or 6 weeks after treatment. Twenty untreated mice, five males and five females of each strain, were used as controls and were sacrificed at the 6 week time point.
Histology and fibrosis scoring
At autopsy, the lungs were removed and the single left lobe of each mouse was perfused with 10% neutral buffered formalin and submitted for histological processing. Each left lung was stained with Masson’s trichrome for identification of the site(s) of collagen deposition in the lung. The area of the fibrosing phenotype for each mouse was quantified with image analysis of histological sections as described previously.16 Specifically, the area of fibrosis in the left lung lobe was determined from a user drawn region surrounding the fibrosis and compared to the area of the entire lobe to yield the percent of pulmonary fibrosis for individual mice.
Following sacrifice, the right lung of each mouse was immediately homogenised in 2 ml of Trizol reagent and placed in dry ice. The homogenates were stored at −85°C until RNA isolation. Total lung RNA was extracted according to the manufacturer’s instructions (Sigma, St Louis, MO). The RNA from the right lungs of four or five mice from each group, defined by sex, strain, treatment, and time point, was pooled, as in Perkowski et al,17 to minimise biological variation in gene expression within a group. One sample of pooled RNA for each group was processed through the RNAEasy column (Qiagen, Valencia, CA) and submitted for hybridisation. RNA quality was assessed and confirmed prior to and following pooling using the Agilent Bioanalyzer (Agilent Technologies, Palo Alto, CA, USA). The experiment was performed with one chip per mouse group, represented by its pooled RNA.
Microarray hybridisation was performed by the Affymetrix Gene Chip Core facility at the McGill University and Genome Quebec Innovation Centre. The probe synthesis, hybridisation, and washing protocols used were as described in Novak et al18 and followed the standardised Affymetrix protocol. The starting material was 10 μl of total RNA. After hybridisation to the Murine MOE430A GeneChip (Affymetrix, Santa Clara, CA), the gene chips were automatically washed and stained with streptavidin-phycoerythrin by using a fluidics system. The chips were scanned with a GeneArray Scanner (Agilent Technologies). The resultant gene expression profiles were then extracted and viewed using Microarray Suite 5.0 (Affymetrix). The MOE430A GeneChip arrays contain 22 690 probe sets derived from sequence clusters contained in Build 107 (UniGene, June 2002) which represent approximately 14 000 functionally annotated genes and a set of expressed sequence tags (ESTs).
Microarray data analysis
Bioconductor version 1.4 (see http://www.bioconductor.org/) routines within the R (version 1.90) statistical language19 were used for quality control, normalisation, and differential expression. In particular, the quality of the raw microarray data was assessed by inspecting similarities between the intensity distribution and RNA digestion plot for each array. Normalisation was performed using the robust probe level model.20 Detection of differential expression was performed using the LIMMA package21,22 with a p value <0.05. Lists of significantly differentially expressed genes were generated intra-strain (control v bleomycin exposure) and between strains. Hierarchical clustering was performed via Cluster version 3.023 using the average linkage and Euclidean distance options. Results were visualised using TreeView version 1.08 (http://rana.lbl.gov/). The detection of significantly over represented Gene Ontology categories was performed using the FatiGO tool.24 The test of statistical significance considers the number of differentially expressed genes found in this category compared to the total number of genes in the category represented on the chip. Raw and normalised expression data are available at the NCBI GEO repository.
Quantitative real time PCR
To generate the cDNA for real time PCR, 4–5 μg of total RNA from the right mouse lung was reversely transcribed with oligo(dT)12–18Primer using Superscript II RNase H− Reverse Transcriptase (Invitrogen, Carlsbad, CA) in a 20 μl total volume. Reverse transcription was performed at 65°C for 5 min, followed by the activation of SuperScript II RT at 42°C for 50 min. The reaction was inactivated by heating to 70°C for 15 min.
Sequence specific primer sets, designed using Primer 3 or taken from the RT-PCR database, were used to carry out quantitative real time PCR. The primer sequences were confirmed, by BLAST comparison to the mouse genome sequence in the Ensembl database, to span an intron, and to be specific for the gene being assayed. Probe and primer sequences are as follows (5′→3′):
glutamate receptor, ionotropic, AMPA1 (alpha 1) (GenBank accession number NM_008165, PrimerBank ID 34328128a1), forward: CACTGGGTGATGCTCCTCG, reverse: CCTCACTGGGTAGTCCATACAC
pituitary tumour transforming 1 interacting protein (GenBank accession number NM_145925, PrimerBank ID 22122339a1), forward: GTCCGCCCTGAGAAATCCAG, reverse: CTCGCCCTTGTCGTACCAC
glyoxalase 1 (GenBank accession number NM_025374, PrimerBank ID 31981282a1), forward: GATTTGGTCACATTGGGATTGC, reverse: TCCTTTCATTTTCCCGTCATCAG
complement component 4 (within H-2S) (GenBank accession number NM_009780, PrimerBank ID 6753226a3), forward: TGCCAATGAAGACTACGAAGAC, reverse: TGCCATTTCGCCAGATACACA
spinocerebellar ataxia 10 (GenBank accession number NM_016843), forward: AAGCTAGTGGGTGAGGAGCA, reverse: ATCGTAACCAGGGCTTCCTT
proteasome subunit α type 3 (GenBank accession number NM_011184), forward: GCAGTTTCATGTTGGGGTCT, reverse: CGGCAAGTCATTTCCTTCAT,
For real time PCR, a LightCycler thermocycler from Roche (Indianapolis, IN) was used. Each reaction in a 32 well carousel contained 1 μl of cDNA template and 10 μl of QuantiTect SYBR Green PCR Kit (Qiagen). The PCR reaction followed a heating temperature of 95°C, an annealing temperature of 55°C, and an elongation temperature of 72°C for 45 or 50 cycles. Each experiment included a non-reverse transcribed control to assay for genomic contamination. No amplification was observed in this type of reaction or in reactions in which no cDNA template was used.
Relative gene expression data analysis was carried out with the standard curve method.25 In this analysis, the expression level of each of the target genes was determined relative to that of a reference gene. The reference gene was chosen from the set of genes whose expression varied from 0.9 to 1.1 relative units across the 13 arrays. The expression of the putative reference gene was then investigated, with RT-PCR, in each of the sample types used in this study. The ratio of the sample fluorescence to that of a standard curve generated using a serial dilution of the template (a PCR fragment cloned into pCR 2.1-TOPO vector) was determined for each of putative reference genes Sca10 and PSMA3. The expression of Sca10 was found to vary from 0.85 to 1.15, relative to PSMA3, across eight treatment groups and was used as the reference gene for the relative expression of the target genes.
The fluorescence data for each target gene were normalised to its own standard curve (generated through serial dilution of the cloned PCR product), then normalised to the expression level of the reference gene, and finally, expressed as a ratio of the calibrator sample which was included in every run. The expression of each target gene was taken as the average of the expression levels of two mice from the same treatment group, and each of these samples was run in quadruplicate.
B6/C3H sequence comparison
An interrogation of Ensembl version 21.22b.1 (http://www.ensembl.org/Mus_musculus/) was made to ascertain the number of genes mapping to the linkage regions Blmpf1 and 2. The Ensembl positions of the markers flanking Blmpf1 (D17Mit198/D17rua1) and those flanking Blmpf2 (D11Mit136/D11Mit5) were determined and the sequence data therein considered. To ascertain which genes in these regions are polymorphic between B6 and C3H mice, the NCBI database (dbSNP build 121, http://www.ncbi.nlm.nih.gov/SNP/MouseSNP.cgi) was queried for B6/C3H SNPs and MIT produced C3H sequence was compared to Ensembl derived B6 sequence. For the SNP analysis, the database was queried for B6/C3H polymorphisms on chromosomes 11 and 17 and re-queried for SNPs on these chromosomes which are non-synonymous. The genes containing the SNPs of the latter query were assessed for map position to the Blmpf1 and 2 linkage regions.
Sequence variations were also identified by comparing (by BLASTN) the Ensembl derived B6 sequence for each gene of Blmpf1 and 2, to the MIT provided C3H sequence (http://www.broad.mit.edu/ftp/pub/mouse_contigs/Mar10_02/).12 Each of the B6 genes and 1 kbp of the flanking 5′ and 3′ sequence were compared to the database of C3H. The C3H contigs with an E value of zero and a match score of 800 were further evaluated. For each of the genes in Blmpf1 and 2 it was determined if the inbred strains (B6 and C3H) differ in their sequence, and, for the coding regions, if that sequence change was synonymous (encoded the same amino acid) or non-synonymous (encoded different amino acids). This analysis was completed by translating the B6 sequence with and without the exonic mismatch (deletion/insertion/nucleotide change) and comparing the resultant peptides.
Multi strain sequence comparison
The markers flanking each of Blmpf1 and 2 (see above) were located in the Celera (Rockville, MD) mouse genome database (http://www.celera.com/, CDS 13h release) and the number of genes (excluding pseudogenes) mapping to each region was determined. Using the DNA positions of the flanking markers, the Celera Mouse SNP reference database (v3.6) was queried for SNPs within each linkage region. These SNP data were then filtered to uncover the set of SNPs for which B6, DBA/2, and 129 mice (all phenotypically similar in bleomycin response) carry the same allele and for which A/J mice (phenotypically distinct) have a different allele. Since it is not known which of the substrains of mouse strain 129, sequenced by Celera, was determined to develop fibrosis after bleomycin exposure,15 this analysis was completed with and without the 129 strain. In both analyses, these data were further filtered to exclude SNPs appearing in the intronic region or identified as synonymous, as has been done by others.26,27
To measure the level of gene expression in the lung and to histologically confirm the strain variation in fibrosis phenotype, mice of the B6 and C3H strains were treated with bleomycin and sacrificed at different time points after treatment. In fig 1 the average fibrosis percent of the lung is shown in mice sacrificed 3 or 6 weeks after bleomycin treatment, and in mice sacrificed due to respiratory distress. The mice sacrificed when sick had lost more than 20% of body weight and were sacrificed from 21 to 36 days after treatment. B6 female mice developed pulmonary fibrosis defined by alveolar atelectatic regions with collagen deposition, of approximately 8% of the lung (fig 1) in response to bleomycin treatment. This fibrosis response was evident, and of similar magnitude, in B6 female mice at 3 and 6 weeks after treatment and in the mice sacrificed due to respiratory distress. The fibrotic lung response of B6 male mice to bleomycin was similar to that of the B6 female mice at 3 weeks and when the mice were in distress (p = 0.10 and 0.15, respectively). B6 male mice are, however, more sensitive to bleomycin as this fibrosis level was in response to a lower dose of the drug. No bleomycin treated B6 male mice survived to the 6 week time point.
In C3H mice the response to bleomycin was a minimal lung disease of limited fibrosis foci. As shown in fig 1, the C3H strain mice, males and females, developed fibrosis foci involving approximately 0.4% of the lung at 3 and 6 weeks after drug exposure. This response was not different between male and female mice (p = 0.11). No C3H mice were identified to be in respiratory distress. The lung response of B6 female mice to bleomycin was greater than that of C3H female mice at both 3 and 6 weeks (p = 9.3×10−6, p = 6.6×10−5, respectively; fig 1). The strain difference in the response to bleomycin is evident in the male mice as the mean fibrosis score for bleomycin treated B6 males at 3 weeks post exposure was greater than that for C3H males (p = 8.8×10−3).
B6 v C3H sequence comparison
To identify the sequence variation between B6 and C3H mice for the genomic regions Blmpf1 (12.9 Mb) and Blmpf2 (25.9 Mb) (table 1), the NCBI database was queried for B6/C3H SNPs. With filtering by strain (B6 v C3H) and chromosome only, this database returned 1278 SNPs on chromosome 17 and 2588 on chromosome 11. Further filtering of this query produced 31 and 30 non-synonymous SNPs on the two chromosomes. An analysis of the genes in which the SNPs were contained compared to the Blmpf1 and 2 gene lists revealed approximately 1.6% (11 of 679) of the genes contained B6/C3H sequence variation(s) which would be predicted to alter the protein product (coding region non-synonymous polymorphism).
To augment this analysis, a second B6/C3H sequence comparison was completed using the B6 sequence data of Ensembl and the partial C3H draft sequence data of MIT.13 For 42% of the linkage region genes, a C3H sequence match (⩾90% of gene length) was identified. Of these genes (n = 288), a non-synonymous coding region sequence variation (SNP or insertion/deletion) was identified in 35%. A further 84 genes (of 377, 22%), for which a C3H sequence encompassing 50–89% of gene length was identified, contained a coding non-synonymous sequence variation in the portion of the gene evaluated. Based on this analysis it is conservatively estimated that the fraction of genes in Blmpf1 and 2 that contain B6/C3H sequence variations to alter the encoded protein is 35%.
Multi strain sequence comparison
As the analysis of SNPs in the B6 genome compared to the C3H is limited by the lack of a complete genetic sequence of the C3H strain, a second SNP analysis was completed using the known response to bleomycin of the four strains for which sequence is available in the Celera database. Localisation of the two flanking markers of each of Blmpf1 and 2 on the Celera map revealed both Blmpf1 and 2 to be larger on the Celera map than on the Ensembl map (regions of ∼20 and 29.1 Mb, respectively, in Celera). Scanning of the Celera database for SNPs in the linkage regions identified a number of SNPs consistent with the rate of one SNP/kbp of sequence which has been reported for the mouse genome12 (table 2).
The linkage region SNP data were filtered to identify candidate genes where the sequence of fibrosis prone strains (B6, DBA/2) was identical and distinct from that of a fibrosis resistant strain (A/J). After removing intronic or synonymous SNPs from the analysis, 148 SNPs encompassing 52 genes in Blmpf1 and 154 SNPs in 24 genes of the sequence of Blmpf2, were revealed. To integrate this sequence variation with the previous analysis, the subset of the 76 SNP containing genes from the Celera database, for which the B6/C3H sequence comparison was available, was compiled (n = 28). Ten (36%) of the 28 genes were identified to contain SNPs where the allele in B6 mice was the same as in DBA/2 mice but distinct from the A/J strain and where the B6 allele differed from the C3H allele.
When the data from the Celera database were filtered to include the fibrosis susceptible 129 strain, 28 SNPs encompassing 13 genes in Blmpf1 and 17 SNPs in six genes of the sequence of Blmpf2 were identified. Of these genes, two in Blmpf1 (cytochrome p450 4f14, histocompatibility-2 M1) and one in Blmpf2 (retinoic acid induced-1) were also identified as containing a B6/C3H sequence variation.
Gene expression data
Affymetrix GeneChip microarrays were used to determine which genes are differentially expressed in the lung tissue of B6 mice compared to C3H mice. In this experiment, the data of each array represent the gene expression in the lungs of a group of mice of the same sex, strain, bleomycin dose, and time point. The expression data for the 13 chips used in this experiment are given in fig 2. As shown in this figure, the data cluster into three main groups: genes highly expressed in B6 mice (treated or control); genes highly expressed in C3H mice (treated or control); and genes over or under expressed in B6 mice treated with bleomycin relative to both C3H mice and B6 control mice. The similarity in gene expression within each strain, evident in fig 2, was analysed to determine if the gene expression profiles differed between male and female mice within a strain, or between mice evaluated at different times after bleomycin treatment.
To determine if pulmonary gene expression differed between male and female mice, we compared the gene expression profile of B6 male control mice to that of B6 female controls, and the profiles of B6 male mice treated with bleomycin were compared to those of bleomycin treated B6 female mice. The same comparisons were made of the C3H derived gene expression data. Two genes (DEAD box polypeptide 3, Y linked and eukaryotic translation initiation factor 2, subunit 3, structural gene) were found to be differentially expressed (>2 fold difference in expression, p<0.05) between male and female mice and this difference was evident in all the comparisons (data not shown). As only two Y linked genes were identified to differ in expression between male and female mice, subsequent analyses were completed with each time point comprised of data from male and female mice.
In a similar analysis we determined if the gene expression profile of C3H mice at 3 weeks post bleomycin differed from the profile at 6 weeks. This evaluation was also completed for B6 mice, and, for this strain, a comparison to the profile generated from mice identified to be in respiratory distress was also completed. In the B6 mice two genes (chemokine (C-X-C motif) ligand 10, and deoxycytidine kinase) were found to be more highly expressed at 3 weeks than at 6 weeks or in mice in distress while in C3H mice one gene (lipocalin 2) was more highly expressed at 3 weeks than at 6 weeks (>2 fold difference in expression, p<0.05; data not shown). Based on this analysis the gene expression profiles at 3 and 6 weeks post bleomycin and from mice sacrificed due to distress were determined to be similar and these datasets were combined for further analyses.
Strain specific gene expression response to bleomycin
To determine the set of genes involved in the pulmonary response of a B6 mouse to bleomycin, the data from five arrays (female 3 weeks, female 6 weeks, female sick; male 3 weeks, male sick) were compared to the data from two control arrays (male control, female control). A total of 766 genes were determined to be significantly over expressed in the lungs of treated mice compared to untreated controls and 635 genes were relatively under expressed, as shown in supplementary table 1 (p<0.05) (supplementary table 1 is available at http://www.jmedgenet.com/supplemental). In addition, 367 ESTs were differentially expressed (p<0.05) in the lungs of bleomycin treated B6 mice compared to controls (supplementary table 1).
By Gene Ontology analysis, the biological processes most affected in the B6 bleomycin treated mouse lung were cell communication (p = 0.011) and response to stimulus (p = 0.018). The molecular functions most altered were protein (p = 0.0003), nucleic acid (p = 0.0073), and glycosaminoglycan (p = 0.03) binding; extracellular matrix (p = 0.013) and muscle (p = 0.018) constituents; and small protein conjugating enzyme (p = 0.019), lipid transporter (p = 0.03), and GTPase regulator (p = 0.045) activity. As shown in table 3, the most significantly differentially expressed genes (ranked by p value) included those of the extracellular matrix such as fibronectin 1 and collagen types 1α2 and Vα2, and genes involved in extracellular matrix homeostasis as indicated by cathepsin S and B over expression in lungs of bleomycin treated mice compared to controls. Also differentially expressed were genes involved in immunity, such as cd68 antigen, bone marrow stromal cell antigen 1, and an IgG receptor. Each of these genes is an example of the types of genes differentially expressed in the B6 lung after bleomycin exposure. Additional types of genes differentially expressed in the B6 lung included cytokines, integrins, growth factors and chemokines, and their receptors; oxidative stress response genes; and genes encoding cell signalling proteins, ion channels, and cytochrome p450 proteins. The gene expression data are consistent with the inflammatory and fibrotic phenotype evident in the lungs of bleomycin treated B6 mice.
A similar analysis of microarray data was completed to determine the set of genes involved in the pulmonary response of a C3H mouse to bleomycin. The data from four arrays of treated mice (female 3 weeks, female 6 weeks; male 3 weeks, male 6 weeks) were compared to the data from two arrays generated from control mice (male control, female control). Substantially fewer genes (n = 5) in this strain were determined to be significantly over expressed in the lungs of treated mice compared to untreated controls and one gene was relatively under expressed (endothelial cell-specific molecule 1), which is consistent with the minimal lung response evident by histology. Of the genes increased in expression in the bleomycin treated C3H lung, glycoprotein (transmembrane) nmb, p21 (cdkn1a), and glycoprotein 49 B were also increased in expression in the B6 mouse lung after bleomycin, and to a significantly higher extent (p<0.05) than in C3H (data not shown). Cathepsin K was the only gene identified to be induced by bleomycin in the lungs of C3H mice to a greater extent than in the lungs of B6 mice. The remaining two genes induced or repressed in the C3H lung (pleckstrin homology-like domain, family A, member 3, and endothelial cell-specific molecule 1), were similarly induced or repressed in the bleomycin treated B6 lungs (data not shown).
Strain comparison of gene expression data
Analyses were completed to identify the set of lung genes which differed in expression between B6 and C3H mice. The data from the treated mice (five B6 chips, four C3H chips) were compared and the data from untreated mice (two B6 control chips, two C3H control chips) were also evaluated. A total of 734 genes were determined to be significantly over expressed in the lungs of B6 bleomycin treated mice compared to C3H treated mice and 705 genes were relatively under expressed (p<0.05; supplementary table 2, see http://www.jmedgenet.com/supplemental). In addition, 453 ESTs were differentially expressed (p<0.05) in the lungs of bleomycin treated B6 mice compared to C3H (supplementary table 2).
The set of genes differentially expressed between bleomycin treated B6 and C3H mice was studied to identify those for which a strain difference in expression exists, independent of bleomycin treatment. The evaluation of the microarray data representing gene expression in the lungs of control mice revealed that 201 of the genes identified as differentially expressed in the lungs of bleomycin treated B6 and C3H mice were also differentially expressed in the control mice and for 176 (88%) of these the bleomycin treatment did not alter this strain difference in expression by >2 fold (data not shown). Approximately half of the differentially expressed genes in this comparison of control datasets were found to be expressed at a higher level in B6 mice than in C3H mice.
For the majority of the genes differentially expressed between bleomycin treated B6 and C3H mice (89%), the differential expression was induced by the bleomycin treatment. By Gene Ontology analysis, the biological processes most often represented in the set of differentially expressed genes were, as in the B6 response to the drug, cell communication and response to stimulus. As shown in table 4, the most significantly differentially expressed genes included those involved in immune response such as the major histocompatibility complex gene H2-D1, complement component 4, and regenerating islet derived 3γ; and genes involved in protein transport including coatomer protein complex, subunit ζ2, and solute carrier family 15 (H+/peptide transporter), member 2. A complete list of differentially expressed genes is available in supplementary table 2 (http://www.jmedgenet.com/supplemental). The datasets given in supplementary tables 1 and 2 indicate substantial gene expression profile differences between the B6 response to the drug relative to control (supplementary table 1) and the B6 relative to C3H bleomycin response (supplementary table 2). Approximately 53% of the genes of the B6 intrastrain response dataset are not present in the interstrain response, and 56% of the B6/C3H differentially expressed genes are not of altered expression in the lungs of bleomycin treated B6 mice compared to controls.
Blmpf1 and 2 differential gene expression
To propose candidate genes for Blmpf1 and 2, the set of genes which were both differentially expressed in the bleomycin treated lungs of B6 mice relative to C3H mice, and which map to the linkage intervals, was documented. Thirty seven differentially expressed genes mapped to Blmpf1 and 19 to Blmpf2. The subset of this group, for which the gene expression of one strain was at least 1.5 fold of the other, is shown in table 5. The set of candidate genes includes major histocompatibility complex genes, oxidative stress response genes, and DNA damage response genes among others. In addition, six ESTs mapping to Blmpf2 and five ESTs mapping to Blmpf1 were identified to be differentially expressed in the lungs of bleomycin treated B6 mice relative to C3H mice, with expression changes ranging from 0.56 to 1.68 fold (p<0.05; data not shown).
To further assess these candidate genes, the B6/C3H sequence comparison (described earlier) was reviewed. This analysis revealed 18 of the 56 candidate genes contained coding region non-synonymous polymorphisms and/or 5′ or 3′ UTR SNPs, and 15 genes were without such changes (in at least 90% of the gene), while for 23 genes the C3H sequence was not available. The type of sequence variation identified for the differentially expressed candidate genes is listed in table 5.
To determine if the expression level of all the genes of the linkage regions had been evaluated, a review of the Affymetrix MOE430A chip was made using the Ensembl and Affymetrix databases. Approximately 55% of the genes in Blmpf1 and 53% of the genes of Blmpf2 are contained on this microarray. Of the 166 genes in Blmpf2 and not on the Affymetrix chip, 103 (62%) are Riken cDNAs or Ensembl predicted genes, 35/166 (21%) are olfactory receptors, and 28 (17%) are known genes. Of the 150 genes of Blmpf1 not found on the microarray used in these studies, 61 (41%) are Riken cDNAs or Ensembl predicted genes, 37 (25%) are olfactory receptors, and 52 (34%) are known genes (including 13 genes of the major histocompatibility complex).
To assess the validity of the gene expression differences identified by microarray analysis, the expression levels of four genes were assessed with RT-PCR analysis. These data are presented in fig 3A. These genes were chosen to investigate examples of both differential and non-differential gene expression levels. As shown in fig 3B, the gene expression differences identified by microarray analysis were consistent with the results obtained through real time PCR.
A set of candidate genes was implicated in susceptibility to bleomycin induced pulmonary fibrosis through gene expression and sequence analysis combined with prior data from linkage studies. In prior studies, Blmpf1 and 2 were mapped to 13 and 28 Mb intervals containing a total of 679 known or unknown genes. Using a combination of approaches, polymorphisms in specific genes of the oxidative stress response, apoptosis, and immune pathways have been identified as the potential causal variants in the development of fibrosis in this mouse model.
Consistent with previous studies of this model,7,11,16 the B6 mice developed substantial fibrosis in response to bleomycin exposure while C3H strain mice did not. The B6 response to the drug was histologically similar in both male and female mice sacrificed at three time points, as was the near null response of C3H mice at the times evaluated. For gene expression analysis, this permitted essentially five replicates of the B6 bleomycin response and four of the C3H response. Our study revealed the B6 pulmonary response to bleomycin included altered expression of extracellular matrix component and regulation genes, of inflammatory pathway components, and of DNA damage response genes, in agreement with previous studies of the B6 response15,28 and with clinical data.29 The genetic and mechanistic complexity of pulmonary fibrosis is evident in that nearly 1900 genes were identified to be differentially expressed in the B6 bleomycin treated lung compared to the C3H.
Combining the expression data with the linkage data revealed approximately 10% of genes/ESTs in the linkage regions were differentially expressed in the lungs of bleomycin treated B6 and C3H mice. The defined list of gene expression candidates is likely to be reasonably complete as the microarray used in these studies permitted the assessment of ∼90% of the known genes in the linkage intervals, when olfactory receptor genes, which have no established pathway in pulmonary fibrosis, are not considered. The identified genes are considered candidates as the causal variation leading to the development of bleomycin induced pulmonary fibrosis with the assumption that the fibrosis causative gene is differentially expressed in the bleomycin treated lung. For approximately half of the linkage and expression candidate genes identified, listed in table 5, the difference in expression was measured in untreated mice as well. This may indicate that B6 mice have an inherent deficiency in enzyme levels of certain pathways that leaves this strain susceptible to fibrosis, as opposed to an induced deficiency. This finding would be consistent with the rapid development of fibrosis in this strain, as the maximal fibrosis was evident 2 weeks after the completion of drug delivery.
The majority of the bleomycin induced pulmonary fibrosis candidate genes encode proteins involved in a DNA damage or oxidative stress response, in apoptosis, or in immune/inflammatory pathways. Each of these mechanisms has been implicated in the development of fibrosis. Bleomycin induced pulmonary fibrosis develops through a series of pathways initiated by the bleomycin binding to DNA, principally in alveolar epithelial cells, and inducing lesions through a free radical reactive complex.30 The DNA damage and oxidative stress may lead to apoptosis of the epithelium31 and to a TH2 inflammatory response in the lung by increased major histocompatibility complex presentation of B7 and class II molecules on epithelial cells,32 activating T cells to a TH2, or fibrotic, phenotype.33 The extent and duration of this inflammatory response may also be controlled, in part, through apoptosis.34 Specific to the candidate genes proposed from this model, the B6 response to bleomycin may differ from the C3H response due to increased levels of bleomycin in the lungs of B6 mice. The expression of pituitary tumour transforming-1, which is a thiol protease inhibitor35 that may act on the enzyme which degrades bleomycin, bleomycin hydrolase, was higher in B6 mice than C3H. Genes involved in immune responses (H2; secretoglobin, family 3A, member 1; complement component 4; allograft inflammatory factor 1) were also differentially expressed and thus the distinct fibrotic reactions of these strains could be due to differences in T cell regulation. The decreased levels of glyoxalase 136 and increased levels of p21 in the lungs of B6 mice may inhibit T cell apoptosis, and together with increased expression of proviral integration site-1, which induces T cell proliferation,37 may have resulted in the maintenance of the profibrotic inflammatory signal. Finally, trefoil factor 2, which is increased in B6 mice, may promote the wound healing seen in fibrosis.38
In addition to differential expression, the genes of the linkage regions were assessed for sequence variation. The NCBI database returned SNPs at a rate of 1/10 kbp, which is lower than that documented by others12,13 for the mouse genome, so the assessment of sequence variation was extended. Wade et al13 sequenced portions of the C3H genome in their study which identified the mouse genome as being comprised of regions of high and low SNP density. From the data of their report, both Blmpf1 and 2 consist of random regions of low and high SNP density. Using these data and the publicly available B6 sequence, our analysis revealed up to 35% of the genes in the linkage regions contained coding sequence variations which would be predicted to alter the protein produced. The variation rate of 35% is likely an over estimate, due to the sequencing error rate of 1/1 kbp, and indeed an assessment of the Celera database, using B6 and A/J mice, revealed this type of SNP (coding non-synonymous) occurred in approximately 4% of the genes in Blmpf1 or 2.
Sequence variation among inbred strains of known bleomycin induced fibrosis phenotype was used to assess candidate genes underlying Blmpf1 and 2. Specifically, where C3H sequence is not yet known, SNPs between B6 and A/J (resistant) were reported. More generally, the evaluation of additional strains permits the reduction of background variation in the identification of causal variation. Only ∼10% of the genes mapped to Blmpf1 or 2 have non-intronic sequence variations predicting for the response of B6, A/J, and DBA/2 mice to bleomycin, including five genes of our candidate list. The inclusion of haplotypes of inbred strains other than B6 or C3H, however, is predicated on the assumption that the same loci, and the same genes, influence susceptibility to bleomycin induced pulmonary fibrosis in all evaluated strains. This is not known and no other loci for this trait have been mapped. The gene underlying Blmpf1 may be relevant in this analysis as the MHC haplotype (within Blmpf1) has been shown to influence fibrosis susceptibility in other mouse strains,7 and this same region of chromosome 17 harbours loci of susceptibility to radiation induced pulmonary fibrosis39 and ozone induced40 and sulphate particle induced lung response.41
In previous studies, bleomycin hydrolase, the gene which encodes a protein that detoxifies bleomycin, was proposed as a candidate for Blmpf2.11 The gene has subsequently been mapped to 76.5 Mb on chromosome 11, which is outside the linkage interval, and therefore was not evaluated for sequence differences. It was also not found to be differentially expressed in any comparison in this study. Also in the previous study, the influence of Blmpf2 on the fibrosis phenotype of female mice was not clear as the linkage data identified this locus in male mice only, and the study of chromosome 11 consomic mice implicated Blmpf2 in the response of mice of both sexes. In this study, there was no difference in gene expression between B6 male and female mice, thus it is presumed the Blmpf2 locus alters the fibrosis response in mice of both sexes.
In summary we have used gene expression and DNA sequencing data to identify putative fibrosis susceptibility genes from a set of positional candidate genes. Through this approach, 10% of the positional candidate genes were measured to be differentially expressed in the bleomycin treated lung and DNA sequence analysis revealed approximately one third of the linked genes had B6/C3H sequence variations which could affect the encoded protein. This subset of genes may be the most likely fibrosis candidate genes, although the positional candidate genes with sequence variations remain putative candidates. Additional testing to determine causality for a specific gene would include confirmation of sequence variation, target cell affected, and ultimately, an evaluation of the phenotype of mice with the B6 allele in C3H background mice, possibly involving a combination of knockout and transgenic replacement.42 This list of candidates should facilitate the identification of fibrosis susceptibility genes and the potential mechanistic interaction between the Blmpf loci.
The table is available as a downloadable PDF (printer friendly file).
If you do not have Adobe Reader installed on your computer,
you can download this free-of-charge, please Click here
Files in this Data Supplement:
- [view PDF] -
766 genes significantly over expressed in the lungs of treated mice compared to untreated controls and 635 genes relatively
367 ESTs differentially expressed in the lungs of bleomycin treated B6 mice compared to controls.
- [view PDF] - 766 genes significantly over expressed in the lungs of treated mice compared to untreated controls and 635 genes relatively under expressed.
This work was supported by funding from the Canadian Institutes of Health Research and Fonds de la Recherche en Santé Québec.
Competing interests: none declared
Ethical approval: Animal use was approved by the McGill University Animal Care Committee and was in accordance with the guidelines of the Canadian Council on Animal Care.