Article Text
Abstract
Background Deleterious germline variants in ATM and CHEK2 have been associated with a moderately increased risk of breast cancer. Risks for other cancers remain unclear.
Methods Cancer associations for coding variants in ATM and CHEK2 were evaluated using whole-exome sequence data from UK Biobank linked to cancer registration data (348 488 participants), and analysed both as a retrospective case-control and a prospective cohort study. Odds ratios, hazard ratios, and combined relative risks (RRs) were estimated by cancer type and gene. Separate analyses were performed for protein-truncating variants (PTVs) and rare missense variants (rMSVs; allele frequency <0.1%).
Results PTVs in ATM were associated with increased risks of nine cancers at p<0.001 (pancreas, oesophagus, lung, melanoma, breast, ovary, prostate, bladder, lymphoid leukaemia (LL)), and three at p<0.05 (colon, diffuse non-Hodgkin’s lymphoma (DNHL), rectosigmoid junction). Carriers of rMSVs had increased risks of four cancers (p<0.05: stomach, pancreas, prostate, Hodgkin’s disease (HD)). RRs were highest for breast, prostate, and any cancer where rMSVs lay in the FAT or PIK domains, and had a Combined Annotation Dependent Depletion score in the highest quintile.
PTVs in CHEK2 were associated with three cancers at p<0.001 (breast, prostate, HD) and six at p<0.05 (oesophagus, melanoma, ovary, kidney, DNHL, myeloid leukaemia). Carriers of rMSVs had increased risks of five cancers (p<0.001: breast, prostate, LL; p<0.05: melanoma, multiple myeloma).
Conclusion PTVs in ATM and CHEK2 are associated with a wide range of cancers, with the highest RR for pancreatic cancer in ATM PTV carriers. These findings can inform genetic counselling of carriers.
- Genetics
- Whole Exome Sequencing
Data availability statement
Data may be obtained from a third party and are not publicly available. Requests for access to UK Biobank data should be made to the UK Biobank Access Management Team (access@ukbiobank.ac.uk).
This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.
Statistics from Altmetric.com
WHAT IS ALREADY KNOWN ON THIS TOPIC
While previous research shows there is evidence for association between variants in ATM or CHEK2 and multiple cancer types in individual smaller studies, the associations have not been consistently evaluated across all cancer types and, with the exception of breast cancer, the strengths of association are unclear.
WHAT THIS STUDY ADDS
We examined data from a large cohort study to derive relative and absolute risks for all cancer types for carriers of protein-truncating variants (PTVs) and rare missense variants (rMSVs) in ATM and CHEK2.
ATM PTVs were associated with significantly increased risk for 12 of 24 sites examined (nine at p<0.001), with the relative risk (RR) being highest for pancreatic cancer (approximately sevenfold). Carriers of rMSVs had increased risks of four cancers, with a RR of approximately 1.5.
For CHEK2 PTVs, statistically significant risks were observed for nine of the 24 sites examined (three at p<0.001). Carriers of rMSVs had increased risks of five cancers with the risk being highest for lymphoid leukaemia (approximately twofold).
HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY
ATM and CHEK2 are included on many cancer gene panels used in family cancer clinics, and the risk estimates from these analyses can inform genetic counselling for carriers.
The estimated absolute risks for pancreatic cancer in ATM PTV carriers (11% in males and 8% in females by age 85) are notably higher than for other major pancreatic susceptibility genes including BRCA2, CDK2NA, and PALB2. Our findings can also inform National Institute for Health and Care Excellence guidelines for pancreatic cancer, which do not currently include ATM.
Introduction
Gene panel testing for cancer susceptibility is now an important part of clinical practice. To provide reliable genetic counselling, it is important to be able to provide risks for all cancer types. Many susceptibility genes, notably BRCA1, BRCA2, and the mismatch repair genes, are associated with multiple types of cancer,1 2 but in general, cancer pleiotropy is poorly understood. Here we consider rare variants in two genes, Checkpoint Kinase 2 (CHEK2) and Ataxia-Telangiectasia Mutated (ATM), that are included on many cancer gene panels. Variants in these genes are known to be associated with moderately increased risks of breast cancer, but the risks of many other cancer types are unclear.
CHEK2 encodes a protein kinase, activated in response to DNA damage,3 and with a role in cell-cycle arrest, DNA repair, and cell death.4 5 The most common protein‐truncating variant (PTV) in Western European populations is the CHEK2*1100delC. Previous research has shown an association between the 1100delC deletion and risk of breast cancer in women, with the estimated risk being approximately twofold.6 7 A similar association has been seen for other PTVs in aggregate.6 8 Rare CHEK2 missense variants (MSVs), in aggregate, have also been associated with breast cancer risk, with a relative risk (RR) of ~1.4.6 There is evidence that the breast cancer risk differs by variant, using in silico and functional classifications of deleteriousness.9
Previous studies have found some evidence of an association between PTVs and cancer other than breast cancer, with most evidence being based on the 1100delC variant: these include increased risks of prostate cancer,10 colorectal cancer,11 12 malignant melanoma, gastric cancer, and bladder cancer.13 Previous studies have reported evidence that carriers of the I157T variant have increased risk of prostate,10 colorectal,14 ovarian,15 pancreatic,16 gastric,17 thyroid,18 kidney,19 and colon cancer.19 There is some evidence of a decreased risk of lung cancer.20 CHEK2 PTVs have also been shown to influence ovarian ageing in women through inhibited DNA damage sensing.21
ATM 22 encodes a protein kinase involved in DNA repair, apoptosis, and regulation of cell-cycle checkpoints.22 23 It is recruited and activated by double-strand DNA breaks. ATM activates CHEK2 through phosphorylation. Deleterious mutations in ATM cause the rare autosomal recessive disorder Ataxia-Telangiectasia (A-T). Patients with A-T have a large excess risk of cancers in childhood, in particular, lymphomas and leukaemias, but are also at increased risk of adult cancers, including breast cancer. There is evidence both from studies of families of A-T patients,24 and from large case-control studies, of an association between pathogenic variants in ATM and increased risk of breast cancer,6 with an estimated RR of approximately twofold. Previous studies indicate subsets of rare missense variants (rMSVs) from the PIK, FAT, and FATC domains are also associated with an excess risk of breast cancer.25
For cancers other than breast, there is evidence for an association between mutations in ATM and cancer of the prostate26 and pancreas.27 Some studies have found evidence for a higher risk of colorectal cancer,24 melanoma,28 gastric,29 ovarian, and lung cancer.30
While there is evidence for association between variants in ATM or CHEK2 and multiple cancer types, the associations have not been consistently evaluated across all cancer types and, with the exception of breast cancer, the strengths of association are unclear. In this paper, we examine data from a large cohort study to derive relative and absolute risks for all cancer types, for carriers of PTVs and rMSVs in CHEK2 and ATM. We also examine the evidence for variation in risk for rMSVs according to previously defined in silico classifications.
Materials and methods
Dataset
UK Biobank is a population-based prospective study of approximately 500 000 participants in the UK.31 Participants were enrolled between 2006 and 2010 and were aged between 40 and 69 on recruitment. Whole-exome sequencing (WES) data were available at the time of analysis for 454 756 UK Biobank participants.32 Details of the WES variant calling, filtering, and classification are given in the online supplemental appendix. In ATM, rMSVs were categorised by functional protein domain33 and pathogenicity score predicted by the Combined Annotation Dependent Depletion (CADD; version 1.6) algorithm.34 In CHEK2, rMSVs were classified as high or low risk using the deleteriousness score given by the Helix algorithm (version 4.4.1).35 These classifications were used as they provided the best discrimination for breast cancer risk in the analysis of the large BRIDGES dataset.
Supplemental material
Cancers occurring in study participants were identified through linkage to the national cancer registry data for England, Wales, and Scotland.36 Invasive cancers (codes C00-C97) and breast carcinoma in situ (CIS; code D05), defined using the 10th Revision of the International Classification of Diseases,37 were included as endpoints. Self-reported cancers were not included.
Study design
UK Biobank includes both a retrospective (based on data at study entry) and a prospective component (figure 1). The retrospective analyses were based on cases diagnosed with cancer before entry to the study. Controls were individuals who had neither a self-reported nor registry-reported cancer before entry, other than non-melanoma skin cancer. A total of 348 488 males and females were included, of whom 18 838 were diagnosed with at least one cancer before the day of first assessment.
The prospective analysis included individuals without a self-reported cancer or registry-reported cancer before the start of follow-up, or a registry-reported cancer within 6 months of follow-up (except for non-melanoma skin cancer). Thus, follow-up started 6 months from the day of first assessment for the study, to avoid inadvertent inclusion of cases where onset was prior to entry. The event indicator was first diagnosis of any cancer by type, and follow-up was censored at the earliest of first diagnosed cancer of any type, date of death, or a last follow-up date dependent on the location of the assessment centre attended by participants (for England, this was 31 December 2020; Scotland, 30 November 2021; and Wales, 31 December 2016). A total of 328 919 individuals were included, with 37 802 cancers diagnosed over 3 484 613 person-years of follow-up.
Statistical analyses
Since most variants were too rare to estimate the associated risk, we conducted burden analyses which evaluated the risks associated with carrying any one of a set of variants: PTVs or rMSVs. rMSVs were further subdivided by functional protein domain and prediction scores. We present results for each variant type and gene for 23 invasive cancer types with at least 400 occurrences in the dataset, plus Hodgkin’s disease (HD) and CIS. We also conducted exploratory analyses for 65 rarer cancer types.
In the retrospective dataset, logistic regression was used to estimate the OR for each cancer type associated with carrying either a PTV or rMSV. For the prospective analyses, Cox proportional hazards regression models were used to estimate the HR for any cancer, and for each cancer type, associated with carrying either a PTV or rMSV. All models included sex and 10 ancestry informative principal components35 as covariates. Since the retrospective and prospective components are essentially independent, we also computed overall RR estimates combining the retrospective and prospective estimates (see online supplemental appendix). There were too few cases of male breast cancer to analyse separately; the estimates based on female alone and males and females combined were identical. We investigated whether the HRs varied by age by fitting HRs by age group (<50, 50–59, 60–69, 70–79, and ≥80 years) and by fitting models in which the log(HR) varied linearly with age, using the tt() function in R.
Cumulative risks for cancer were estimated by combining HR estimates from the prospective analyses with population cancer incidences in the UK, 2016–2018,38 using the same approach as that used by Schmidt et al.39 Where there was evidence of a trend in the HR by age, time-dependent HR risk estimates were used, otherwise a constant HR was assumed.
Binary logistic regression and Cox proportional hazards regression analyses were conducted using R version 4.1.0. Meta-analyses were also conducted in R using the meta package version 6.5-0, and the metagen function. All p values were two-sided, with p<0.05 considered significant.
Results
PTVs in ATM
In the pooled analysis (online supplemental table 1), PTVs in ATM were associated with increased risks of 12 of 24 cancers: oesophagus (RR: 3.90 (95% Confidence Interval (CI) 1.85 to 8.20, p=3.43×10−4)); colon (2.18 (1.37 to 3.48, p=1.04×10−3)); rectosigmoid junction (3.43 (1.10 to 10.66, p=0.033)); pancreas (7.35 (4.27 to 12.66, p=6.53×10−13)); lung (2.64 (1.61 to 4.32, p=1.18×10−4)); melanoma (2.28 (1.44 to 3.59, p=4.18×10−4)); breast (2.27 (1.77 to 2.91, p=1.18×10−10)); ovary (3.20 (1.66 to 6.16, p=5.12×10−4)); prostate (2.35 (1.78 to 3.11, p=2.41×10−9)); bladder (3.74 (1.94 to 7.21, p=8.30×10−5)); diffuse non-Hodgkin’s lymphoma (DNHL) (3.18 (1.51 to 6.69, p=2.28×10−3)); and lymphoid leukaemia (LL) (4.42 (2.11 to 9.29, p=8.67×10−5)). PTVs were also associated with CIS (3.22 (1.96 to 5.28, p=4.05×10−6)). Among rarer cancer types, we observed some evidence for increased risk for cancers of the small intestine (3 cases, RR: 6.12 (1.97 to 18.98, p=0.0017)), gallbladder (2 cases, RR: 10.00 (2.50 to 39.99, p=8.84×10−5)), and other and unspecified parts of the biliary tract (6 cases, RR: 20.77 (9.11 to 47.34, p=5.31×10−13)).
Pooled analyses also showed increased risk for any cancer in females (2.23 (1.88 to 2.64, p=1.86×10−20)) and males (2.33 (1.96 to 2.77, p=3.81×10−22)). These RRs remained elevated (though slightly reduced in females), once breast and prostate cancer were excluded (2.06 (1.63 to 2.59, p=1.03×10−9) and 2.30 (1.83 to 2.88, p=9.90×10−13), respectively).
The OR and HR estimates from the retrospective and prospective analyses were similar for each of the associated cancers, and the differences were not statistically significant for any cancer type. The estimates for pancreatic cancer were higher in the retrospective analysis (20.64 (4.97 to 85.76, p=3.10×10−5)) than in the prospective analysis (6.15 (3.39 to 11.14, p=2.18×10−9)); however, carrier counts were low, and CIs correspondingly wide.
There was some evidence of a decline in the HR with age for breast cancer (RR per year 0.95 (0.92 to 0.99, p=0.012)) and any cancer in females (0.97 (0.95 to 0.99, p=0.013)), with the highest HR for breast cancer under age 50 years (3.92 (2.16 to 7.10, p=6.64×10−6)). There was no evidence for a trend in the HR by age for prostate nor pancreatic cancer (online supplemental tables 5 and 9a).
rMSVs in ATM
Pooled analyses (online supplemental table 2) showed that rMSV carriers had higher risks of cancer of the stomach (1.54 (95% CI 1.07 to 2.23, p=0.022)), pancreas (1.60 (1.20 to 2.13, p=1.27×10−3)), prostate (1.15 (1.04 to 1.27, p=7.21×10−3)), and HD (1.72 (1.02 to 2.90, p=0.043)). There was some increased risk for any cancer in males (1.13 (1.06 to 1.21, p=2.60×10−4)); removing prostate cancer made little difference to the estimate.
There was no evidence for a trend in the HR by age for breast cancer, prostate cancer, pancreatic cancer, or any cancer in females or males (online supplemental table 6).
rMSV subset by functional protein domain and CADD score
Pooled analyses (online supplemental table 11a–f) showed that RRs were highest for breast cancer (1.52 (95% CI 1.14 to 2.04, p=4.85×10−3)) and prostate cancer (1.67 (1.23 to 2.29, p=1.20×10−3)) where variants lay inside the FAT or PIK domains, and had a CADD score within quintile 5 (>3.74) (compared with RRs for other variants: p-diff <0.0001 for both breast and prostate cancer). Similarly, the RRs for all cancers combined for variants in this category were higher than for all other variants combined, both in females (1.40 (1.15 to 1.71, p=8.74×10−4); p-diff <0.0001) and males (1.34 (1.09 to 1.65, p=6.57×10−3); p-diff <0.0001).
For pancreatic cancer, the RR was highest where variants lay outside the FAT and PIK domains (1.56 (1.11 to 2.20, p=0.011)), but not significantly different from variants inside these domains (p-diff=0.65).
PTVs in CHEK2
In the pooled analyses (online supplemental table 3), PTVs were associated with an increased risk of nine cancer types: oesophagus (RR: 2.13 (95% CI 1.10 to 4.10, p=0.025)); melanoma (1.45 (1.00 to 2.09, p=0.049)); breast (2.44 (2.08 to 2.86, p=3.61×10−28)); ovary (2.33 (1.37 to 3.97, p=1.73×10−3)); prostate (1.92 (1.59 to 2.32, p=1.18×10−11)); kidney (1.83 (1.06 to 3.18, p=0.032)); HD (4.26 (1.89 to 9.64, p=4.93×10−4)); DNHL (1.97 (1.09 to 3.57, p=0.025)); and myeloid leukaemia (ML) (2.81 (1.33 to 5.94, p=6.75×10−3)). PTVs were also associated with CIS (2.05 (1.35 to 3.13, p=8.48×10−4)). The OR and HR estimates from the retrospective and prospective studies were broadly similar for all the associated cancers and did not differ significantly for any cancer type.
The risk of all cancers combined was elevated in both females (1.81 (1.58 to 2.06, p=1.07×10−18)) and males (1.57 (1.37 to 1.79, p=4.21×10−11)). After excluding prostate cancer, the risk for males was reduced but still elevated (1.35 (1.12 to 1.62, p=1.99×10−3)), while the excess in females was no longer significant after excluding breast cancer (1.18 (0.96 to 1.46, p=0.124) and 1.28 (1.11 to 1.46, p=6.40×10−4) for both sexes combined).
We examined the variation in the HR by age for breast, prostate, pancreatic, and any cancer. There was evidence of association in all age groups for breast and prostate cancer, and no evidence of differences in the HR by age for any cancer (online supplemental tables 7 and 10a).
rMSVs in CHEK2
In the pooled analyses (online supplemental table 4), rMSVs were associated with a higher risk of five cancer types: melanoma (1.32 (95% CI 1.01 to 1.71, p=0.041)); breast cancer (1.50 (1.31 to 1.72, p=1.02×10−8)); prostate cancer (1.41 (1.21 to 1.64, p=1.43×10−5)); multiple myeloma (1.86 (1.13 to 3.05, p=0.014)); and LL (2.23 (1.42 to 3.52, p=5.22×10−4)). rMSVs also were associated with CIS (1.55 (1.13 to 2.14, p=6.82×10−3)).
Combined estimates also showed increased risk of any cancer in females (1.26 (1.15 to 1.39, p=1.09×10−6)), but the risk was no longer significant after excluding breast cancer (1.05 (0.91 to 1.21, p=0.501)). Risk of any cancer was also elevated for males (1.27 (1.14 to 1.40, p=4.80×10−6)) and after exclusion of prostate cancer (1.17 (1.02 to 1.34, p=0.022)).
Statistically significant risk estimates from the retrospective and prospective analyses were broadly similar, and the differences were not statistically significant for any cancer type. There was no evidence of a trend in the HR by age for breast cancer, prostate cancer, pancreatic cancer, or any cancer in females or males (online supplemental table 8).
rMSV Helix score
rMSVs were categorised into high (>0.5) or low (≤0.5) Helix scores (online supplemental table 12a–f). In the pooled analyses, the RR for any cancer in females was higher in the Helix-high group (1.43 (95% CI 1.23 to 1.65, p=1.60×10−6)) than in the Helix-low group (1.20 (1.06 to 1.36, p=3.72×10−3); p-diff <0.0001). A higher RR for breast cancer was also found in the Helix-high group (1.62 (1.31 to 2.01, p=9.53×10−6) versus 1.42 (1.20 to 1.69, p=6.37×10−5); p-diff=0.031) than in the Helix-low group. A similar pattern was observed for any cancer in males (Helix-high: 1.35 (1.15 to 1.60, p=3.61×10−4) and Helix-low: 1.22 (1.06 to 1.39, p=4.31×10−3); p-diff=4.34×10−4), but not for prostate cancer.
Absolute risks
Online supplemental figure 1 shows the cumulative risks of female breast cancer by gene and variant type. By age 80, the cumulative risk of breast cancer was 30% (95% CI 16% to 41%) for ATM PTV carriers and 25% (19% to 30%) for CHEK2 PTV carriers, compared with 12% in the general population. The corresponding average cumulative risks for rMSV carriers were 18% (15% to 21%) for CHEK2 and 13% (12% to 15%) for ATM.
For prostate cancer (online supplemental figure 2), the cumulative risks by age 80 were 31% (23% to 38%) for ATM PTV carriers and 25% (20% to 29%) for CHEK2 PTV carriers. The corresponding average cumulative risk for rMSV carriers was 19% (16% to 21%) for CHEK2 and 15% (13% to 16%) for ATM.
The risks of pancreatic cancer for ATM mutation carriers by sex and variant type are shown in figure 2. By age 80, the estimated cumulative risk of pancreatic cancer in ATM PTV carriers was 8% (2% to 15%) in males and 6% (1% to 10%) in females. The cumulative risks for rMSV carriers were 2% (1% to 3%) in males and 2% (1% to 2%) in females.
The cumulative risk of developing any cancer by the age of 80 for ATM PTV carriers was 64% (95% CI 46% to 76%) for females and 69% (61% to 76%) for males, compared with 33% for females and 39% for males in the general population. The corresponding cumulative risks for CHEK2 PTV carriers were 49% (43% to 54%) for females and 52% (47% to 58%) for males (online supplemental figures 3 and 4, respectively).
Discussion
The pooled analyses show clear evidence of association between carrying a PTV in either gene and a higher risk of multiple cancers. For ATM, the RR for breast cancer (2.27 in the combined analysis) was consistent with previous case-control and family-based analyses.6 The RR for CIS (3.22 (95% CI 1.96 to 5.28)) was consistent with that for invasive breast cancer, and a similar RR was found for prostate cancer (2.35). The excess risk of other (non-breast and prostate) cancer was more marked than for CHEK2 PTVs, with a pooled RR of approximately twofold in both males and females. ATM PTVs were associated with significantly increased risk for 12 of 24 sites examined (9 at p<0.001), and (as for CHEK2) no sites had a significantly reduced risk. A previously reported association for pancreatic cancer was confirmed,30 with the RR for pancreatic cancer being the highest for any cancer type. There was some similarity with the spectrum of cancers associated with CHEK2 PTVs (eg, oesophageal cancer, DNHL, and melanoma being common), presumably reflecting their related roles in DNA damage response; notable differences are of the association with colon and lung cancer in ATM PTV carriers not seen in CHEK2.
For ATM rMSVs, there was only a small excess cancer risk overall (RR 1.06 in females and 1.13 in males). Previous research has shown that breast cancer risk appears to be largely restricted to a subset of evolutionary conserved MSVs in the FAT and PIK domains.9 When analyses were restricted to these variants, risks for breast cancer, prostate cancer, any cancer in females, and any cancer in males were higher for variants which lay within the FAT or PIK domains, with a high CADD score, and largely restricted to these variants. For pancreatic cancer, there was no evidence of a difference in RR by in silico risk category, but the confidence limits were wide.
For CHEK2 PTVs, the estimated RR for breast cancer (approximately twofold) is similar to that previously reported for recent large case-control studies.6 8 The RR for prostate cancer (1.92) is somewhat lower than some previous estimates,19 40 but here we provide much stronger evidence for the association with prostate cancer risk, and more precise estimates. The association with CIS, with a RR similar to that for invasive disease, is also consistent with that reported previously for c.1100delC.39
There was also evidence for an excess risk of other cancers in aggregate in CHEK2 PTV carriers, although breast and prostate cancer explained more than 50% of the excess cancer risk, and the remaining excess was only statistically significant in males. Statistically significant increased risks were observed for seven of the other 22 sites considered: oesophagus, ovary, kidney, HD, DNHL, ML, and melanoma. It is notable that no common sites were associated with a reduced risk—this and the clear association with overall cancer risk after excluding breast and prostate cancer suggests that many of these associations are genuine. While some other sites (eg, kidney) have been previously suggested to be associated,19 none have been firmly established. We did not, however, find evidence to support the previous observation of an increased risk of colorectal cancer.
Rare MSVs in CHEK2 were also associated with increased risk of both breast and prostate cancer, with RRs of approximately 1.4, comparable to previous estimates for breast cancer,6 25 and to estimates for the association of the I157T variant and prostate cancer.10 40 The RR for breast cancer was higher for variants classified as likely deleterious by Helix score. These results confirm, with similar RR estimates, previous analyses showing that Helix scores are predictive of risk, and that variants with low scores are also associated with risk. A similar pattern was seen for all cancers in both males and females, but not for prostate cancer. The latter result might, however, be due to chance, given the results for all cancers combined. A significant excess risk in rMSV carriers was observed for several other cancer types (melanoma, multiple myeloma, and LL), which were also associated with PTVs. Taken together, this is broadly consistent with the hypothesis that CHEK2 MSVs confer increased, but on average lower, risks of a similar spectrum of cancers to PTVs, with risks being mutation dependent. High-throughput functional assays to classify the degree of CHEK2 abrogation are becoming available and may be able to define variant-specific risks more precisely.41–43 Previous studies have shown that the CHEK2*I157T variant is associated with a reduced risk for lung cancer.44 There were too few carriers of this variant to investigate individually, but there was no evidence of a reduced risk associated with rMSVs in aggregate for this site.
Strengths and limitations
A major strength of this study is that, as a very large cohort study, it was possible to assess the risks for many cancer types simultaneously. Moreover, as individuals were genotyped independently of phenotype, the analyses should free from biases due to differential genotyping of cases and controls. In contrast to family-based studies, the risk estimates should be broadly applicable to the population. A potential limitation is that UK Biobank, in common with similar cohorts, is biased towards healthy volunteers: total cancer incidence for the age-group 70–74 is 12–18% lower than in the general population.45 However, while this affects the power to detect associations, it is unlikely that the RRs will be materially biased. Further, we generated absolute risk estimates by applying these RRs to national incidence rates. A significant limitation is that more than 90% of the cohort is of European ancestry, and we therefore restricted the analyses to this population. Other analyses indicate that the RRs conferred by cancer susceptibility genes tend to be similar across populations of different ancestries6 46; nevertheless, extrapolation to other ancestries may be less reliable (particularly if there are ancestry-specific variants associated with different risks).
Implications
ATM and CHEK2 are included on many cancer gene panels used in family cancer clinics, and the risk estimates from these analyses can inform genetic counselling for carriers. The estimated absolute risks for pancreatic cancer in ATM PTV carriers (11% in males and 8% in females by age 85) are notably higher than for other major pancreatic susceptibility genes including BRCA2, CDK2NA, and PALB2, although the confidence limits are wide. Current National Institute for Health and Care Excellence guidelines for pancreatic cancer management do not include ATM, though it is included in other guidelines. Given the frequency of ATM variants, ATM PTVs would explain ~2% of pancreatic cancer cases (with potentially a higher proportion due to MSVs). Prognosis for pancreatic cancer is poor, with only 7% of patients diagnosed with pancreatic cancer between 2013 and 2017 in England surviving 5 years.47 ATM variant carriers may provide a suitable population in which to evaluate new methods of early detection of this disease. Moreover, given the high excess lifetime risk for all cancers, carriers may be a good target population for methods, such as those based on circulating tumour DNA, than can be used for multiple cancer types.48
It is important to note that cancer risks are determined by a combination of multiple factors, including rare gene variants such as those in ATM and CHEK2 studied here, but also lifestyle factors, and commoner genetic variants. Ideally, counselling should be based on synthesising all available data, using risk models such as BOADICEA/CanRisk that have been developed for breast, ovarian, and prostate cancer.49 50 It would be relatively straightforward to extend BOADICEA/CanRisk to incorporate the ovarian and prostate cancer risks associated with ATM and CHEK2 PTVs. An ongoing challenge is to extend such models to consider a wider range of cancers.
Our analyses demonstrate that ATM and CHEK2 PTVs are associated with increased risks of a wide range of cancers, with the overall cancer risk being higher for ATM. CHEK2 rMSVs are also associated with smaller risks of a similar range of cancers; for ATM (but not CHEK2), the risks appear to be restricted to a small subset of rMSVs. The absolute magnitudes of these risks (with more than half of PTV carriers being affected by age 80) are significant and may influence guidelines for counselling and management.
Data availability statement
Data may be obtained from a third party and are not publicly available. Requests for access to UK Biobank data should be made to the UK Biobank Access Management Team (access@ukbiobank.ac.uk).
Ethics statements
Patient consent for publication
Ethics approval
UK Biobank has approval from the North West Multi-centre Research Ethics Committee (MREC; Research Ethics Committee reference: 11/NW/0382). This research has been conducted using the UK Biobank Resource under application number 28126. Participants gave informed consent to participate in the study before taking part.
References
Footnotes
Contributors DFE conceived the study, supervised the work, and directed the overall analyses. TKM performed the statistical analyses. TKM and DFE drafted the manuscript. JRBP, EG, and NW developed the bioinformatics pipeline for variant calling and quality control. NW, NM, JD, and XY provided data management support. MN provided bioinformatics support. All authors reviewed and approved the paper. TKM is the guarantor.
Funding This work has been funded by European Union’s Horizon 2020 Research and Innovation Program (BRIDGES grant 634935), Wellcome Trust (grant v203477/Z/16/Z), Cancer Research UK grant (PPRPGM-Nov20\100002), and the Medical Research Council (unit program: MC_UU_12015/2, MC_UU_00006/2). This work was also supported by core funding from the NIHR Cambridge Biomedical Research Centre (NIHR203312). NW was supported by the International Alliance for Cancer Early Detection, an alliance between Cancer Research UK (C14478/A29329), Canary Center at Stanford University, the University of Cambridge, OHSU Knight Cancer Institute, University College London, and the University of Manchester. TKM was supported by the NIHR Applied Research Collaboration NW London.
Disclaimer The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.
Competing interests DFE is listed as a creator of the BOADICEA model which has been licensed by Cambridge Enterprise, University of Cambridge. EG and JRBP hold shares in, and are employees of, Insmed.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.