Evaluation of the genetic overlap between osteoarthritis with body mass index and height using genome-wide association scan data
- Katherine S Elliott1,*,
- Kay Chapman1,2,*,
- Aaron Day-Williams3,
- Kalliope Panoutsopoulou3,
- Lorraine Southam1,3,
- Cecilia M Lindgren1,
- the GIANT consortium4,*,
- Nigel Arden5,6,
- Nadim Aslam7,
- Fraser Birrell8,9,
- Ian Carluke9,
- Andrew Carr2,4,
- Panos Deloukas3,
- Michael Doherty10,
- John Loughlin11,
- Andrew McCaskie11,12,
- William E R Ollier13,
- Ashok Rai14,
- Stuart Ralston15,
- Mike R Reed9,
- Timothy D Spector16,
- Ana M Valdes16,
- Gillian A Wallis17,
- Mark Wilkinson18,19,
- the arcOGEN consortium4,
- Eleftheria Zeggini3
- 1Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- 2Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
- 3Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
- 4NIHR Biomedical Research Unit, University of Oxford, Oxford, UK
- 5MRC Epidemiology Resource Centre, University of Southampton, Southampton, UK
- 6Orthopaedic Department, Worcestershire Acute Hospitals NHS Trust, Worcester, UK
- 7Musculoskeletal Research Group, Newcastle University, Institute of Cellular Medicine, The Medical School, Newcastle upon Tyne, UK
- 8Northumbria Healthcare NHS Foundation Trust, Wansbeck General Hospital, Ashington, UK
- 9Academic Rheumatology, University of Nottingham, Nottingham, UK
- 10Institute of Cellular Medicine, Musculoskeletal Research Group, Newcastle University, Newcastle upon Tyne, UK
- 11The Newcastle upon Tyne Hospitals NHS Trust Foundation Trust, The Freeman Hospital, Newcastle upon Tyne, UK
- 12Centre for Integrated Genomic Medical Research, University of Manchester, Manchester, UK
- 13Rheumatology Department, Worcestershire Acute Hospitals NHS Trust, Worcester, UK
- 14Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
- 15Department of Twin Research and Genetic Epidemiology, King's College London, London, UK
- 16Wellcome Trust Centre for Cell Matrix Research, University of Manchester, Manchester, UK
- 17Academic Unit of Bone Metabolism, Department of Human Metabolism, University of Sheffield, Sheffield, UK
- 18Sheffield NIHR Bone Biomedical Research Unit, Centre for Biomedical Research, Northern General Hospital, Sheffield, UK
- Correspondence to Eleftheria Zeggini, Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, UK; email@example.com
- Received 28 May 2011
- Accepted 19 July 2012
- Published Online First 6 September 2012
Objectives Obesity as measured by body mass index (BMI) is one of the major risk factors for osteoarthritis. In addition, genetic overlap has been reported between osteoarthritis and normal adult height variation. We investigated whether this relationship is due to a shared genetic aetiology on a genome-wide scale.
Methods We compared genetic association summary statistics (effect size, p value) for BMI and height from the GIANT consortium genome-wide association study (GWAS) with genetic association summary statistics from the arcOGEN consortium osteoarthritis GWAS. Significance was evaluated by permutation. Replication of osteoarthritis association of the highlighted signals was investigated in an independent dataset. Phenotypic information of height and BMI was accounted for in a separate analysis using osteoarthritis-free controls.
Results We found significant overlap between osteoarthritis and height (p=3.3×10−5 for signals with p≤0.05) when the GIANT and arcOGEN GWAS were compared. For signals with p≤0.001 we found 17 shared signals between osteoarthritis and height and four between osteoarthritis and BMI. However, only one of the height or BMI signals that had shown evidence of association with osteoarthritis in the arcOGEN GWAS was also associated with osteoarthritis in the independent dataset: rs12149832, within the FTO gene (combined p=2.3×10−5). As expected, this signal was attenuated when we adjusted for BMI.
Conclusions We found a significant excess of shared signals between both osteoarthritis and height and osteoarthritis and BMI, suggestive of a common genetic aetiology. However, only one signal showed association with osteoarthritis when followed up in a new dataset.
Osteoarthritis is a common complex disease of synovial joints characterised by degeneration of hyaline cartilage and bone remodelling, usually affecting middle-aged to elderly individuals. It is a leading cause of pain and chronic disability worldwide.1 Comorbidities such as obesity are frequently observed with osteoarthritis and epidemiological studies have noted a link between osteoarthritis and obesity as measured by body mass index (BMI). In particular, reports show a consistent relationship between overweight measures and knee osteoarthritis.2 Some population studies have demonstrated that the weight of individuals at age 37 years (median) could predict the onset of knee osteoarthritis 36 years later.1 In addition, a decrease in BMI of two units over the 10 years preceding diagnosis can reduce the odds of knee osteoarthritis.3 A Norwegian population-based study of approximately 265 000 individuals concluded that the risk of developing hip osteoarthritis was dependent on the age at which weight gain was most dramatic. Younger adults (<20 years) are at greater risk compared with older individuals (>30 years).4 In a large prospective population-based cohort from Iceland the incidence of clinically severe osteoarthritis (as indicated by arthroplasty), in relation to measures of overweight, found that 36% and 50% of those with hip and knee osteoarthritis, respectively, had a BMI greater than 30.5 This is compared with a national prevalence of 17% of the adult population (the International Obesity Task force, http://www.iotf.org/). Furthermore, the Chingford Study has demonstrated that in middle-aged women a one unit increase in BMI is associated with a 10% increased risk of total knee replacement in the following 19 years.6 There have been a number of large-scale genome-wide association studies (GWAS) for obesity and/or BMI, establishing several genetic loci at genome-wide significance association levels.7–13 Based on the well-established epidemiological link, we hypothesise here that osteoarthritis and obesity may have a shared genetic background.14
Variation in human adult height is also highly heritable. Numerous studies have identified over 180 loci to be associated with the trait.15–18 There is weak or conflicting evidence for shared genetic determinants between osteoarthritis and height. However, for example, the functional single nucleotide polymorphism (SNP) (rs143383, T/C) in the 5′ untranslated region of the GDF5 gene, previously observed to be significantly associated with osteoarthritis in Asian and European cohorts, is also significantly associated with normal variation in human height.19–23 In addition, height itself has been reported to be a risk factor for non-generalised severe hip osteoarthritis even after adjusting for age, gender and BMI.24
The aim of this study was to carry out an investigation of the genetic overlap between osteoarthritis and the two traits of BMI and height by examining the overlap of SNPs association signals across the genome. This may uncover possible common mechanistic pathways.
Materials and methods
Description of datasets
Genome-wide summary statistics (effect size, p values) for BMI and height from the Genetic Investigation of Anthropometric Traits (GIANT) consortium GWAS were compared with genome-wide osteoarthritis data from the arcOGEN consortium. The GIANT consortium has brought together GWAS data from 46 studies.25 ,26 Overlap analysis with osteoarthritis utilised 2 400 344 SNPs and 32 387 individuals from the BMI dataset and 2 834 208 SNPs and 133 653 individuals from the height dataset.
The arcOGEN GWAS was carried out in two stages and includes a total of 7567 osteoarthritis cases from the UK (ascertained by radiographic evidence of disease, Kellgren–Lawrence score ≥2, or clinical evidence of the disease to a level requiring total joint replacement) genotyped on the Illumina HumanHap 610-Quad panel. Stage 1 of the arcOGEN GWAS was employed in the main overlap analysis and included 3177 osteoarthritis cases and 4894 population-based controls from the UK (WTCCC2).27 Genotypes of 17 SNPs that were imputed in arcOGEN stage 1 were validated by direct typing using Sequenom in the stage 1 cases (n=2949) and examining concordance. Replication of association with osteoarthritis for the signals highlighted from the overlap analysis (tables 1 and 2) was carried out using 4324 stage 2 cases from the arcOGEN Consortium and 6518 population-based controls (from the WTCCC2, T1DGC, ALSPAC study and PoBI studies) (see supplementary methods, available online only). SNPs that were not genotyped in the stage 2 arcOGEN GWAS or did not pass quality control were genotyped with Sequenom in 5165 cases and 6115 controls (see supplementary methods, available online only).
Analysis accounting for phenotypic information of height and BMI was carried out using 1671 unrelated female individuals from the osteoarthritis-free TwinsUK cohort as a control set. This cohort is ascertained to study the heritability of age-related diseases and contains full phenotypic information for osteoarthritis status as well as height and BMI.24 Additional quality control that was performed for this study is described in the supplementary methods (available online only). arcOGEN stage 1 female cases (n=1009 for height; n=1358 for BMI) were utilised for these analyses.
Osteoarthritis replication genotyping
Osteoarthritis association signals at directly typed variants, highlighted in tables 3 and 4, were followed up through in-silico replication using stage 2 arcOGEN GWAS data (see supplementary methods, available online only). Association signals at imputed variants were followed up by carrying out de-novo genotyping in 5165 arcOGEN stage 2 cases and 6115 WTCCC2 controls using the Sequenom MassArray iPLex Gold assay at the Wellcome Trust Sanger Institute. Genotypes were assigned using the MassArray TyperAnalyser software V.4.0 (Sequenom). All genotypes were confirmed manually and passed standard quality control checks (see supplementary methods, available online only).
We carried out pairwise comparisons between osteoarthritis and height and between osteoarthritis and BMI genome-wide summary statistics. For each comparison, we focused on the intersection of SNPs for which summary statistics were present in both GWAS. We then sorted these SNPs based on association p value for osteoarthritis. This list of SNPs was then thinned to an independent unlinked set using an r2 threshold of 0.05 based on HapMap CEU release #27. Starting with the first SNP in the list, any subsequent SNP with r2>0.05 was removed and then the next available SNP was taken. This continued until a set of independent SNPs was obtained (osteoarthritis–BMI n=62 280, osteoarthritis–height n=64 702).
We investigated the distribution of p values above and below given thresholds (0.5, 0.1, 0.05, 0.04, 0.03, 0.02, 0.01, 0.005, 0.001, 0.0005) for each trait. The distribution of counts in the resulting 2×2 contingency tables was analysed using the χ2 test. A significant excess of signals with p values less than the given threshold for both phenotypes was taken to indicate a concurrence of signals.
In addition, we examined the SNPs for each comparative analysis of osteoarthritis–height and osteoarthritis–BMI to see if there was an overabundance of discordant or concordant risk alleles between the datasets (see supplementary methods, available online only).
Based on the results obtained for the analysis of the signal overlap between the osteoarthritis–height and osteoarthritis–BMI comparisons (table 5), we selected p value thresholds of 0.001 (osteoarthritis–BMI) and 0.05 (osteoarthritis–height) for further follow-up. We permuted the p value signals for the entire datasets as well as for linkage disequilibrium (LD)-thinned data of r2 = 0.2 and 0.05. We generated 100 000 permutations of the arcOGEN case–control data using PLINK28 (make-perm-pheno command) and performed a GWAS for each permutation under the log-additive model. We carried out the overlap analysis for each permuted case–control dataset considering only SNPs that were directly typed in arcOGEN and present in the GIANT data. We thus generated a null distribution of p values. From this we calculated the probability of seeing an overlap p value equal to or less than the original p value for directly typed SNPs.
In addition, we sought to get a more precise empirical p value for the osteoarthritis–height comparison as this gave the most compelling results for the signal overlap analysis (table 5). Using the LD-thinned data (r2 = 0.05) and p value threshold of 0.05, we generated 500 000 000 permutations of the arcOGEN and GIANT height datasets by permuting which p value was associated with which SNP. We performed 500 000 000 overlap analyses by randomly choosing without replacement a permutation from each dataset to generate the null distribution of overlap p values, given a specific distribution of original p values. From this constructed null distribution of p values we calculated the probability of seeing an overlap p value equal to or less than the original overlap p value for the entire dataset.
Replication of osteoarthritis association for overlapping signals
Case–control association analysis under the log-additive model was carried out using PLINK for directly typed SNPs and SNPTEST for imputed SNPs.28 ,30 Combined estimates of OR and p values for stages 1 and 2 of arcOGEN were obtained using fixed-effect meta-analyses in GWAMA.29 Heterogeneity was checked using the Cochran's Q and I2 statistics.
Analysis with adjustment for BMI and height
In order to adjust for height and BMI as covariates, we repeated the case–control analysis for stage 1 of the arcOGEN dataset using the TwinsUK cohort as controls. Analysis was carried out using PLINK when the SNPs were directly typed or SNPTEST30 when they were imputed. The analysis was performed twice; with and without an adjustment for height and BMI.
Overlapping signals and permutations
Our findings suggest an excess of shared signals both between osteoarthritis and height and osteoarthritis and BMI. A comparison of signals indicates an excess of sharing at p value thresholds of 0.05, 0.04, 0.03, 0.02 and 0.01 for osteoarthritis and height; there is evidence for overlap between osteoarthritis and BMI at the p value thresholds of 0.005, 0.001 and 0.0005 (table 5).
To test the strength of the observed overlap we ran a series of permutations (table 6). The LD-thinned datasets provide the most robust results because the probability of seeing an overlap p value equal to or less than the original analysis p value by chance is unlikely. Both comparisons of osteoarthritis–height and osteoarthritis–BMI showed an excess of overlapping signals (p=1.4×10−3 for the 0.05 p value threshold and p=2.28×10−2 for the 0.001 p value threshold). Based on these results we performed 500 000 000 permutations of the entire LD-thinned (r2 = 0.05) height dataset. This showed that the probability of seeing a p value less than or equal to 3.04×10−5 was 3.3×10−5. A total of 17 SNPs with p values of 1.0×10−3 or less were shared between osteoarthritis and height (table 1), while four SNPs were shared between osteoarthritis and BMI (table 2). These SNPs were distributed throughout the genome (11 chromosomes in the osteoarthritis–height comparison and three chromosomes in the osteoarthritis–BMI comparison). Some of these signals, such as the ones near COL11A1, PTHLH and FTO are well-known loci with established associations with bone development, bone mineral density and obesity, respectively.8 ,19 ,31
Replication of osteoarthritis association
To evaluate the observed overlap further we attempted to replicate osteoarthritis association of these 21 signals employing stage 2 of the arcOGEN dataset (tables 3 and 4). Seven of the SNP failed quality control in stage 2 and proxies (r2>0.85) were sought. No proxies were found for two of the seven SNPs, rs2744718 and rs11198893. Of the 19 SNPs successfully taken forward for validation, rs12149832 on chromosome 16 within the FTO gene was the only one found to be associated (p<0.01) with osteoarthritis in the replication dataset (p=0.009, in the same direction). The combined p value of both stages increased in significance for this SNP relative to stage 1 alone (p=2.8×10−4 for stage 1 vs p=2.3×10−5 for stages 1 and 2 combined, table 4).
Adjustment for BMI and height
Adjustment for height and BMI (tables 7 and 8) only affected the signal at the FTO SNP rs12149832. Here we found an eightfold increase in the p value after adjustment for BMI (p=0.22576) compared with the unadjusted result (p=0.029219).
Identification of the genetic loci contributing to variation in quantitative traits such as height and BMI, and risk of osteoarthritis could help elucidate possible mechanistic pathways. There is an established genetic link between height and osteoarthritis. The pleiotropic action of GDF5 on human height is an example that may shed light on shared signalling functions and pathways affecting the two traits.19 Epidemiological evidence has also suggested a link between osteoarthritis and BMI.32 It is plausible that these traits also share genetic associations and we carried out a SNP-by-SNP pairwise comparison of GWAS data to investigate their genetic overlap.
We obtained evidence for overlap of association signals between osteoarthritis and height and between osteoarthritis and BMI at different definition thresholds, corroborated by permutation analyses to obtain empirical p values. We investigated specific signals that may be representative of these findings and looked at all SNPs with p≤1.0×10−3 for both comparisons. Some signals reside in the vicinity of genes, such as the structural protein collagen gene COL11A1 and the parathyroid hormone-related protein PTHLH that regulates endochondral bone development, which have previously been identified as possible candidates for osteoarthritis susceptibility.33–36 For the osteoarthritis–BMI comparison the FTO gene for obesity was highlighted.
Using a second dataset we attempted to replicate the osteoarthritis association of overlapping signals. The fact that the FTO locus was the only one to replicate in our second osteoarthritis dataset suggests that the other signals may have been false positive signals for osteoarthritis, or low power in the replication cohort. Adjustment for BMI attenuated this osteoarthritis signal, indicating that the primary association is with BMI.
The established osteoarthritis and height overlapping signal rs143383 located in GDF5 was not identified in this analysis. We found it to be strongly associated with height (p=1.94×10−50), but not associated with osteoarthritis in the arcOGEN dataset (p=0.602). Although association between the GDF5 locus and hip and knee osteoarthritis was first reported in a study of Japanese and Chinese individuals in 2007,20 it took several years and large-scale meta-analysis efforts to replicate the association robustly in European populations.21–23 In addition to allele frequency disparities between ethnic groups, this observation also highlights the limited power (<10% at α=5×10−8) of a dataset such as arcOGEN (comprising 3177 cases and 4894 controls) to detect a signal with modest effect (OR 1.15) and common risk allele frequency (∼0.60 for the GDF5 signal).27 Our results should be interpreted within the power constraints of our study. First, osteoarthritis is a heterogeneous disease and the definition of the cases here was primarily based on painful rather than structural osteoarthritis. Second, we examined GWAS platform SNP content rather than known causal variants. Finally, the osteoarthritis GWAS used population-based controls, which can dilute power due to misclassifications of cases as controls in a common disease such as osteoarthritis.
In conclusion, our genome-wide comparison of GIANT and arcOGEN generated evidence for an overall excess of overlapping signals between osteoarthritis and the two quantitative traits of BMI and height. The FTO signal was robustly associated with BMI and osteoarthritis, and showed evidence of association in the replication osteoarthritis dataset. This signal underpins the known epidemiological link between BMI and osteoarthritis, and represents the single largest genetic effect for BMI, which may have facilitated its identification as a shared locus. Better-powered GWAS datasets, along with large-scale replication samples, will help unveil additional shared loci and highlight common biological pathways.
↵*For GIANT and arcOGEN consortium members see supplementary material (available online only).
Funding arcOGEN (http://www.arcogen.org.uk/) is funded by a special purpose grant from Arthritis Research UK (grant 18030). This study was funded by The Wellcome Trust (098051 and WT079557MA), Arthritis Research UK, The Wishbone Trust, The Collisson Foundation, The Botnar Foundation, The Lord Nuffield Orthopaedic Trust, The Jean Shanks foundation, NIHR Musculoskeletal Biomedical Research Unit and EU Fp7 Treat-OA. For Newcastle we acknowledge the support of the UK NIHR BRC for Ageing and Age-related disease award to the Newcastle upon Tyne Hospitals NHS Foundation Trust and the support of the Northumberland, Tyne and Wear CLRN. KC is a Botnar Research Fellow.
Contributors Data generation: KSE, KC, ADW, KP, LS, CML; GIANT: NKA, AC, PD, MD, JL, AM, WERO, SR, TDS, AMV, GAW, JMW, AR, NA, FB, MRR, IC; arcOGEN: EZ. Data analysis: KSE, KC, ADW, KP, EZ. Manuscript preparation: KSE, KC, KP, ADW, EZ. Critical reading and approval of manuscript: KSE, KC, ADW, KP, LS, CML, NKA, AC, PD, MD, JL, AM, WERO, SR, TDS, AMV, GAW, JMW, AR, NA, FB, MRR, IC, EZ. The first two authors contributed equally.
Competing interests None.
Ethics approval Ethics approval for this study was obtained from National Research Ethics Service (REC ref# 07/H0606/150).
Provenance and peer review Not commissioned; externally peer reviewed.
This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/3.0/ and http://creativecommons.org/licenses/by-nc/3.0/legalcode