Article Text

This article has a correction. Please see:

Extended report
PADI4 genotype is not associated with rheumatoid arthritis in a large UK Caucasian population
  1. Marian L Burr1,
  2. Haris Naseem1,
  3. Anne Hinks1,
  4. Steve Eyre1,
  5. Laura J Gibbons1,
  6. John Bowes1,
  7. Anthony G Wilson2,
  8. James Maxwell2,
  9. Ann W Morgan3,
  10. Paul Emery3,
  11. Sophia Steer4,
  12. Lynne Hocking5,
  13. David M Reid5,
  14. Paul Wordsworth6,
  15. Pille Harrison6,
  16. Wendy Thomson1,
  17. Jane Worthington1,
  18. BIRAC Consortium7,
  19. YEAR Consortium8,
  20. Anne Barton1
  1. 1arc-Epidemiology Unit, University of Manchester, Manchester, UK
  2. 2School of Medicine and Biomedical Sciences, University of Sheffield, Sheffield, UK
  3. 3Section of Musculoskeletal Disease, Leeds Institute of Molecular Medicine, University of Leeds, UK
  4. 4Clinical and Academic Rheumatology, King's College Hospital NHS Foundation Trust, London, UK
  5. 5Bone Research Group, Department of Medicine and Therapeutics, University of Aberdeen, UK
  6. 6University of Oxford Institute of Musculoskeletal Sciences, Botnar Research Centre, Oxford, UK
  7. 7BIRAC Consortium
  8. 8YEAR Consortium
  1. Correspondence to Dr Anne Barton, arc-Epidemiology Unit, Stopford Building, Oxford Road, University of Manchester, Manchester M13 9PT, UK; anne.barton{at}


Background Polymorphisms of the peptidylarginine deiminase type 4 (PADI4) gene confer susceptibility to rheumatoid arthritis (RA) in East Asian people. However, studies in European populations have produced conflicting results. This study explored the association of the PADI4 genotype with RA in a large UK Caucasian population.

Methods The PADI4_94 (rs2240340) single nucleotide polymorphism (SNP) was directly genotyped in a cohort of unrelated UK Caucasian patients with RA (n=3732) and population controls (n=3039). Imputed data from the Wellcome Trust Case Control Consortium (WTCCC) was used to investigate the association of PADI4_94 with RA in an independent group of RA cases (n=1859) and controls (n=10 599). A further 56 SNPs spanning the PADI4 gene were investigated for association with RA using data from the WTCCC study.

Results The PADI4_94 genotype was not associated with RA in either the present cohort or the WTCCC cohort. Combined analysis of all the cases of RA (n=5591) and controls (n=13 638) gave an overall OR of 1.01 (95% CI 0.96 to 1.05, p=0.72). No association with anti-CCP antibodies and no interaction with either shared epitope or PTPN22 was detected. No evidence for association with RA was identified for any of the PADI4 SNPs investigated. Meta-analysis of previously published studies and our data confirmed no significant association between the PADI4_94 genotype and RA in people of European descent (OR 1.06, 95% CI 0.99 to 1.13, p=0.12).

Conclusion In the largest study performed to date, the PADI4 genotype was not a significant risk factor for RA in people of European ancestry, in contrast to Asian populations.

This paper is freely available online under the BMJ Journals unlocked scheme, see

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.


Rheumatoid arthritis (RA) is a complex autoimmune disease in which genetic and environmental factors contribute to the pathogenesis. Around 60% of the risk of RA is genetic, a third of which is accounted for by HLA-DRB1.1 Numerous polymorphisms outside the HLA region have recently been confirmed as RA susceptibility loci in Caucasians, but fewer have been tested across different ethnic populations and the question of whether racial heterogeneity exists—that is, whether possession of a particular allele may confer disease susceptibility in one ethnic group but not another—is contentious.2,,4 The peptidylarginine deiminase 4 (PADI4) gene is of particular interest as it has been tested in Asian, European and North American populations and its relative effect in relation to RA susceptibility across these groups remains controversial.5,,13

The PADI4 gene encodes the type 4 peptidylarginine deiminase enzyme which catalyses the post-translational modification of arginine to citrulline, generating citrullinated proteins. Antibodies to these peptides are highly specific for RA and often predate the development of disease, suggesting a critical role in the pathogenesis of RA. PADI4 therefore represents an attractive RA candidate gene and was first reported to be associated with RA in a Japanese population in 2003.5 This association has been consistently replicated in East Asian populations6 11 12; however findings in cohorts of European ancestry have been inconsistent. Studies in Spanish, Swedish and UK populations reported no evidence for association of PADI4 with RA.7 9 10 Conversely, PADI4 was found to be associated with RA in North American and German populations and two published meta-analyses suggested that PADI4 polymorphisms do confer susceptibility to RA in those of European descent, albeit to a lesser degree than in Asian subjects.10 13,,15 Consequently, it was hypothesised that these European studies were underpowered to detect a true but modest genetic effect. The present study was designed to address this issue by exploring the association between the PADI4 genotype and RA in a large UK population.

Materials and methods

Study design

The PADI4_94 single nucleotide polymorphism (SNP) (rs2240340) was selected for investigation as it has the strongest evidence for association with RA in Asians and Caucasians.5 12 14 15 It was genotyped in an independent UK Caucasian population of 3732 patients with RA and 3039 controls (see online supplement). In addition, imputed genotypes for the PADI4_94 SNP were compared between 1859 patients with RA and 2935 controls from the Wellcome Trust Case Control Consortium (WTCCC) study.16 Where linkage disequilibrium (LD) is high and confidence scores for imputed genotypes exceed 95%, the accuracy of imputation in predicting actual genotype counts exceeds 98.4%.17 An expanded reference group of 10 599 subjects was created by using imputed genotype data for PADI4_94 from the four non-autoimmune disease case subjects (hypertension, coronary artery disease, type 2 diabetes and bipolar disorder) genotyped as part of the WTCCC study and combining this with the genotype data from the healthy controls. The data from the present cohort and the WTCCC study were combined to provide a robust estimate of effect size for this SNP in the UK population, giving a combined sample size of 5591 cases of RA and 13 638 controls. In addition, imputed genotype data for the original WTCCC cohort (1860 cases of RA, 2938 controls) were used to investigate other SNPs spanning the PADI4 gene for evidence of association with RA.

Analysis of data

Allele and genotype frequencies were compared between patients with RA and controls using the χ2 test for trend implemented in PLINK. The threshold for significance was defined at p<0.05.


Meta-analysis of the results together with previous studies investigating association of PADI4_94 with RA in populations of European ancestry was performed (see online supplement). A random effects model was used and between-study heterogeneity assessed using the Cochran Q-statistic (p<0.1 considered significant).

Interaction analysis

Data were available for shared epitope (SE) and the PTPN22 R620W SNP (rs2476601) in the current cohort. The risk of RA associated with carriage of PADI4_94, PTPN22 and SE risk alleles (1 or 2 copies) alone and in combination was calculated using logistic regression. Interaction effects were quantified by calculating the attributable proportion,18 which assesses the proportion of the incidence that is due to interaction (ie, beyond the additive effects of each independent variant). There is evidence of biological interaction if the attributable proportion is not equal to 0.


The present cohort had >80% power to detect the published OR of 1.13 at the 5% significance level (α=0.05) with a risk allele frequency of 0.42.15


The genotyping success rate was >97% in both cases and controls. In this independent cohort of 3732 cases of RA and 3039 controls, no significant difference in PADI_94 allele or genotype frequencies was detected (table 1).

WTCCC cohort

Table 1

PADI4_94 (rs2240340) genotypes in current and WTCCC cohorts

Imputed minor allele frequencies for PADI_94 in the WTCCC cohort were similar to those previously reported in European populations and to those observed in the current study. No association between PADI4_94 genotype and RA was observed (table 1).

Combined analysis

Combined analysis of the current and WTCCC cohorts showed no evidence for association between PADI4_94 genotype and RA. No significant heterogeneity between these two cohorts was detected (phet=0.43, I2=0%) and meta-analysis under a random effects model yielded a similar overall OR of 1.00 (95% CI 0.95 to 1.05). There was no significant difference in genotype distribution between the WTCCC patients with non-autoimmune disease, the original WTCCC controls and the current controls (p=0.20, χ2=6.0). Excluding the WTCCC patients with non-autoimmune disease from the analysis did not affect the outcome (OR 1.02 (95% CI 0.97 to 1.08)).

Studies in Asian populations have shown that PADI4_94 exerts an allele dose-dependent effect in RA susceptibility, with the greatest effect being seen when comparing minor allele homozygotes (2/2) with major allele homozygotes (1/1).5 12 In contrast, we found no evidence for association in any of the genotypic models tested (table 1).


Stratification analysis creates more homogenous subsets of patients and thus may increase the power to detect association despite loss of sample size. Phenotype data were available for a proportion of patients within the current cohort. Stratification by autoantibody status, gender and SE revealed no evidence for association in any of the subgroups tested (table 2).

Table 2

PADI4_94 genotype in current cohort stratified by autoantibody status, carriage of SE, presence of erosions and gender


Five eligible studies investigating the PADI4_94 SNP for association with RA in Caucasian populations were identified.8,,10 13 19 Eight separate comparisons were available for the allelic model (minor allele 2 vs common allele 1) and seven for the genotypic model (1/2 vs 1/1 and 2/2 vs 1/1). No significant association between PADI4_94 and RA was detected for any of the models tested (figure 1). The pooled OR for allele 2 vs allele 1 was 1.06 (95% CI 0.99 to 1.13, p=0.12). The OR for 1/2 vs 1/1 was 1.04 (95% CI 0.92 to 1.17, p=0.53) and for 2/2 vs 1/1 was 1.07 (95% CI0.96 to1.19, p=0.20). Significant between-study heterogeneity was noted (phet=0.06, I2=49.1%). Analysis restricted to European cohorts (ie, excluding North American cohorts whose genetic background may be more ethnically diverse) gave an OR of 1.01 (95% CI 0.96 to 1.07) with no significant heterogeneity (phet=0.31, I2=16.8%).

Figure 1

Meta-analysis of PADI4_94 in populations of European descent. Odds ratio (OR), minor allele (2) vs common allele (1). Weight expressed as percentage.

Interaction analysis

Significant interaction between SE and PTPN22 was detected in anti-CCP positive RA, consistent with previous reports (tables 3 and 4).20 In contrast, there was no significant interaction between PADI4_94 and either PTPN22 or SE. In a model containing all three factors, the highest ORs were seen with possession of SE alleles in conjunction with the PTPN22 risk allele, regardless of the presence or absence of the PADI4_94 putative risk allele (data not shown).

Table 3

Details of studies included in meta-analysis and subsequent analysis of SE, PTN22 and PADI4_94 risk allele combinations

Table 4

Odds ratios for developing rheumatoid arthritis (RA) according to presence or absence of SE, PTPN22 R620W and PADI4_94 risk alleles: allele 2 (minor allele) vs allele 1 (common allele)

Further PADI4 SNPs

Imputed genotype data were available for 1860 cases of RA and 2938 controls from the WTCCC study. A further 56 SNPs spanning the PADI4 gene with imputation confidence scores of over 99% were identified. No evidence for association with RA was detected for any of these SNPs (see table 1 in online supplement). Importantly, PADI4_89 and PADI4_90, which have previously been reported to be associated with RA in Caucasians, were not associated in this UK cohort (see table 2 in online supplement).13 21


In the largest study performed to date, we found no evidence for association between the PADI4_94 SNP and RA in a combined sample of over 19 000 UK subjects. The results were consistent across two large independent populations, lending weight to these findings.

This contrasts with the convincing evidence that PADI4 is an RA susceptibility gene in East Asian subjects.5 6 11 12 The strongest association is seen with PADI4_94, with an estimated OR of 1.31, making it the major genetic risk factor outside the HLA region in this group.12 14 15 Consistent with our findings, several other groups have failed to find an association between PADI4_94 and RA in populations of European ancestry. However, meta-analyses of published data in Caucasians demonstrated evidence for association with a summary OR of 1.13.14 15 For a study to have 80% power to detect an OR of 1.13 at p<0.05, more than 4200 subjects would be required. All previous studies have been underpowered to detect an OR of this level, conceivably accounting for the frequent failure to replicate the association. However, these meta-analyses were based on comparatively limited data (pooled total of 2950 RA cases and 2300 controls) and findings may have been influenced by publication bias and heterogeneity between studies. The present study circumvents these problems by using two independent cohorts, each with more than 80% power to detect the published OR, thus minimising the chance of a type II error. Furthermore, there was no significant difference when comparing minor allele homozygotes with major allele homozygotes (OR 1.02), which is where the greatest effect is seen in Asian populations (OR 1.73).12 Our findings suggest that previous reported associations between PADI4_94 and RA in European populations are false-positive results.

The differential effect of the PADI4_94 SNP in populations of Asian and European descent is unusual as, although the frequency of complex disease-associated polymorphisms may vary across ethnic groups, their genetic effects are usually consistent.2 There are several plausible explanations for this discrepancy. First, linkage disequilibrium varies between races, so PADI4_94 may be in linkage disequilibrium with the true disease-associated allele in Asian but not Caucasian populations. However, PADI4 SNP and haplotype frequencies are similar in the two populations.7 Second, it is possible that different PADI4 polymorphisms may be associated with RA in Caucasians. However, this would be unlikely given that we investigated numerous SNPs spanning the PADI4 gene within the WTCCC cohort and found no evidence for association across the locus as a whole. Third, the biological effect of PADI4_94 variants may be modulated by environmental exposures such as smoking or interactions with other genes which may differ between races. The PTPN22 R620W variant is a potential candidate for introducing variation via gene–gene interaction as it is a major risk factor for RA in Caucasians but has not been detected in Japanese populations. However, we found no interaction between PADI4_94 and PTPN22 in this study. Alternatively, it may be that PADI4 is associated only with a subgroup of patients with RA. Hoppe et al recently suggested that the association between PADI4 genotype and RA may be restricted to patients with more severe disease.22 It is unlikely, however, that this would account for the differences observed as the cohorts tested in the WTCCC and current studies were comparable with the Japanese cohort in terms of disease characteristics. Moreover, PADI4 genotype was not associated with the presence of cyclic citrullinated peptide (CCP) antibody, rheumatoid factor or erosions in this study, and we have previously found no relationship between PADI4 genotype and disease severity in UK patients with early inflammatory arthritis followed prospectively.23

Finally, the disparity in genetic effect may reflect genuine differences in pathogenic processes between European and Asian populations. The mechanism by which the PADI4 genotype may influence RA susceptibility has not yet been elucidated. Suzuki et al showed that the PADI4 susceptibility haplotype had significantly increased mRNA stability compared with the non-susceptibility haplotype. In theory this could result in increased PAD4 enzyme, with consequently increased protein citrullination which may break immune tolerance leading to production of anti-citrullinated peptide antibodies (ACPA) and disease.5 They went on to demonstrate an association between homozygosity for the PADI4 susceptibility haplotype and the presence of ACPA. However, we and many others have failed to find any link between PADI4 genotype and presence of anti-CCP antibodies, drawing this hypothesis into question.8 10 23,,25 Furthermore, given that individuals carrying SE alleles may be predisposed to mount an immune response to citrullinated peptides, the absence of a significant interaction between PADI4_94 and SE in this study is of relevance.26 We did confirm the previously reported interaction between PTPN22 and SE, supporting the validity of our data.20 In a Korean cohort, PADI4 and SE had additive effects with regard to the risk of RA, although no significant interaction was reported.11 It would be interesting to investigate this further in Asian populations as distribution of HLA subtypes varies across races and it has been suggested that this could lead to differential modulation of the genetic effects of PADI4.8 11

In conclusion, in contrast to Asian populations, the PADI4 genotype is not a significant risk factor for RA in people of European descent. Identification of biological mechanisms responsible for this disparity may yield novel insights into the pathogenic processes underpinning rheumatoid disease.


The authors thank the Arthritis Research Campaign for their support (arc grant reference number 17552), the Wellcome Trust Case Control Consortium (WTCCC) for providing the genotype data included and acknowledge support from the Manchester Biomedical Research Centre.


View Abstract

Supplementary materials

  • Web Only Data ard.2009.111294

    Files in this Data Supplement:


  • Competing interests None.

  • Ethics approval This study was conducted with the approval of the North West Research ethics committee (MREC 99/9/84).

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Membership of the BIRAC Consortium and YEAR Consortium is shown in the online supplement.

Linked Articles

  • Miscellaneous
    BMJ Publishing Group Ltd and European League Against Rheumatism