Background: HLA-DRB1 shared epitope (HLA-SE), PTPN22 and CTLA4 alleles are associated with cyclic citrullinated peptide (CCP) and rheumatoid arthritis (RA).
Objective: We examined associations between HLA-SE, PTPN22, CTLA4 genotypes and RA phenotypes in a large cohort to (a) replicate prior associations with CCP status, and (b) determine associations with radiographic erosions and age of diagnosis.
Methods: A total of 689 RA patients from the Brigham RA Sequential Study (BRASS) were genotyped for HLA-SE, PTPN22 (rs2476601) and CTLA4 (rs3087243). Association between genotypes and CCP, rheumatoid factor (RF) erosive phenotypes and age at diagnosis were assessed with multivariable models adjusting for age, sex and disease duration. Novel causal pathway analysis was used to test the hypothesis that genetic risk factors and CCP are in the causal pathway for predicting erosions.
Results: In multivariable analysis, presence of any HLA-SE was strongly associated with CCP+ (odds ratio (OR) 3.05, 95% CI 2.18–4.25), and RF+ (OR 2.53, 95% CI 1.83–3.5) phenotypes; presence of any PTPN22 T allele was associated with CCP+ (OR 1.81, 95% CI 1.24–2.66) and RF+ phenotypes (OR 1.84, 95% CI 1.27–2.66). CTLA4 was not associated with CCP or RF phenotypes. While HLA-SE was associated with erosive RA phenotype (OR 1.52, 95% CI 1.01–2.17), this was no longer significant after conditioning on CCP. PTPN22 and CTLA4 were not associated with erosive phenotype. Presence of any HLA-SE was associated with an average 3.6 years earlier diagnosis compared with absence of HLA-SE (41.3 vs 44.9 years, p = 0.002) and PTPN22 was associated with a 4.2 years earlier age of diagnosis (39.5 vs 43.6 years, p = 0.002). CTLA4 genotypes were not associated with age at diagnosis of RA.
Conclusions: In this large clinical cohort, we replicated the association between HLA-SE and PTPN22, but not CTLA4 with CCP+ and RF+ phenotypes. We also found evidence for associations between HLA-SE, and PTPN22 and earlier age at diagnosis. Since HLA-SE is associated with erosive phenotype in unconditional analysis, but is not significant after conditioning on CCP, this suggests that CCP is in the causal pathway for predicting erosive phenotype.
Statistics from Altmetric.com
Box 1 Criteria
1: x and y are correlated
2: y and z are correlated
3: x and z are uncorrelated, conditioning on y
If 1, 2 and 3 are true and since x (genotype) is not caused by either y or z, then using Cooper’s LCD the only possible relationship between x, y and z is:
Causal pathway 1:
HLA-SE (x)→CCP (y)→RF (z)
Causal pathway 2:
HLA-SE (x)→CCP (y)→erosion (z)
Genetic factors are thought to be responsible for up to 50%–60% of rheumatoid arthritis (RA) risk.1–3 Two genes have been unequivocally associated with RA susceptibility (Human leukocyte antigen HLA-DRB1 and PTPN22), while other genes demonstrate strong, but inconclusive risk (eg. CTLA, PADI4).4 Although the HLA associations with RA are complex,5 6 the majority of the genetic signal from HLA is explained by alleles at the HLA-DRB1 locus,7 and account for approximately 30% of the genetic risk of RA.1 In individuals of European ancestry, the associated HLA-DRB1 alleles share a region of sequence similarity or “shared epitope” at amino acid positions 70–74 in the third hypervariable region of the HLA-DRB1 molecule1 (Human leukocyte antigen shared epitope; HLA-SE). Outside HLA, the only genetic polymorphism that has been associated with RA susceptibility in populations of European ancestry and replicated across multiple independent studies is PTPN22.3 8–14 A missense allele (C→T) is associated with an increased risk of RA (rs2476601), with a summary odds ratio (OR) of 1.68 (1.53–1.84) from a meta-analysis.15 The HLA-SE alleles and the PTPN22 allele are more strongly associated with the phenotype cyclic citrullinated peptide positive (CCP+) RA.4 16–19 Cytotoxic T lymphocyte-associated antigen 4 (CTLA4) gene has an A→G single nucleotide polymorphism (SNP) in the 3′ untranslated region (rs3087243), that is associated with increased risk of RA in several populations,20–22 although the OR is much more modest (OR 1.20). In a large replication study CTLA4 was more strongly associated with CCP+ RA.4
Phenotyping of disease subgroups in rheumatoid arthritis (RA), such CCP status, is important in studies of RA genetic and environmental epidemiology.17 23–25 Environmental risk factors for RA differ in CCP+ and CCP– subgroups.25 HLA-SE demonstrates a significant gene-environment interaction with cigarette smoking for susceptibility to CCP+ RA23 26 and for CCP antibody status in RA24 but not for CCP– RA.
Studies of genetic predictors of individual RA phenotypes have suggested associations between several genes and autoantibody status (CCP, rheumatoid factor (RF)), erosive disease and age at onset of RA.4 16 17 23 27–30 Several prior studies have suggested that the association between HLA and erosions is due solely to CCP status; PTPN22 and CTLA4 have not been extensively studied for these phenotypes. We sought to test whether these genetic factors are associated radiographic erosions and early age of diagnosis using data from a large observational RA cohort. Thus, our goals were to (a) replicate association of HLA, PTPN22, CTLA4 with CCP phenotype; (b) determine if genotypes predicted radiographic erosions independent of the CCP association; and (c) to study whether age of diagnosis is associated with genetic variation. We performed conventional association testing as well as novel causal pathway analysis as proposed by Cooper31 and Li32 to test the hypothesis that genetic risk factors and CCP are in the causal pathway in predicting erosions.
PATIENTS AND METHODS
BRASS (Brigham Rheumatoid Arthritis Sequential Study) is a prospective observational study of 968 RA patients receiving care at the Brigham and Women’s Hospital, Boston, Massachusetts, USA. The goals of the study are: (1) to determine and validate biomarkers that predict drug response and toxicity in RA, (2) to determine and validate biomarkers that predict disease activity and prognosis in RA, and (3) to evaluate the natural history of treated RA by measuring clinical, functional and economic outcomes. Baseline evaluation includes demographic and clinical information, assessment of functional status, disease activity, comorbidity, laboratory testing and hand radiographs. At a physical exam, including joint examination, assessment of pain and disease activity by medical examiner and the patient were collected at baseline and yearly. Samples of blood for immunophenoptyping, including C reactive protein (CRP), cytokines, chemokines, and rheumatoid factor (RF), anticyclic citrullinated peptide (CCP) as well as blood specimens for DNA/RNA testing were collected and stored at baseline and yearly. During follow-up, patients were mailed a self-administered questionnaire every 6 months to collect information on disease severity, functional status, resource utilisation, level of fatigue, employment status, medications, adverse events and intercurrent health events. For hand radiographs, posteroanterior (PA) views of the hands are performed at baseline and years 3 and 5 during the study. This analysis was limited to the baseline radiographs. The study was approved by the Partners Institutional Review Committee.
Blood was collected at the baseline visit for genotyping. Samples were genotyped for HLA-SE alleles by low-resolution genotyping, and for PTPN22, and CTLA4 (CT60 allele) by Sequenom (Sequenom, San Diego, California) genotyping. HLA-SE alleles 01, 04, 10, and 14 were considered positive. Rheumatoid factor testing was performed by immunoturbidimetric technique on the Cobas Integra 700 analyser (Roche Diagnostics, Indianapolis, Indiana, USA), using reagents and calibrators from Roche. Anticyclic citrullinated peptide antibodies (CCP) were measured using a second generation ELISA assay (Inova Diagnostics, Inc., San Diego, California) with a titre of >20 considered as positive. Radiograph reports were reviewed by a study rheumatologist for evidence of erosions and coded as erosion present or absent.
Association between genotypes and dichotomous phenotypes for CCP, RF and erosive RA at baseline were assessed with logistic regression models adjusting for age, sex, and disease duration. Association between genotype and age at diagnosis of RA phenotype was assessed with general linear models adjusting for sex.
Since statistical correlation does not imply a causal relationship, we performed causal pathway analysis by the method described by Li et al.32 In brief, Cooper’s Local Causal Discovery algorithm (Cooper’s LCD) was used to explore the potential causal pathway between genotypes and phenotypes (box 1).31 Each causal pathway analysis tested three variables: (x) genotype, (y) phenotype, and (z) phenotype. From prior knowledge we know the genotype (x), for example, HLA-SE genotype, cannot be caused by any intermediate phenotypes (y, z), for example, RF and CCP antibody status. If the pairwise unconditional correlations exists between the three variables, but the genotype (x) and one phenotype (z) is un-correlated conditional on the other phenotype (y), only one causal path can be derived from x→y→z (box 1). We conducted causal pathway analyses of the association between HLA-SE (x), CCP (y), and RF (z), and between HLA-SE (x), CCP (y), and erosion (z) phenotype.
The BRASS research study began enrollment in March 2003, and has enrolled 968 patients to date. Genotype data and autoantibody status were available for 728 subjects at the time of this analysis. For this analysis, we included only Caucasian subjects, resulting in a sample of 689 Caucasian RA patients. Among these 689 patients, mean (SD) age is 58.0 (13.8) years, mean disease duration 15.4 (12.8) years, 110 (15.9%) are recent diagnosis RA, defined as <2 years disease duration, 560 (81.2%) are female of whom 179 (32%) are in the premenopausal age range (age<51). Education level is 21.5% with high school or less and 78.5% with some college education. At baseline, 323 (47%) of patients report starting a new therapy for RA within the prior 12 months (table 1). Baseline radiographs demonstrated presence of RA erosions in 374/627 (59.7%) of subjects with radiographs. There were 61.8% RF positive subjects, and 66.5% CCP positive subjects. Mean age at diagnosis of RA was 42.58 (15.1). Genotype frequencies were similar to other RA cohorts (table 2).4
We first attempted to replicate the finding that alleles within three genes, HLA-DRB1, PTPN22 and CTLA4, are associated with CCP+ RA. We created contingency tables for genotypes dichotomised by presence of any HLA-SE (single copy or double copy), the T allele of PTPN22 (single or double copy), the G allele of CTLA4 (single or double copy) and presence/absence of CCP and RF phenotypes. We tested for significance using logistic regression models assuming a multiplicative model for genotype, and adjusting for age, sex, and disease duration (table 3). Presence of any HLA-SE was strongly associated with CCP+ phenotype (OR 3.05, 95% CI 2.19–4.25, p = 2.9×10–9) and RF+ phenotype (OR 2.53, 95% CI 1.83–3.5, p = 4.2×10–7); presence of any PTPN22 T allele was associated with CCP+ phenotype (OR 1.81, 95% CI 1.24–2.66, p = 0.006) and RF+ phenotype (OR 1.84, 95% CI 1.27–2.66, p = 0.002); and CTLA4 was not associated with CCP (OR 1.04, 95% CI 0.7–1.55) or RF phenotypes (OR 0.94, 95% CI 0.64–1.39).
We next sought to determine whether these genetic variants were associated with two markers of disease severity, radiographic erosions and age of diagnosis. Presence of any HLA-SE was associated with erosive RA phenotype in unadjusted logistic regression analysis (p = 4.0×10−4), however this association was less strong in a logistic regression model that adjusted for age, sex and disease duration (OR 1.52, 95% CI 1.01–2.17, p = 0.02) (table 4). Presence of any PTPN22 T allele was not associated with erosive phenotype (OR 1.14, 95% CI 0.77–1.71), nor was CTLA4 (OR 1.34, 95% CI 0.87–2.15) (table 4).
Using general linear regression models for genotype as a predictor of age at diagnosis of RA, adjusted for sex, presence of any HLA-SE was on average associated with 3.6 years earlier age at diagnosis of RA compared with absence of HLA-SE (41.3 vs 44.9 years, p = 0.002) (table 5). PTPN22 was on average associated with 4.2 years earlier age at diagnosis of RA (39.5 vs 43.6 p = 0.002) with the earliest age at diagnosis in those with the TT genotype (37.8 years). Adjusting for sex slightly attenuated these relationships. CTLA4 genotypes were not associated with age at diagnosis of RA in this dataset.
We conducted two causal pathway analyses, adapted from Li et al,32 and illustrated in box 1. We asked whether a genetic variant (HLA-SE) contributed to RF phenotype, independent of CCP status (causal pathway 1) as a replication of the Li et al analysis. We also asked whether HLA-SE contributed to erosion phenotype, independent of CCP status (causal pathway 2).
Conditional analysis of causal pathway 1, of the association between HLA-SE (x), CCP (y), and RF (z) demonstrated that HLA-SE (x) and RF (z) are not associated when conditioning on CCP (y) (table 7). These results are similar to those shown in the causal pathway analysis by Li et al.32 Therefore the evidence supports a causal pathway from HLA-SE→CCP→RF, but not directly from HLA-SE→RF.
Conditional analysis of causal pathway 2, of the association between HLA-SE (x), CCP (y), and erosion (z) demonstrated that HLA-SE (x) and erosion (z) are not associated when conditioning on CCP (y) (table 8). Therefore the evidence supports a causal pathway from HLA-SE→CCP→ erosion, but not directly from HLA-SE→erosion. Since the analysis presented in table 7 demonstrated that SE (x) and RF(y) are not associated when conditioning on CCP, we did not test for a causal pathway from SE→RF→erosion.
In this large observational RA cohort, we demonstrated strong associations between two RA genetic risk factors (HLA-SE, PTPN22) and RA phenotypes. HLA-SE was strongly associated with CCP, RF, and erosive phenotypes, even after adjusting for age, sex, and disease duration. We demonstrated that PTPN22 was strongly associated with CCP and RF phenotypes, but not with erosive RA. We were unable to show any genotype-phenotype associations for CTLA-4, perhaps due to limited power as the published OR for susceptibility for CTLA-4 is 1.2 whereas for PTPN22 is 1.75 and for HLA-SE is 3.0. We found strong correlations between HLA-SE and the CCP and RF phenotypes, however, the HLA-SE association with the CCP phenotype was stronger than for RF phenotype. In this clinical cohort, using causal pathway analysis we replicated prior findings from the North American Rheumatoid Arthritis Consortium (NARAC),18 32 and the Leiden Early Arthritis Clinic (EAC)17 that suggest that HLA-SE is causally associated with the CCP phenotype, and CCP is causally associated with the RF phenotype, but HLA-SE is not causally associated with the RF phenotype.
Our findings for HLA-SE and erosions in unconditional analyses are consistent with a meta-analysis of 30 studies published from prospective cohorts and cross-sectional studies involving 3240 RA patients demonstrating an odds ratio of 2.0 (95% CI 1.8–2.2) for association of HLA-SE and erosions.27 Our causal pathway analysis extends these observations to study the role of CCP antibodies. In our cross-sectional study, CCP is strongly associated with erosions, but HLA-SE is not associated with erosions, after conditioning on CCP status, suggesting that there is no causal pathway directly from HLA-SE to erosions. Our causal pathway findings are consistent with two studies from the Leiden EAC.17 30 Among RA patients in the EAC prospective cohort there were large differences in rates of erosion progression between CCP+RA and CCP-RA, with an additional effect of the HLA-SE on erosion progression only among the CCP+ group but no association for HLA-SE with erosions among the CCP– group.17 Analysis of the undifferentiated arthritis patients in the EAC cohort followed prospectively for the development of RA demonstrated that HLA-SE alleles are primarily a risk factor for development of anti-CCP antibodies and are not an independent risk factor for progression to RA, after adjusting for CCP status.30
We found evidence of 3.6 years earlier age of diagnosis with HLA-SE in this large clinical cohort with a mean disease duration of 15 years. This work replicates other studies in which HLA-SE was associated with 6 years earlier age at onset in a seropositive RA cohort with <15 months of disease duration in the US28 as well as in a Korean population in which the specific allele, HLA-DRB1*0405, was associated with 4 years earlier onset.33 We found evidence for 4.2 years earlier diagnosis of RA for PTPN22, which is similar to the findings of 2 years earlier age of onset in samples from North America and Sweden.4 In a population from the UK, PTPN22 was associated with 8.6 years earlier onset in homozygotes, and 4.7 years earlier onset in heterozygotes.34
The BRASS cohort is a well-educated, primarily Caucasian population with long disease duration, treated at a tertiary referral centre in the United States, all factors that may limit generalisability. Although disease duration is similar to that in the North American Rheumatoid Arthritis Consortium (NARAC), the rates of seropositivity and erosive disease are lower, since by design NARAC recruited more severe RA patients.32 However age, disease duration, rates of erosive disease, and seropositivity are similar to those reported in the National Databank study of >14 000 patients enrolled from rheumatology practices across the US,35 36 suggesting that BRASS subjects are more similar to RA patients seen in the community. The causal pathway approach is a statistical method that requires a number of assumptions, as discussed in detail by Cooper et al,31 and it is possible that our dataset does not meet all of the assumptions. The approach does allow for the presence of potential confounders, as long as the variable “x”, in this case genotype, is not caused by the confounder.
In conclusion, genotype-phenotype analysis of a large clinical cohort demonstrates the importance of considering phenotypes when studying genetic predictors. We replicated association of HLA-SE, and PTPN22 with CCP+ RA compared with CCP– RA as well as associations with earlier diagnosis of RA, but were unable to demonstrate any associations for CTLA-4. Of the three genes studied, only HLA-SE was associated with radiographic erosions but this association was not independent of CCP status. The novel causal pathway analyses confirms prior studies that demonstrate the importance of antibodies to CCP in the pathogenesis of joint damage in RA and provides support to recent calls17 23 for considering CCP+ RA as a separate clinical entity within the overall RA phenotype.
We wish to thank the BRASS participants, the rheumatologists in the Robert B Brigham Arthritis Center for their extensive time and efforts on behalf of the study, and our talented team of research assistants.
Funding: EWK is supported by NIH grants R01 AR49880, and K24 AR0524-01. RMP is supported by NIH K08 A155314. The BRASS cohort and co-authors JC, RJG, NEM, EI, MEW, and NAS are supported by Millenium Pharmaceuticals. AP was formerly supported by Millenium Pharmaceuticals and is currently supported by Amgen. RR was formerly supported by Millenium Pharmaceuticals and is currently supported by Biogen/IDEC.
Competing interests: None declared.