Validation of the 28-joint Disease Activity Score (DAS28) and European League Against Rheumatism response criteria based on C-reactive protein against disease progression in patients with rheumatoid arthritis, and comparison with the DAS28 based on erythrocyte sedimentation rate
- 1Department of Epidemiology and Community Medicine, University of Ottawa, Ottawa, Canada
- 2Bristol-Myers Squibb, Princeton, New Jersey, USA
- 3Paris-Descartes University, Medicine Faculty and UPRES-EA 4058, AP-HP, Cochin Hospital, Paris, France
- 4University of Colorado, Denver, Colorado, USA
- 5Second Department of Medicine, Hietzing Hospital, Vienna, Austria
- 6Department of Rheumatology, Medical University of Vienna, Vienna, Austria
- 7University Medical Centre, Nijmegen, The Netherlands
- Professor G Wells, Department of Epidemiology and Community Medicine, University of Ottawa, 451 Smyth Road, Ottawa, Ontario, K1H 8M5, Canada;
- Accepted 9 April 2008
- Published Online First 19 May 2008
Objective: To validate and compare the definition of the Disease Activity Score 28 based on C-reactive protein (DAS28 (CRP)) to the definition based on erythrocyte sedimentation rate (ESR).
Methods: Data were analysed from two randomised, double-blind, placebo-controlled trials of abatacept of 6-month and 12-month duration in patients with rheumatoid arthritis. European League Against Rheumatism (EULAR) response criteria and the proportion of patients in remission (DAS28 <2.6) based on the two DAS28 definitions were examined. Trends in radiographic progression (erosion score, joint space narrowing score and total score) and physical function (Health Assessment Questionnaire Disability Index (HAQ-DI)) across the EULAR responder states (none, moderate and good) were analysed.
Results: There was general agreement in determining the EULAR responder state using both DAS28 definitions (κ = 0.80, 95% CI 0.76 to 0.83). Overall, there was 82.4% agreement on the EULAR response criteria; when disagreements occurred, the DAS28 (CRP) yielded a better EULAR response more often then DAS28 (ESR) (12.6% vs 4.9%, respectively). There was also agreement in determining remission: κ = 0.69 (95% CI 0.60 to 0.78). Radiographic progression decreased in patients treated with abatacept across EULAR states (from none to moderate to good) based on both definitions. For patients treated with placebo, the trend was not as pronounced, with radiographic scores higher for moderate vs non-responders. For physical function, similar trends were observed across the EULAR states for both DAS28 definitions.
Conclusions: The DAS28 (CRP) has been validated against radiographic progression and physical function. While the DAS28 (CRP) yielded a better EULAR response more often than the DAS28 (ESR), the validation profile was similar to the DAS28 (ESR), indicating that both measures are useful for assessing disease activity in patients with rheumatoid arthritis.
Agreement on response criteria in rheumatoid arthritis (RA) has allowed better standardisation and interpretation of clinical trial reports. In particular, American College of Rheumatology (ACR) criteria1 and the Disease Activity Score (DAS)2 are widely used. The DAS index combines information relating to the number of swollen and tender joints, in addition to a measure of general health, and the acute phase response. The DAS283 is based on a count of 28 swollen and tender joints, with a score ranging from 0 to 9.4,4 and can be used to objectively evaluate a patient’s response to treatment. An absolute level of disease activity can be selected as a clinically meaningful goal for therapeutic intervention; with a value of ⩽3.2 defined as the threshold for a low disease activity state and <2.6 as the threshold for remission.5 Alternatively, the European League Against Rheumatism (EULAR) response criteria combine the DAS28 score at the time of evaluation with the change in DAS28 score between two time points, and enable the user to define improvement or response to treatment.5 The thresholds for low disease activity and remission and the EULAR response criteria provide a standardised guide on how to interpret the DAS28 scores.4 6 7 The DAS28 based on erythrocyte sedimentation rate (DAS28 (ESR)) has been extensively validated for its use in clinical trials in combination with the EULAR response criteria.3 5 8–11
More recently, an alternative formulation of the DAS28 based on C-reactive protein (DAS28 (CRP)) has been proposed and developed.12 At the present time, the DAS28 (CRP) is not as well established as the DAS28 (ESR), and its validity is currently only inferred by comparison with the DAS28 (ESR). A comparison of the two DAS28 definitions and a formal validation of the DAS28 (CRP) is necessary so that the clinician or patient will have the same confidence using and interpreting the DAS28 (CRP) that they have come to expect when using the DAS28 (ESR). To properly evaluate the validity of the DAS28 (CRP), a well controlled study with assessment of radiographic progression and physical function is needed in a setting in which a change in patient clinical status has occurred. The ATTAIN (Abatacept Trial in Treatment of Anti-tumor necrosis factor (TNF) INadequate responders) and AIM (Abatacept in Inadequate responders to Methotrexate (MTX)) trials of the selective costimulation modulator abatacept (which modulates the CD80/CD86:CD28 signal required for full T cell activation) provide such an opportunity.13 14 Both studies were well controlled, and the measures required for validation of the DAS28 (CRP) were recorded. Significant improvements in clinical measures of disease activity, physical function and health-related quality of life were observed with active treatment vs placebo in both trials. In addition, radiographic assessments were taken during the AIM trial, and patients treated with abatacept demonstrated significant reductions in structural progression compared with patients treated with placebo.14 Using data from these two trials, the objective of this investigation was to: (1) compare the DAS28 (CRP) with the DAS28 (ESR) by crossclassifying EULAR response states and the proportion of patients achieving DAS28-defined remission; and (2) to validate the DAS28 (CRP) against assessments of radiographic progression and physical function.
Data from two phase III, multicentre, randomised, double-blind, placebo-controlled trials of abatacept in patients with active RA were used in this analysis. In the 6-month ATTAIN trial, patients with an inadequate response to anti-TNF therapy received abatacept or placebo, plus ⩾1 disease-modifying antirheumatic drug.13 In the 12-month AIM trial, patients with an inadequate response to MTX received abatacept or placebo, plus MTX.14
Data are presented for all patients in the ATTAIN and AIM trials for whom ESR and CRP measurements were available.
Measures of disease activity
Efficacy assessments were taken on study visit days, prior to infusion. The DAS28 considers 28 tender and swollen joint counts, general health (GH; patient assessment of disease activity using a 100 mm visual analogue scale (VAS) with 0 = best, 100 = worst), plus levels of an acute phase reactant (either ESR (mm/h) or CRP (mg/litre)). DAS28 values were calculated as follows: DAS28 (CRP) = 0.56*√(TJC28) +0.28*√(SJC28)+0.014*GH+0.36*ln(CRP+1)+0.96; DAS28 (ESR) = 0.56*√(TJC28)+0.28*√(SJC28)+0.014*GH+0.70*ln(ESR), where TJC = tender joint count and SJC = swollen joint count.
EULAR response states were classified as follows: good responders were patients with an improvement of >1.2 and a present score of ⩽3.2; moderate responders were patients with an improvement of >0.6 to ⩽1.2 and a present score of ⩽5.1, or an improvement of >1.2 and a present score of >3.2; non-responders were any patients with an improvement of ⩽0.6, or patients with an improvement of >0.6 to ⩽1.2 and a present score of >5.1.4 DAS28-defined remission was classified as a score of <2.6.
Physical function was measured at baseline and every 3 months using the Health Assessment Questionnaire Disability Index (HAQ-DI).15 In the AIM trial, joints in the hands, wrists and feet were assessed radiographically at baseline and 1 year for changes in erosion score (ES), joint space narrowing (JSN) score and total score (TS) using the Genant-modified Sharp scoring system.16–19
Validation analyses for the disease activity score 28 (CRP)
To assess the extent of agreement between the two DAS28 definitions, the EULAR response criteria based on each definition were calculated by treatment group and for pooled treatment groups, with crossclassification for each of the datasets. The proportion of patients who achieved remission according to the two DAS28 definitions was calculated for the pooled treatment groups. Κ coefficients with quadratic weights were calculated and Bland–Altman plots20 were constructed.
To examine the extent to which the DAS28 (CRP) reflects structural damage and physical function, trends in radiographic progression and HAQ-DI scores across the EULAR responder states based on the DAS28 (CRP) and the DAS28 (ESR) were assessed, and trends based on the two DAS28 definitions were compared.
Sensitivity to change
To examine the ability of the two DAS28 definitions to detect a treatment effect, the following measures were assessed:21 treatment difference, relative percentage improvement, standardised response mean (SRM) and relative efficiency in relation to the tender joint count.
Of the 258 and 133 patients randomised and treated with abatacept or placebo, respectively, in the ATTAIN trial, 171 patients treated with abatacept and 75 patients treated with placebo had ESR and CRP measurements available for analysis. Of the 433 and 219 patients randomised and treated with abatacept or placebo, respectively, in the AIM trial, 351 patients treated with abatacept and 155 patients treated with placebo had ESR and CRP measurements available for analysis. Baseline demographics and clinical characteristics were generally similar for the abatacept and placebo groups in each study (table 1). Baseline demographics and clinical characteristics were comparable between this subset of patients, and the subset of patients who did not have DAS28 (CRP) and DAS28 (ESR) available (data not shown).
Table 2 shows the crossclassification of the EULAR response criteria based on DAS28 (ESR) and DAS28 (CRP) at 6 months for patients in both treatment groups for the ATTAIN and AIM trials combined. There was general agreement in classifying patients as none, moderate and good EULAR responders using the two DAS28 definitions, with a κ (95% CI) of 0.80 (0.76, 0.83) indicating good agreement. The main diagonal numbers in table 2 indicate where the EULAR response criteria based on DAS28 (ESR) and DAS28 (CRP) are in agreement. Overall there was an 82.4% agreement between the EULAR response criteria based on the two definitions (table 2). When disagreements occurred, the DAS28 (CRP) yielded a better EULAR response state more often than the DAS28 (ESR) (12.7% vs 4.9%, respectively). In one instance, a patient was classified as being a non-responder with DAS28 (CRP) but a good responder with DAS28 (ESR). This patient had a CRP value of 22.4 mg/dl and an ESR value of 4.0 mm/h, corresponding to a DAS28 (CRP) score of 4.3 and a DAS28 (ESR) score of 2.4. The patient subsequently discontinued from the ATTAIN trial due to cholangiocarcinoma, which was the probable cause of the elevated CRP levels.
Similar patterns were observed when the ATTAIN and AIM trials were considered separately, and when treatment groups were considered combined or separately (data not shown). A total of 50/752 (6.6%) patients were classified as having achieved remission according to both DAS28 definitions. A total of 83 (11.0%) patients were classified as having achieved remission with the DAS28 (CRP) and 56 (7.4%) classified as having achieved remission with the DAS28 (ESR). The κ coefficient (95% CI) was 0.69 (0.60 to 0.78), indicating good agreement. A Bland–Altman plot (fig 1) shows a high degree of agreement between the two DAS28 definitions with most observations of the difference between DAS28 (ESR) and DAS28 (CRP) lying between the mean ±2SD (represented in the figure by central and outer horizontal dashed lines, respectively). A total of 3.3% of values fell outside 2SD above the mean difference and 1.5% fell outside 2SD below the mean difference, indicating that when disagreement of two definitions occurred, there were more instances in which DAS28 (CRP) was lower than DAS28 (ESR) compared with instances in which it was higher. The mean difference between DAS28 (CRP) and DAS28 (ESR) is −0.38, indicating a relatively small tendency for the DAS28 (ESR) to classify patients as having a higher score.
Radiographic progression was assessed in the AIM trial.14 For both definitions, there were several instances in which radiographic progression did not improve in a linear fashion across the EULAR responder states. This was particularly apparent in the placebo group, where ES, JSN score and TS improved in a quadratic manner with a sharp improvement for good vs moderate responders but a more modest improvement for moderate vs non-responders, and in the case of the DAS28 (CRP) definition a slight deterioration was observed for moderate vs non-responders (table 3). Similar patterns were observed for ES in the abatacept group and in both groups combined. The JSN score in the abatacept group using the DAS28 (ESR) definition followed a more linear pattern across the EULAR responder states, which in turn led to a linear pattern for the JSN scores in both groups combined and for TS in the abatacept and combined groups. For the DAS28 (CRP) definition, the JSN scores and TS followed the quadratic pattern.
Table 4 and fig 2 show data from the ATTAIN and AIM trials, illustrating the mean improvements from baseline in HAQ-DI at 6 months in patients who were EULAR good, moderate or non-responders, based on the DAS28 (CRP) or DAS28 (ESR) definition. For both definitions, function improved in a linear fashion across the EULAR responder states, and there was agreement on the changes in HAQ-DI scores across the states. For the good and moderate states, there was improvement from baseline in HAQ-DI, and this improvement was more marked in good responders compared with moderate responders. For EULAR non-responders, HAQ-DI scores showed little improvement at 6 months. Similar results were observed for patients in the AIM trial at 1 year. For the combined (abatacept and placebo) group, the mean change from baseline in HAQ-DI was −0.92 and −1.03 for good responders −0.62 and −0.64 for moderate responders and −0.27 and −0.27 for non-responders, for DAS28 definitions based on CRP and ESR, respectively.
Measures to assess the sensitivity of the two DAS28 definitions demonstrate that both have a comparable ability to detect a treatment effect. For the DAS28 (CRP) and DAS28 (ESR), respectively, the treatment difference was −18.83 vs −13.22; the percentage improvement was −14.42 vs −7.44; the SRM was −0.31 vs −0.31 and the relative efficiency was 1.93 vs 1.97.
Using CRP for calculation of the DAS28 is an attractive alternative to ESR for a number of reasons. Firstly, CRP measurements are routinely used in clinical practice, and are often available in circumstances when ESR measurements are not. CRP levels are more sensitive to short-term changes in disease activity,22 whereas ESR can be influenced by a number of unrelated factors, such as age, gender or plasma proteins. Laboratory tests used to calculate CRP are faster than those used to measure ESR, and measurements can be standardised in a central laboratory for multicentre clinical trials.7 There is currently less clinical experience using DAS28 (CRP), and a formal validation study with respect to radiographic progression and functional assessment is needed.
Using data from two trials of abatacept, we have compared the DAS28 (CRP) with the DAS28 (ESR), and validated the DAS28 (CRP) against assessments of radiographic progression and physical function. Most patients were classified as having the same EULAR state regardless of which DAS28 definition is used. Despite this, several discrepancies were observed. Firstly, some patients (154/752; 20.5%) were classified as having a lower present DAS28 category when using the DAS28 (CRP) vs the DAS28 (ESR). In 76 out of these 154 cases, this resulted in patients being classified as being in a better EULAR state when measured with DAS28 (CRP). Secondly, certain patients were classified as having either large or moderate improvement, depending on which DAS28 definition was used; this type of discrepancy occurred at a similar frequency with both definitions, and would not be expected to result in more favourable results with one definition vs the other. Although the current analysis detected discrepancies in 17.6% of patients, these were moderate in magnitude; overall, there is general agreement between the two measures. The 2 DAS28 definitions also demonstrated agreement in classifying DAS28-defined remission, with the majority of discrepancies (33 cases) resulting in patients being classified as in remission with DAS28 (CRP) but not DAS28 (ESR). Considering the DAS28 (ESR) definition as the external comparison, there is support for the criterion validity of the DAS28 (CRP).
When a discrepancy occurs, the tendency is for the DAS28 (CRP) definition to yield a better response state. This has also been found in the work by Inoue et al23 and Matsui et al.24 To overcome this discrepancy, transforming the DAS28 (CRP) to more closely conform to the DAS28 (ESR) is an enticing idea. However, the concern is the generalisability of the transformation across different patient groups. Inoue et al23 derived an adjustment factor based on regressing DAS28 (ESR) on DAS28 (CRP) (ie, DAS28 (ESR) = 1.01 DAS28 (CRP)+0.590). Applying this adjustment factor to the current dataset led to a larger percentage being classified with a worse response state using DAS28 (CRP) (2.9% classified as better and 12.9% classified worse, compared with the unadjusted results of 12.7% and 4.9%, respectively). Other approaches can be considered, for example, an adjustment factor could be based on regressing ln(ESR) on ln(CRP+1), which is the essential difference between the two DAS28 definitions, and using this adjustment in the DAS28 (ESR) formula. Applying this adjustment, a more equitable division resulted, with 4.5% in a better and 8.6% in a worse state; again, the generalisability of the transformation may be an issue.
The construct validity of the DAS28 (CRP) was evaluated by examining the trends in radiographic and functional progression across the EULAR responder states based on the DAS28 (CRP). For both definitions of the DAS28 there was generally good agreement between reduced radiographic progression and EULAR responses, and classification as a good EULAR responder was predictive of a lower rate of radiographic progression in patients treated with abatacept. However, some deviations from this trend were observed. For the DAS28 (ESR) definition, radiographic scores were generally higher for the moderate responders than for non-responders in the placebo group. This was also true for the DAS28 (CRP) definition, in the abatacept and placebo groups. Changes in HAQ-DI scores showed very similar linear trends for both definitions, with improvements in HAQ-DI across the EULAR responder states from none to moderate to good. These trends held for patients treated with abatacept and placebo, and demonstrate that improvements in the DAS28 (CRP) and DAS28 (ESR) are associated with improvements in physical function.
The level of sensitivity of the DAS28 (CRP) and DAS28 (ESR) is an important issue to consider, and differences could potentially contribute to discrepancies observed between the two measures. However, as the SRMs were identical for the two measures, and the relative efficiencies were comparable, suggesting the sensitivity to change of the DAS28 (CRP) and DAS28 (ESR) are very similar.
The analyses described here were performed only on patients for whom ESR and CRP measurements were available. The majority of patients who did not have both measurements had missing ESR data. This was due to technical issues associated with the fact that ESR assessments were performed locally on standard kits, whereas CRP was measured by a central laboratory, allowing electronic data transfer to the main database. Although only a subset of patients from each trial had CRP and ESR measurements, baseline demographics and clinical characteristics for this subset were comparable with those that did not have both measurements, suggesting that they were likely to be representative of the full patient population.
Two key findings have emerged from the current investigation. Firstly, the DAS28 (CRP) has been validated with respect to functional and radiographic progression, and the validation profile was similar to that based on ESR. Secondly, there was a tendency for the DAS28 (CRP) to yield a lower score and in some instances a better EULAR response (12.7% vs 4.9% for the CRP and ESR, respectively). Often the two definitions were generally comparable, and formulating a conclusion on the EULAR response state or remission status based on the DAS28 (CRP) definition provides a reasonable basis for assessing a patient, provided the user is aware of the tendency of the DAS28 (CRP) to yield a better response than the DAS28 (ESR). Currently, the same cut-off points for EULAR response states originally derived for the DAS28 (ESR) are also applied for the DAS28 (CRP). To increase the level of agreement between the two DAS28 formulations, one solution would be to derive a new set of cut-off points tailored for use with DAS28 (CRP). The development of a robust approach leading to set of cut-off points that can be generally applied across populations may prove difficult, as illustrated when the translation derived by Inoue et al23 and Matsui et al24 is applied to the current dataset. Exploration of a robust method that will translate DAS28 (CRP) to DAS28 (ESR) and the derivation of a corresponding set of cut-off points for the disease activity state for the CRP formulation are needed before a transformation of the DAS28 (CRP) definition is advocated. An appropriate research agenda encompassing the assembly of large and diverse datasets, and an analysis plan involving regression based models will help to provide further insight into the potential for achieving a better alignment of the DAS28 (CRP)-defined and the DAS28 (ESR)-defined EULAR response states.
In conclusion, we have provided a validation of the DAS28 (CRP) assessment and corresponding EULAR response states against radiographic progression and physical function. While the DAS28 (CRP) yielded a better EULAR response more often than the DAS28 (ESR), the validation profile was similar to that of the DAS28 (ESR), indicating that both definitions of the DAS28 criteria can be used as benchmarks to assess patient improvement and treatment effect, and can aid in the description and interpretation of changes in disease activity in patients with RA.
The authors would like to thank Helen Clarke, Medicus International, for her editorial assistance.
Competing interests: GW has received consultancies/speaking fees/honoraria from Bristol-Myers Squibb; J-CB is an employee of Bristol-Myers Squibb and has stock options; JT is an employee of Bristol-Myers Squibb and has stock options; MD has received consultancies/speaking fees/honoraria from Bristol-Myers Squibb, Abbott, Wyeth, Centocor and Schering Plough; MS has received consultancies/speaking fees/honoraria from Bristol-Myers Squibb and Centocor; JS has received consultancies/speaking fees/honoraria from Bristol-Myers Squibb; DA has received consultancies/speaking fees/honoraria from Bristol-Myers Squibb; PICMvR has received consultancies/speaking fees/honoraria from Bristol-Myers Squibb, Abbott, Wyeth, Novartis and Schering Plough.
Funding: This study is based upon clinical trial results from studies sponsored by Bristol-Myers Squibb, Princeton, New Jersey, USA. The authors had responsibility for the analysis and data interpretation, and all authors were involved in the drafting and critical revision of the article; in addition, all authors have seen and approved the final article for submission. This study was supported in part by an unrestricted research grant-in-aid from Bristol-Myers Squibb.
Ethics approval: Ethics approval was obtained.
This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.