Article Text

Download PDFPDF

Use of digital x ray radiogrammetry in the assessment of joint damage in rheumatoid arthritis
  1. W B Jawaid1,
  2. D Crosbie1,
  3. J Shotton2,
  4. D M Reid2,
  5. A Stewart2
  1. 1Department of Rheumatology, NHS Grampian, Ward 3, Aberdeen Royal Infirmary, Aberdeen, AB25 2ZN, UK
  2. 2Department of Medicine and Therapeutics, University of Aberdeen, Aberdeen, UK
  1. Correspondence to:
    Professor D M Reid
    Department of Medicine and Therapeutics, University of Aberdeen, Medical School, Foresterhill, Aberdeen, AB25 2ZD, UK; d.m.reid{at}


Objective: To compare digital x ray radiogrammetry (DXR) with manual radiography for assessing bone loss in RA and examine the relationship of the scores obtained with other disease indices.

Methods: 225 consecutive consenting subjects attending the RA clinic were enrolled. An x ray examination was carried out; demographic details recorded; a self assessment questionnaire completed; blood taken for ESR measurement; and an assessment made by a trained nurse. All x ray films were scored manually using the modified Sharp technique by a single observer; 20 films were rescored by three readers. Films were assessed with the Pronosco X-Posure system, version 2.0. Analysis included χ2 tests, independent t tests, multiple linear regression, and partial correlations, as appropriate. The smallest detectable difference (SDD), coefficient of variation (CV), and coefficient of repeatability (CR) were determined from Bland and Altman plots.

Results: The DXR precision varied: SDD = 0.002–0.9; CV = 0.09–5.9%; CR = 0.002–0.792, but was better than that of the intra- and interobserver Sharp scores: SDD = 73.9; CV = 27.8%; CR = 33.0–47.6. The DXR measurements, bone mineral density (R2 = 0.210), metacarpal index (R2 = 0.222), and cortical thickness (R2 = 0.215), significantly predicted Sharp scores. In women, DXR measurements significantly correlated with modified HAQ scores but with no other disease indices. Sharp scores significantly correlated with assessor’s global assessment, swollen and tender joint counts, pain, HAQ, and DAS28.

Conclusion: DXR measurements are more precise than Sharp scores; both are related to long term disease activity in RA. DXR is simple to use, does not require intensive training, and may identify subjects not responding to standard treatment.

  • BMD, bone mineral density
  • CR, coefficient of repeatability
  • CV, coefficient of variation
  • AGA, assessor’s global assessment
  • BW, bone width
  • CT, cortical thickness
  • DAS28, 28 joint count disease activity score
  • DXA, dual energy x ray absorptiometry
  • DXR, digital x ray radiogrammetry
  • EMS, early morning stiffness
  • ESR, erythrocyte sedimentation rate
  • HAQ, Health Assessment Questionnaire
  • MCI, metacarpal index
  • PGA, patient global assessment
  • POR, porosity
  • RA, rheumatoid arthritis
  • SDD, smallest detectable difference
  • VAS, visual analogue scale
  • rheumatoid arthritis
  • joint erosions
  • hand bone density
  • radiogrammetry

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Many advances have been made in uncovering the pathogenesis of rheumatoid arthritis (RA) and this has led to the introduction of new treatments. Cytokines produced by macrophages, in particular tumour necrosis factor α, have been found in higher concentrations in rheumatoid joints.1,2 This has led to the development of treatments directed against tumour necrosis factor α and its receptors, which have been shown to slow radiographic joint damage, but these are expensive and can have serious side effects.3–6 It would be useful if these and other disease modifying drugs could be specifically targeted at an early stage of disease to those subjects who are unresponsive to safer and cheaper drugs and have continuing joint damage.

Current techniques of assessing long term disease progression in RA, although useful in comparing drug treatments in clinically controlled trials, are not very effective in assessing RA in individual subjects. Accordingly, they are rarely used as part of standard day to day clinical practice by rheumatologists.7

It is necessary to use various imaging techniques to assess structural damage. Although there is interest in using magnetic resonance imaging8 and ultrasound imaging 9,10 to assess joint damage, including cartilage loss and erosion counts, the most widely used methods are a variety of plain film x ray scoring techniques. The Sharp scoring method11 is such a technique and in its modified forms, such as that suggested by van der Heijde,12 is commonly used in clinical trials to assess erosions and joint space narrowing for joints of both the hands and feet. Because erosive damage of the periarticular bones is believed to be largely irreversible, the use of such an end point clinically might be considered to have limited benefit, with treatment always lagging behind disease progression. However, a measurement technique that showed a strong relationship with joint damage, which is in itself related to long term functional status, might have real clinical value.

Periarticular bone loss is the earliest radiological feature of RA. If quantitative assessment of hand bone mass is used as a surrogate, bone loss can be seen to occur early in the disease and predate erosive damage.13,14 Peripheral bone mass can be quantified by a number of techniques in RA including dual energy x ray absorptiometry (DXA),13–15 quantitative ultrasound,9,10,16 and possibly, magnetic resonance imaging.8 DXA is most widely used for estimating in vivo total hand bone mineral density (BMD) but may also be used to assess BMD specifically at the metacarpal joints.14 In the past few years, an improved method of radiogrammetry has been introduced using digitised plain hand radiographs17 to measure bone density, metacarpal index, cortical thickness, and porosity with high precision.18

In this study we examined the ability of digital x ray radiogrammetry (DXR) to assess bone loss in RA, compared the results with a manual plain radiograph scoring technique—the modified Sharp score19—and examined the relationship of both methods with other indices of disease activity and progression in RA.


The study was carried out at the Osteoporosis Research Unit, Aberdeen. The Grampian research ethics committee granted ethical approval.

Subjects attending the RA clinic were provided with details of the study, which included an information leaflet. Consecutive, consenting subjects were enrolled in the study. Subjects also underwent an x ray examination of their hands if one had not been carried out in the previous 2 years. JS collected the data between May 2001 and July 2002.

Subjects’ demographic details were recorded and a self assessment questionnaire was completed, which included data on the duration of early morning stiffness (EMS); pain, as assessed on a 10 cm horizontal visual analogue scale (VAS); and a patient global assessment (PGA) of disease activity, as assessed on a 10 cm horizontal VAS. A modified Health Assessment Questionnaire (HAQ) was also administered.20 Blood was taken from the subjects and the erythrocyte sedimentation rate (ESR) measured in mm after a 1 hour sample frame. Subject case notes were audited and current medications and dosages, rheumatoid factor positivity, duration of disease, and the presence or absence of erosions noted on the database. Assessments were also carried out by JS, a fully trained research nurse and included an assessor global assessment (AGA) of disease activity on a 10 cm horizontal VAS, and a 28 point tender and/or swollen joint count. If appropriate, the subjects then had a plain film x ray examination of both hands. Only those subjects who had a relevant x ray examination were assessed as part of this study.

Author WBJ scored all the available hand radiographs using the modified Sharp scoring method and the results were recorded. Twenty randomly selected subjects were rescored by WBJ a week after the initial assessment to allow calculation of short term intraobserver variation. Interobserver variation was calculated by examining 20 radiographs randomly selected for rescoring by three observers (DC, DMR, and WBJ).

All the hand radiographs were then assessed using the Pronosco X-Posure system, version 2.0 (Sectra Medical Systems, Sweden).18,21–24 This system provides the following measurements: BMD; porosity (POR); metacarpal index (MCI); cortical thickness (CT); bone width (BW). The Pronosco X-Posure system requires plain x ray films. The x rays films of the subjects in this study showed both hands on the same film. To evaluate radiographs using DXR the radiograph of each hand was separately scanned in and then analysed. BMD was calculated using formulae after assessment of the bone volume per area, assuming a cylindrical bone,18 thus negating the need for a phantom. Figure 1 shows regions of interest.

Figure 1

 Automatic selection of regions of interest by Pronosco.

All data were entered into a spreadsheet and analysed using SPSS 11.0.1 (SPSS Inc, USA) and Microsoft Excel (Microsoft Corp, USA). Tests used included χ2 tests, independent t tests, multiple linear regression analysis, and partial correlation, as appropriate. The precision was estimated by calculating the smallest detectable difference (SDD) and the standardised coefficient of variation (CV). The reproducibility of the repeated measurements was assessed using Bland and Altman plots and the coefficient of repeatability (CR) was calculated. Correlation coefficients were compared using Fisher’s z transformation.


Subject flow and comparison

Figure 2 shows the flow of subjects through the study. 537 subjects were on the original database, but only 225 met the entry requirements. Table 1 compares the demographics and disease activity indices for the current study group and the total study group. We found the HAQ score was significantly lower (p = 0.026) in the study group than in the whole group and disease duration was significantly shorter (p = 0.023), but there were no other significant differences between the groups.

Table 1

 Comparison between demographic and disease activity indices for all subject’s on database and those that were selected for the study, with numbers for each index shown in parentheses

Figure 2

 Flow of subjects through the study.


Sharp scores

The mean (SD) Sharp score for the study group was 79.38 (53.45). The intraobserver CV for the modified Sharp scoring techniques was 7.2%. The intraobserver Bland and Altman plot (fig 3) shows that variation at the extremes of the modified Sharp scores is smaller than the variation at intermediate Sharp scores. The CR was 14.8.

Figure 3

 Bland and Altman plot for the modified Sharp score intraobserver variation as scored by WBJ.

For the interobserver variation the CV was 27.8% and the SDD was calculated at 73.9 units for a mean Sharp score of 94. The CR were as follows; between DC and DMR 33.6 Sharp score units, between WBJ and DMR 33.0 units, and between DC and WBJ 47.6 units. Figure 4 shows an example of an interobserver Bland and Altman plot.

Figure 4

 Bland and Altman plot for the modified Sharp score interobserver variation as scored by DC and DMR.

DXR measurements

Table 2 summarises the intraobserver precision values for the different DXR variables scored on the same 20 radiographs assessed on two separate occasions 2 months apart. Of the DXR measurements, POR appears to be the least precise measurement while bone width and cortical thickness are the most precise. The CR ranged from 0.002 to 0.792 for the DXR measurements.

Table 2

 Precision of DXR measurements

Correlations between Sharp scores and DXR measurements

Table 3 shows the partial correlations between Sharp scores and DXR measurements. In male subjects correlation between DXR-MCI and the modified Sharp score and its component scores appears to be weaker than that in female subjects, although the differences are not significant (p>0.05 for all coefficients). The derived POR and BW did not relate to any aspect of the Sharp index.

Table 3

 Partial correlation analysis between Sharp score and its component scores and DXR measurements, where significant results are shown in bold

Correlation of radiological measures of disease severity with clinical disease activity measurements

Table 4 show the correlation between DXR measurements, Sharp scores, and clinical measures of disease activity. In female subjects for DXR measurements the only significant correlations seen were with HAQ scores (r = −0.218, p = 0.008). In contrast, Sharp scores significantly correlated with AGA, swollen and tender joint counts, VAS pain, HAQ, and 28 joint count disease activity score (DAS28) but with low r values of around 0.17–0.30 (p = 0.043–0.01). Sharp scores correlated slightly better with the HAQ than did DXR (r = +0.298, p<0.005). In contrast, for the smaller number of male patients, DXR measurements showed better correlation with certain disease measures, whereas the Sharp scores did not show significant correlation with any of the clinical measures.

Table 4

 Partial correlation analysis between structural measures and clinical measures, where bold indicates significant results


The precision results clearly indicate that DXR measurements are much more reproducible than Sharp scores, which vary greatly not only between observers but also when rescored by the same observer even a week later. The precision of DXR measurements in this study is better than that reported by Jorgensen et al.18 This discrepancy is probably due to the fact that we are simply measuring noise in the system, because the same radiograph is used each time. Perhaps a truer estimate would be given if two x ray pictures were taken at the same time and used for calculation of precision. The precision of the modified Sharp score was lower than that reported by Sharp et al,25 who found an interobserver CV of 17%. However, this may be because of the limited experience of two of the scorers or may be owing to the long duration of disease, making reading of the radiograph more difficult. A recent comparison of intra- and interobserver variation by Sharp et al has shown figures for the SDD which are very similar to those we achieved.26

Bone loss is an early feature of RA and precedes the permanent and irreversible erosive changes that occur.13,14 The Pronosco device can measure bone loss at the metacarpal joints, which are in the region most commonly affected by RA, and therefore may be able to identify subjects requiring a change in treatment before any irreversible joint damage occurs. It has been well established that BMD is reduced in patients with RA and the degree of loss is related to disease activity. Daragon et al found significantly reduced BMD of the whole hand using DXA at 6 months in patients with RA compared with subjects with other rheumatic diseases.27

This study has shown that both DXR and Sharp score measurements are related more to disease severity than to current disease activity. This has also been shown previously.28 DXR measurements are as good as Sharp scores in predicting HAQ and may even be better in male subjects. Their predictive value may not be so good in women owing to other confounding factors which influence the measurement of bone mass, including menopausal state and pre-disease bone mass status. Of the various DXR measures available, BMD seems numerically to correlate best with modified Sharp scores, but the differences between the correlation coefficients and the other DXR variables are not statistically significant.

At present, rheumatologists do not routinely use all of the RA measures mentioned in this study in their clinical review of subjects. Less than 10% of rheumatologists use pain scales, fewer of them use functional status questionnaires, and although joint counts are done, few are recorded in medical records.7 Wolfe and Zwillich showed that currently clinicians base their decisions on treatment change mainly on pain and joint count, which are immediately obvious during the subject interview and physical examination.29

In our study DXR measurements are related to long term disease activity in RA and therefore serial measurements in individual subjects may have the ability to assess response to treatment and this may be possible over sufficiently short time periods to improve patient care, owing to the high precision of the technique. Indeed, a recent study by Jensen et al30 has demonstrated that during a 2 year period, measurements of DXR BMD showed more rapid changes in those with active disease than in those with inactive disease and could distinguish those with erosive from those with non-erosive RA. A recent pilot longitudinal study from our own centre in subjects with early RA has identified how a measurement of the rate of change of DXR BMD in the first year of follow up may be able to identify those who become erosive by 4 years of disease with high specificity and reasonable sensitivity.31 DXR is simple to use, does not require intensive training, and if used in a clinical setting may have the ability quickly and cheaply to identify subjects not responding to standard treatment or may allow selection of subjects who would benefit from more aggressive treatment with newer, more expensive treatments at an earlier stage of the disease. Such a hypothesis will require testing in a formal prospective clinical trial.

In summary, we describe a method of assessing peripheral bone mass in RA that might have use in routine clinical care unlike the time consuming and rather imprecise measurement of joint space narrowing and erosion scores such as the Sharp score. Longitudinal studies will be required to determine the value of the technique in assessing suitability of subjects for expensive biological agents which can effectively limit joint destruction and bone loss.


We are grateful to the Arthritis Research Campaign (ARC) for supporting this work by a studentship for WBJ and for infrastructure support to AS and DMR.



  • Published Online First 26 August 2005

  • Competing interests: None