Objectives: To describe a novel scoring system for the assessment of tenosynovitis by magnetic resonance imaging (MRI) in patients with rheumatoid arthritis, and assess its intra- and inter-reader reliability in a multireader, longitudinal setting.
Methods: Flexor and extensor tenosynovitis were evaluated at the level of the wrist in 10 different anatomical areas, graded semi-quantitatively from grade 0 to 3 (total score 0–30), based on the maximum width of post-contrast enhancement within each anatomical area on axial T1-weighted MR images. Ten sets of baseline and 1-year follow-up MR images of the wrists of patients with rheumatoid arthritis with early and established disease were scored independently by four readers twice on 2 consecutive days. Intra- and inter-reader agreements were evaluated.
Results: The intrareader intraclass correlation coefficients (ICCs) were high for status scores (median ICCs 0.84–0.88) and slightly lower for change score (0.74). The smallest detectable difference (SDD) in % of the maximum score was 11.2–11.5% for status scores and 13.3% for change scores. Inter-reader single-measure ICCs were acceptable for both status scores (median 0.73–0.74) and change scores (0.67), while average-measures ICCs were very high for both status and change score (all ⩾0.94). The median scoring time per patient (baseline and follow-up images) was 7 min (range 3–10).
Conclusions: The introduced tenosynovitis scoring system demonstrates a high degree of multireader reliability, is feasible, and may be used as an adjuvant to the existing OMERACT RAMRIS score, allowing improved quantification of inflammatory soft tissue changes in patients with rheumatoid arthritis.
- CI, confidence interval
- ICC, intraclass correlation coefficients
- MDC, minimal detectable change
- MRI, magnetic resonance imaging
- RA, rheumatoid arthritis
- SDD, smallest detectable difference
Statistics from Altmetric.com
- CI, confidence interval
- ICC, intraclass correlation coefficients
- MDC, minimal detectable change
- MRI, magnetic resonance imaging
- RA, rheumatoid arthritis
- SDD, smallest detectable difference
Tenosynovitis is a common problem in rheumatoid arthritis (RA), and is observed in a large proportion of these patients.1,2 It is not specific for RA, and has also been described in other rheumatological diseases, such as systemic lupus erythematosus.3 The tenosynovium produces proinflammatory cytokines and proteolytic enzymes that are important in the tissue degradation seen in RA.4 The proliferation of the tenosynovial lining can lead to impaired function due to scarring and adhesions. Flexor tenosynovitis in the hands has also been identified as a risk factor for destructive erosion of the joints.5 The ongoing tenosynovial inflammation may ultimately lead to tendon rupture, which is a serious complication that leads to reduced hand function.6
Magnetic resonance imaging (MRI) is a powerful imaging modality that is now widely used in both scientific research and clinical settings to visualise joints of patients with RA. MRI is able to image structural damage and soft tissue changes, ie, synovitis, bone oedema, damage to cartilage and bone, as well as tendon pathology. Patients are likely to have ongoing tendon disease if MRI evidence of high-grade tendinopathy is identified in early disease.7 Clinicians should be aware of this so that preventive measures can be applied. If the tenosynovitis cannot be controlled by systemic drug therapy or local steroid injections, surgical intervention with tenosynovectomy or removal of bone spurs can prevent tendon ruptures and give long-term relief.6,8,9 When a rupture has occurred, reconstruction has a reasonable chance of restoring hand function as long as the number of affected tendons is limited.10
The complexity and the amount of information provided by MRI pose a challenge, and the issue is how best to use this modality as a reliable outcome measure. The OMERACT MRI-RA Working Group has for some years been working on this issue, and has developed a scoring system, the OMERACT RAMRIS, which includes a semiquantitative score for bone erosions, bone oedema and synovitis.11 RAMRIS has recently been validated in a multireader, longitudinal setting12, and the reliability of this scoring method is similar to that of the scoring methods used for conventional radiography, such as the Sharp and Larsen methods with modifications.13–15
Synovial inflammation is the primary lesion in RA, and studies suggest that progressive joint destruction does not occur in the absence of MRI synovitis.16–18 As the goal of modern drug therapy in early RA is to eliminate rather than inhibit radiographic erosive damage, future main outcome measures in both clinical trials and clinical practice should include a sensitive and robust measure of inflammation, such as MRI. Taking the frequent occurrence of tenosynovitis into account, scoring tenosynovitis in addition to the aspects covered by the RAMRIS, will allow a broader assessment of inflammation in patients with RA.
The aim of the present study was to introduce and describe a novel scoring system for the assessment of tenosynovitis, and assess the intra- and inter-reader reliability of this scoring system in a multireader, longitudinal setting in patients with RA with both early and established disease.
PATIENTS AND METHODS
Patient selection and image evaluation
MR images of the dominant wrist in 10 patients with RA, which showed progression of either erosions or joint space narrowing on conventional hand radiographs were included in the study (four with early RA and six with established disease, all fulfilling the American College of Rheumatology 1987 criteria for RA). The median (range) disease duration for the patients with early RA was 3.5 months (0.8–11.9), and for those with established RA 7.4 years (5.1–9.0). The patients received conventional RA treatment during the 12-month follow-up (methotrexate n = 6, prednisolone n = 3, combination therapy with hydroxychloroquine and sulphasalazine n = 1). The regional ethics committee evaluated the study, and all enrolled patients gave informed consent. The median interval between the first and second scan was 12 months, and ranged from 12 to 14 months. The paired images were read in chronological order on large-screen (21′′) radiological workstation monitors using a standard PACS software program (SECTRA IDS5, Sweden). This software package provides the readers with advanced features of image viewing, allowing the reader to adjust window/level settings, to zoom in/out, and to use a localiser that allows the accurate placement of specific lesions in two planes (axial and coronal), with the opportunity to measure distances and areas accurately.
The readers met for 2 days to perform the reading. The four readers had different levels of experience with MRI reading. MØ and BE were experienced MRI readers, and both had taken part in the development and testing of the OMERACT RAMRIS system; however, they had no previous experience in the tenosynovitis scoring system. EAH had some experience with the RAMRIS, and had previously used the tenosynovitis score in a pilot study, and thus had some experience using the score. NPK was familiar with MR-reading, including assessment of tenosynovitis, but did not have any previous experience with neither the RAMRIS nor the tenosynovitis scoring system. The four readers met for a brief training session the evening before the exercise, to review the scoring method and for initial training of the readers not familiar with the tenosynovitis scoring system. All readers read images independently at four different workstations in four separate locations. A technician coded the image sets and removed patient names. This procedure was repeated for a second reading the consecutive day, with rearrangement of the image sets in a different order and with a different coding. The score sheets from day 1 were sealed in envelopes until the second reading was completed.
MRI of the dominant wrist was performed at baseline and at 1 year, using a GE Signa 1.5T MRI scanner (General Electric (GE) Signa, Milwaukee, Wisconsin, USA) with a dedicated high-resolution wrist phased array coil. The same scanner and wrist coil were used for both examinations. The hand was placed in the wrist coil at the patient’s side with the coil anchored to the base tray to reduce motion artefacts. The MRI sequences in this study included the OMERACT recommended MRI core set of sequences,11 but only pre- and post-contrast T1-weighted axial images (slice thickness 3.0 mm) were used to evaluate the tenosynovitis. The image sequences were tested in a pilot study, and developed in collaboration with an experienced MR radiologist and a product specialist from GE. Details of the sequences can be found in a previous study.12 Experienced technicians reviewed the images immediately after acquisition, and a sequence was reobtained if the quality was not acceptable.
Anatomy of the wrist tendons
On the dorsal side the extensor tendons are stabilised by the extensor retinaculum, with septations dividing the extensor tendons into six discrete compartments (for details see fig 1 score sheet with illustration of anatomy of the wrist). The compartments contain the following tendons: (I) extensor pollicis brevis, abductor pollicis longus; (II) extensor carpi radialis brevis, extensor carpi radialis longus; (III) extensor pollicis longus; (IV) extensor digitorum communis, extensor indicus proprius; (V) extensor digiti quinti proprius; (VI) extensor carpi ulnaris. On the volar side the tendons can be divided into four discrete anatomical regions: (1) the flexor carpi ulnaris tendon (located ulnar to the carpal tunnel); (2) the flexor digitorum superficialis and profundus tendons (in the carpal tunnel, enclosed in a common sheath—the ulnar bursa); (3) the flexor pollicis longus tendon (located dorsally and radially to the median nerve as it passes through the carpal tunnel, and entering a continuous sheath that becomes the radial bursa); (4) the flexor carpi radialis (localised radially to the tendons enclosed in the ulnar bursa). Thus, the extensor and flexor tendons can be divided into 10 separate anatomical areas that can be evaluated for tenosynovitis.
The tenosynovitis scoring system
Both flexor and extensor tenosynovitis are evaluated. The 10 anatomical areas to be scored for tenosynovitis (described above) are evaluated between the radioulnar joint and the hook of the hamate. Tenosynovitis is visualised on MRI as tendon sheath fluid, sheath thickening and enhancement after intravenous contrast injection. As small amounts of fluid can be seen in normal tendon sheets,18 it is essential that the tenosynovitis is visible in at least two consecutive axial slices within the tendon sheet to be scored as abnormal. Tendon sheath abnormalities are graded semiquantitatively from grade 0 to grade 3, reflecting the maximum width (in mm) of enhancement within each anatomical area as described below:
Grade 0 (normal): no peritendinous effusion or synovial proliferation with enhancement.
Grade 1: <2 mm peritendinous effusion and/or synovial proliferation with enhancement.
Grade 2: ⩾2 and <5 mm peritendinous effusion and/or synovial proliferation with enhancement.
Grade 3: ⩾5 mm peritendinous effusion and/or synovial proliferation with enhancement.
The extent of the synovial enhancement is measured at the point of maximal thickness, perpendicular to the tendon surface. This grading scheme is based on a modification of a scheme that has been applied in another study of tenosynovitis.19 Missing tendons, for example due to physiological anatomical variants, are given a score of zero. The range of the total score is from zero to 30. The score sheet that was used including a detailed illustration of the anatomy of this region is provided in fig 1. Examples of different grades of tenosynovitis in patients with RA are found in fig 2. The readers recorded the time consumed scoring each image set according to the tenosynovitis scoring system.
Intrareader and inter-reader reliability were evaluated using a two-way mixed effect model, and single measure and average measure intraclass correlation coefficients (ICC) were calculated for both status scores and change scores. The average measure ICC corrects for the number of readers and was calculated for the inter-reader reliability. The ICC is presented as median (range) for intra-reader reliability due to the low number of values, and as mean with 95% CI for the inter-reader reliability analyses. ICC values are comparable with κ values; scores above 0.60 are considered good, and scores above 0.80 very good. Sensitivity to change was assessed by calculating the smallest detectable difference (SDD), derived from the limits of agreement method described by Bland and Altman.20 The SDD represents the smallest change score that can be discriminated from the measurement error of the scoring method, and is expressed in the same units of measurement as calculated for the score. Using SDD as the threshold level for relevant progression of joint damage ensures that an observed change with 95% confidence exceeds the measurement error. Minimal detectable change (MDC) is a way to express the SDD as a percentage of the maximum score of the method, to allow comparisons with other radiographic and clinical measures. An SDD of zero is perfect agreement, and there is no convention regarding any upper limit; however, a MDC of less than 20% is generally accepted to reflect a high potential to detect changes.20 All statistical analyses were performed using the Statistical Package for the Social Sciences for Windows, version 12 (SPSS, Chicago, Illinois, USA).
A total of 10 sets of MR images from wrist joints in patients with RA were scored by four readers. The intra-reader single-measure ICC, the SDD and the MDC are presented in table 1.
The intrareader ICCs were generally high for status scores (median baseline ICCs 0.84 and follow-up ICC 0.88) and slightly lower for change score (median ICC 0.74). The SDD and MDC of the tenosynovitis scoring system were acceptable and comparable with similar measures, ie the OMERACT RAMRIS score for MR images and Larsen and Sharp scores for conventional radiographs.12–15
Table 2 provides the inter-reader single measure and average measure ICCs for baseline, 1-year follow-up and change scores.
Inter-reader single-measure ICCs were acceptable for both status scores (baseline ICC 0.73 and follow-up ICC 0.74) and change score (median ICC 0.67), while the average measures ICCs were very high for both status and change score (all ICCs >0.94). An overview of the distribution of scores is shown in table 3, including the mean with 95% confidence intervals and median with minimum and maximum values for tenosynovitis scores for baseline, 12-month follow-up and change.
The mean tenosynovitis score ranged from 1.50 to 11.3, and for individual assessments from a minimum of 0 to a maximum score of 15 (out of 30). The complete scoring of one patient (baseline and follow-up images) was completed in median 7 min, ranging from 3 to 10 min.
In this study we assessed the intra- and inter-reader agreement of a novel tenosynovitis score in a longitudinal setting, allowing us to assess the reliability of this measure both for measuring status and progression. The reason for developing such a scoring system was based on work performed under the OMERACT umbrella. The OMERACT MRI in RA group has suggested MRI definitions of the important RA joint pathologies (synovitis, bone oedema and erosions), a “core set” of basic MRI sequences, and a scoring system for RA, namely the RAMRIS. Potential future research areas suggested at OMERACT 7 included “other structures/types of pathologies (e.g. cartilage, tenosynovitis)”.22 We have chosen to focus on tenosynovitis, and developed a simple scoring system that may be used as an adjuvant to RAMRIS.
Our scoring system graded tenosynovitis semiquantitatively from 0 to 3, ie the same range as for RAMRIS synovitis score. Tenosynovitis is evaluated on T1-weighted axial images pre- and post-contrast enhancement and these MRI sequences are part of the RAMRIS core set of sequences, ie no extra sequences are required. Tehranzadeh et al23 recently described that contrast-enhanced T1-weighted images allowed earlier detection of tenosynovitis in acute and subacute stages compared with T2-weighted sequences, and that the majority of patients demonstrated higher tenosynovitis scores using contrast-enhanced T1-weighted sequences. Thus, it appears that the standard RAMRIS sequences are sufficient for evaluating tenosynovitis. McQueen et al24 have previously described an extensive scoring system for the evaluation of tendinopathy, which comprises tendon signal change, tendon size and tendon sheath signal change. To evaluate these measures, it is necessary to obtain sequences not included in the RAMRIS core set. As the RAMRIS is already quite extensive and time consuming, we felt it was important to develop a simple scoring system that can easily be evaluated within the framework of the RAMRIS core set of MRI sequences, and only requires minimal time to score. Ostendorf et al3 has previously described a semiquantitative scoring system for evaluating tenosynovitis in systemic lupus erythematosus, where proliferative and oedematous tenosynovitis is scored separately as “not present”, “slight”, “moderate” or “severe”. Although our tenosynovitis scoring system also is semiquantitative, each level of scoring is precisely defined quantitatively in millimetres as described in the Methods section, and can easily be measured using standard MRI imaging software, thus minimising bias caused by the reader’s judgement.
We decided to focus on the wrist area in the development of the tenosynovitis score, as both the finger and wrist tendons are visible in this area, and we aimed at including all relevant tendons. The reason for not including evaluation of the distal part of the finger tendons is for feasibility issues—it is difficult to obtain high-quality MRIs with a large field-of-view that allows simultaneous examination of both the wrist area and the fingers, and it would be time-consuming to evaluate the finger tendons separately both proximally and distally. Tendons that lie in the same anatomical compartment were scored as one entity, and the results were not weighted. The fact that both the intra- and inter-reader ICCs were so high, despite three of the readers had never used this scoring method previously, underlines the feasibility of this scoring system. The one reader that had experience with the use of this method had a slightly higher intra-reader ICC, which implies that the ICCs found in this study may be further improved by training.
Modern treatment strategies in patients with RA consist of early aggressive therapy, and the aim is total suppression of joint inflammation, the elimination of radiographic progression and no functional disability. This aim is within reach with new drugs and treatment strategies, but early initiation of therapy as well as sensitive monitoring of disease progression and changes in inflammatory activity are required. MRI has superior sensitivity to detect joint inflammation and structural damage compared with conventional methods,24–26 and, potentially, has the desired sensitivity in such a setting.
In this multireader study we have found that this novel tenosynovitis scoring system demonstrates adequate reliability, in line with existing scoring systems, is feasible, and may be used as an adjuvant to the existing OMERACT RAMRIS score, to be able to better quantify soft tissue changes reflecting inflammation in RA. Further prospective studies are needed to explore the sensitivity to change of this scoring system, as well as the clinical relevance of the MRI tenosynovitis score. An additional important item is to examine whether the combination of the RAMRIS synovitis and bone marrow oedema scores together with this novel tenosynovitis score is a more responsive marker of inflammatory activity in RA than each of the single components alone.
We thank research nurse Margareth Sveinsson for collecting clinical data, research coordinator Tone Omreng for organising the data collection, technician Marianne Ytrelid for technical assistance, medical illustrator Kari Toverud (MS CMI) for help with fig 1 and Petter Mowinckel (MSc) for statistical advice.
Published Online First 28 March 2007
Funding: This study was supported in part by grants from The Research Council of Norway, The Norwegian Rheumatism Association, The Norwegian Women Public Health Association, Grethe Harbitz Legacy and Marie and Else Mustad’s Legacy.
Competing interests: None
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.