Article Text
Abstract
Objective To assess whether ultrasonographic scoring of (i) both parotid and submandibular salivary glands and (ii) all individual components of the Hocevar scoring system, is needed for classifying patients as primary Sjögren’s syndrome (pSS).
Methods Ultrasound examination of the major salivary glands (sUS) was performed in 204 consecutive patients clinically suspected (n=171) or diagnosed (n=33) with pSS.
Parenchymal echogenicity, homogeneity, hypoechogenic areas, hyperechogenic reflections and salivary gland posterior border were scored in left and right parotid and submandibular glands. Logistic regression analyses were performed to assess which glands and sUS components contributed significantly to classification as pSS or non-pSS according to the 2016 American College of Rheumatology-European League Against Rheumatism (ACR-EULAR) criteria.
Results 116 (57%) patients were classified as pSS, the remaining as non-pSS. Instead of scoring both sides (area under the curve; AUC=0.856, Nagelkerke R2=0.526), multivariate analysis showed that sUS scoring of only right (AUC=0.850; R2=0.518) or left (AUC=0.852; R2=0.511) parotid and submandibular glands is sufficient to predict ACR-EULAR classification. Moreover, all individual components of the Hocevar scoring system significantly predicted classification. Multivariate analysis showed that parenchymal echogenicity and hypoechogenic areas contributed independently to ACR-EULAR classification (AUC=0.857; R2=0.539). Scoring these components in one parotid and one submandibular gland highly predicted ACR-EULAR classification (AUC=0.855; R2=0.539). Scoring only hypoechogenic areas on one side showed almost similar results (AUC=0.846; R2=0.498).
Conclusion sUS examination of parotid and submandibular glands on one side is sufficient to predict classification of patients according to the ACR-EULAR criteria. To further increase feasibility of sUS in outpatient clinics worldwide, only hypoechogenic areas can be scored.
- sjøgren’s syndrome
- ultrasonography
- autoimmune diseases
Statistics from Altmetric.com
Introduction
Primary Sjögren’s syndrome (pSS) is a chronic, systemic autoimmune disease with inflammation of the exocrine glands being the most common feature, leading to keratoconjunctivitis sicca and a sensation of oral dryness (xerostomia). Extreme fatigue and several extraglandular manifestations are also common.1 2
Ultrasound of major salivary glands (sUS) is an upcoming imaging technique to assess the involvement of major salivary glands in pSS. It is gaining more acceptance in daily clinical practice, as it is non-irradiating, non-invasive and easy to perform.3–6
A meta-analysis has recently identified 33 scoring systems used to evaluate the major salivary glands in patients with SS.3 This heterogeneity within the scoring systems was related to the type of salivary glands examined, the ultrasonographic characteristics evaluated and the cut-off points applied. The scoring system of Hocevar et al 7 is the most extensive and has previously been applied in several studies.6–10 However, this scoring system is time consuming and therefore less feasible in daily clinical practice. In order to become part of the routine in daily clinical practice in the outpatient rheumatologic clinics worldwide, sUS should be as easy, fast and efficient as possible. Previously, more simple scoring systems have been used,11–14 but none of these systems were constructed on the basis of a multivariate regression model.
According to the Outcome Measures in Rheumatology Clinical Trials (OMERACT) filter, three aspects are considered important when it comes to the applicability of a new diagnostic method, that is, truth, discrimination and feasibility.15 Truth questions whether sUS measures what is intended. Discrimination evaluates whether sUS differentiates between situations of interest. Both aspects were previously investigated in our validity and reliability studies.6 10 Feasibility examines whether sUS can be applied easily, given constraints in time, money and interpretability. Since time is precious in daily clinical practice, the feasibility of sUS could be increased by reducing the number of glands and/or the number of components scored. Therefore, the aim of this study was to assess whether ultrasonographic scoring of (i) both parotid and submandibular glands and (ii) all five components of the Hocevar scoring system,7 is needed for classifying patients as pSS.
Materials and methods
Patients
In this cross-sectional study derived from our daily clinical practice, consecutive patients clinically suspected or diagnosed with pSS were included. The largest group consisted of tertiary referrals with clinical suspicion of pSS, referred to the University Medical Center Groningen (UMCG) for evaluation at our multidisciplinary Sjögren’s expertise centre. The remaining part were participants of our REgistry of Sjögren syndrome in Umcg—LongiTudinal (RESULT) cohort, in which consecutive patients with pSS are included. All patients were aged ≥18 years and underwent a full (diagnostic) workup, including an ultrasound of the major salivary glands. Patients with an incomplete workup or an associated systemic autoimmune disease (eg, rheumatoid arthritis, systemic lupus erythematosus) were excluded.
Ultrasonography
The ultrasonographic scanner (Esaote MyLabSeven, Genova, Italy), equipped with a high-resolution linear scanner (4–13 MHz) was used. During the examination, patients were lying in supine position, with their neck slightly extended and head turned towards the opposite side. The scoring system by Hocevar et al 7 was used and the following components were investigated in both parotid and submandibular salivary glands: (i) parenchymal echogenicity compared with thyroid gland or when there was co-incident thyroid gland disease compared with surrounding anatomical structure, graded 0–1; (ii) homogeneity, graded 0–3; (iii) presence of hypoechogenic areas, graded 0–3; (iv) hyperechogenic reflections, graded 0–3 in the parotid glands and 0–1 in the submandibular glands and (v) clearness of salivary gland border, graded 0–3. Ultrasonographic examination was performed by three well-trained ultrasonographers (AJS, KD and JFN). These ultrasonographers previously showed good to excellent interobserver reliability and excellent intraobserver reliability on the total Hocevar score.6
ACR-EULAR criteria
Patients were classified as pSS or non-pSS according to the recently published 2016 American College of Rheumatology-European League Against Rheumatism (ACR-EULAR) classification criteria.16 Routinely, all items of these criteria were assessed, including serology and histopathology. To assess fulfilment of the salivary gland histology item, labial as well as parotid glands biopsy results were considered.17 18
Statistical analysis
Descriptive parameters were expressed as number of patients (%) for categorical data and mean (SD) for continuous data. Mann-Whitney U test was used to evaluate the differences in ultrasound scores between pSS and non-pSS patients according to the ACR-EULAR criteria. Associations between different ultrasound scores were analysed using Spearman’s correlation coefficient (ρ); ρ was interpreted as poor (0.0–0.2), fair (0.2–0.4), moderate (0.4–0.6), good (0.6–0.8) or excellent (0.8–1.0) association.19 Receiver operating characteristic (ROC) analysis was performed to determine the accuracy of sUS to predict classification according to the ACR-EULAR criteria. Area under the curve (AUC) was interpreted as no discrimination (0–0.5), poor (0.5–0.7), fair (0.7–0.8), good (0.8–0.9) or excellent (0.9–1.0) accuracy.20
The Hocevar total score (four glands, five components) was evaluated using univariate logistic regression analysis with ACR-EURLAR classification as dependent variable. Univariate and multivariate logistic regression analyses applying the Forward Wald method were performed for ultrasound total score for (i) all separate glands, (ii) all individual components of the Hocevar scoring system and (iii) combining i and ii.7 The explained variance was evaluated using Nagelkerke R2. IBM SPSS Statistics V.23 (SPSS, Chicago, Illinois, USA) was used to perform the statistical analyses. P values <0.05 were considered to be statistically significant.
Results
Between October 2014 and February 2017, 224 consecutive patients were evaluated in our multidisciplinary Sjögren’s expertise centre. Twenty patients had to be excluded because of incomplete workup (n=14) or presence of an associated systemic autoimmune disease (n=6). Of the 204 included patients, mean age was 52 (13.8) years and 85% were female (table 1). This cohort consisted of tertiary referrals with clinical suspicion of pSS (n=171), and patients previously diagnosed with pSS from our RESULT cohort (n=33). A salivary gland biopsy was available in 184 patients. In the remaining 20 patients, the result of the biopsy would not influence classification as pSS or non-pSS and therefore these patients were not willing to undergo a salivary gland biopsy as part of the routine diagnostic workup. In this cohort, 116 (57%) patients were classified as pSS according to the ACR-EULAR criteria.
Online supplementary figure 1 shows ultrasonographic images of normal submandibular and parotid glands and images of submandibular and parotid glands suggestive of pSS. Ultrasound scores of all four glands differed significantly between pSS and non-pSS patients(online supplementary figure 2).
Supplementary file 1
Scoring only one parotid and one submandibular gland is sufficient
In the total group, there were very high correlations between the ultrasound scores of both parotid glands (ρ=0.909), both submandibular glands (ρ=0.868) and between the combination of left parotid and submandibular glands versus the right parotid and submandibular glands (ρ=0.926; figure 1). In other words, patients who scored high on their left parotid gland also scored high on their right parotid gland and the same is true for the submandibular glands.
Correlations between the ultrasound scores of the left parotid and left submandibular gland (ρ=0.731) as well as the right parotid and right submandibular gland (ρ=0.734) were slightly lower (figure 1). In summary, the ultrasound scores of the same type of salivary glands are interchangeable (ie, right vs left parotid gland; right vs left submandibular gland), but this is not always true for the score of the different types of glands (ie, parotid vs submandibular gland).
The accuracy of sUS to predict ACR-EULAR classification based on the original Hocevar score (four glands, five components) was good (AUC=0.856; figure 2). Accuracy was comparable when only the scores of the right parotid and right submandibular gland (AUC=0.850), or only the scores of the left parotid and left submandibular gland (AUC=0.852) were included (figure 2).
In the univariate analysis, the explained variance (R2)for the total Hocevar score of all four glands was 0.526 (table 2 and table 3). sUS scores of all individual glands significantly predicted ACR-EULAR classification, with R2 ranging from 0.419 for the left parotid gland to 0.473 for the left submandibular gland (table 2B).
In the multivariate analysis, only the sUS scores of the right parotid and left submandibular gland independently predicted ACR-EULAR classification (R2=0.527), due to high correlation between both parotid glands and between both submandibular glands. When only the right parotid and right submandibular gland (R2=0.518) or only the left parotid and left submandibular gland (R2=0.511) were included, this resulted in models with equally high explained variance. In summary, scoring of only one side is enough to predict ACR-EULAR classification. Whether left or right side is scored does not make any difference.
Scoring only parenchymal echogenicity and hypoechogenic areas is sufficient
The distribution of scores for the individual Hocevar components differed significantly between pSS and non-pSS patients(online supplementary figure 3). In univariate analysis, all individual components significantly predicted ACR-EULAR classification. ACR-EULAR classification was predicted best by hypoechogenic areas (R2=0.505). The correlation between the components homogeneity and hypoechogenic areas was very high (ρ=0.927; table 4), suggesting that these two components measure the same and that scoring both is not necessary. The multivariate analysis, including all five components measured in all four glands, showed that only parenchymal echogenicity and hypoechogenic areas independently predicted ACR-EULAR classification (R2=0.539), with a good AUC of 0.857.
Scoring two glands and one or two components increases feasibility of ultrasound in pSS
Taking into account only the right parotid and right submandibular glands, the results of the univariate analysis of the individual components of the Hocevar scoring system were similar compared with the model including all four glands (table 2B). In the multivariate analysis, only parenchymal echogenicity and hypoechogenic areas independently predicted ACR-EULAR classification (R2=0.539), with a good AUC of 0.855 (figure 2). Taking into account only the left parotid and left submandibular glands, similar results were found (table 2C; figure 2).
Thus, sUS examination of parotid and submandibular glands on one side and scoring parenchymal echogenicity and hypoechogenic areas, is sufficient to predict classification of patients according to the ACR-EULAR criteria.
However, this system can be simplified even further by only scoring the hypoechogenic component of the Hocevar system, without decreasing the accuracy. When only scoring the hypoechogenic areas of the right parotid and right submandibular glands, the explained variance was 0.498, with a good AUC of 0.846. The explained variance and accuracy were again similar for the left parotid and left submandibular glands (figure 2).
Discussion
We proved that ultrasound scoring of hypoechogenic areas in one parotid and one submandibular gland is enough to predict ACR-EULAR classification of patients with or without pSS.
Thus far, there is no agreement regarding which ultrasonographic scoring system should be used in the evaluation of major salivary glands of patients suspected or clinically diagnosed with pSS. Based on a recent meta-analysis,3 the Hocevar scoring system7 was incorporated in our daily clinical practice to address all relevant aspects of sUS, since this system appeared to most comprehensive and was therefore also recently applied in our reliability and validity studies.6 10 As mentioned before, more simple scoring systems have been used previously,11–14 but, to the best of our knowledge, none of these simpler scoring systems were constructed applying a multivariate regression model. Ideally, in every rheumatologic outpatient clinic around the world the same scoring system for evaluation of salivary glands in the diagnostic workup of pSS should be applied. In order to reach such a goal, an essential first step was to perform a multivariate logistic regression analysis using the results of the most comprehensive scoring system commonly applied.7 Having access to a reliable, simpler sUS scoring system would greatly increase the feasibility of sUS as a routine procedure to be applied in the diagnostic workup of patients with pSS.
There was an excellent correlation between the ultrasound scores of the right and left parotid glands and the ultrasound scores of the right and left submandibular glands. Correlation between ultrasound scores of parotid and submandibular glands was somewhat lower, in agreement with previous reports.21 These results indicate that scoring of the two parotid glands is interchangeable and the same applies to both submandibular glands, but that ultrasound scores of the parotid compared with the submandibular glands can differ within patients. In some patients, the parotid glands are more affected, whereas in others the submandibular glands are more affected. Therefore, ultrasound abnormalities might be underdiagnosed in a subgroup of patients when only parotid glands or only submandibular glands are taken into account. This is confirmed by the multivariate regression model, in which only one parotid gland and one submandibular gland were found to independently contribute to ACR-EULAR classification as pSS or non-pSS. Our results therefore show that scoring of one side is adequate, but that the parotid as well as the submandibular gland should be scored.
Previously, studies found more pathological changes in the submandibular glands compared with the parotid glands in patients suspected with pSS.22 23 Takagi et al therefore suggested to reduce the sUS examination to only the submandibular glands in order to save time.22 On the other hand, very recently it was suggested to measure only the parotid glands with a simplified scoring system.14 Based on our results, we cannot confirm either suggestion. Furthermore, neither of these studies performed multivariate regression analyses to actually proof their statements.
Another important reason to include sUS examination of the parotid glands, besides that it was included in our model, is that suspicion of the presence of a mucosa-associated lymphoid tissue lymphoma in the parotid gland might rise at an earlier stage. We therefore recommend to compile a global view of all four glands, especially when patients experience unilateral complaints, but to actually score only one side. With this global view, unilateral and bilateral morbidities of the salivary glands can be distinguished, increasing the specificity of the examination.
At group level, all five individual components of the Hocevar scoring system were significant predictors of ACR-EULAR classification. However, the differences in explained variance were quite large, that is, with the lowest value for hyperechogenic reflections and the highest value for hypoechogenic areas, indicating that not every component is equally highly predictive of ACR-EULAR classification. In the multivariate analysis, parenchymal echogenicity and hypoechogenic areas were found to independently predict ACR-EULAR classification. It might even be considered to measure only hypoechogenic areas of one parotid and one submandibular gland, since parenchymal echogenicity added only slightly to the explained variance and diagnostic accuracy. Taking into account the reliability of the individual components of the Hocevar score, reducing the scoring system to only hypoechogenic areas increases the applicability even more. Recently, we showed that the interobserver reliability of hypoechogenic areas was good (kappa >0.7). In contrast, the reliability of parenchymal echogenicity was only fair.6 Two other studies showed similar results even though slightly different scoring systems were applied.24 25 Both articles also contain ultrasonographic images of major salivary glands with different stages of ultrasonographic abnormalities suggestive of pSS.24 25
Previously, it was assumed that the presence of hypoechoic rounded multiple parenchymal lesions is a pathognomonic feature of pSS and that only evident parenchymal inhomogenicity is of true diagnostic value.12 13 We agree with these assumptions. In the Hocevar scoring system, homogeneity and hypoechogenic areas are scored separately. On the other hand, in the scoring system used by Theander and Mandl,12 only parenchymal homogeneity is measured. However, when taking a closer look at their scoring system, in fact a combination of homogeneity and hypoechogenic areas is scored.12 Approximately the same component was measured in a German cohort.13 In our study, the correlation between homogeneity and hypoechogenic areas was excellent. Therefore, it can indeed be interpreted that both components measure the same concept in salivary glands of patients suspected or diagnosed with pSS. When there is a homogenous gland, there are no hypoechogenic areas present and when the gland is very inhomogeneous, this is usually due to the presence of hypoechogenic areas and less due to the presence of hyperechogenic reflections.
Reduction of the examination duration has the advantage that the sUS examination is easier to learn and takes less time to perform in clinical practice. By increasing the feasibility of sUS, it will facilitate outpatient rheumatologic clinics worldwide to integrate the ultrasound in the daily clinical practice when it comes to evaluating patients clinically suspected or diagnosed with pSS. We demonstrated that it is adequate to measure only one out of five components of the Hocevar scoring system, in only one side, without recommending a particular side.
The next step, before incorporation of sUS in the classification criteria is possible, would be to define exactly how to score hypoechogenic areas and decide the optimal cut-off point based on data from multiple centres.
In conclusion, in our cohort study derived from daily clinical practice, we showed that it is sufficient to measure parenchymal echogenicity and hypoechogenic areas in one parotid and one submandibular gland to predict classification as pSS or non-pSS according to the ACR-EULAR criteria. Taking into account the relatively small difference in diagnostic accuracy as well as the reliability of the individual ultrasound components, we pose that only measuring hypoechogenic areas in one parotid and one submandibular gland also is sufficient. As the severity of sUS findings in the parotid and submandibular glands may differ, one parotid and one submandibular gland of either the left or the right side should be scored. Simplification of sUS scoring will increase the feasibility of sUS in daily clinical practice and will facilitate worldwide implementation of the ultrasound in the outpatient rheumatologic clinics.
References
Footnotes
Handling editor Tore K Kvien
Contributors EM, designed the study, collected, analysed and interpreted data, drafted the article and approved the version to be published; SA, designed the study, analysed and interpreted the data, critically revised the article and approved the version to be published; JFvN, KD and AJS, collected data, critically revised the article and approved the version to be published; FGMK and FKLS, critically revised the article and approved the version to be published and AV and HB, designed the study, critically revised the article and approved the version to be published.
Funding The Dutch Arthritis Foundation, research grant for the REgistry of Sjögren syndrome in Umcg—LongiTudinal (RESULT) cohort; METc 2014/491. Bristol-Myers Squibb, unrestricted research grant for the RESULT cohort. Horizon 2020, a research project supported by European Commission: HarmoncSS primary Sjögren’s Syndrome (pSS) towards improved satisfaction, treatment and health policy making (H2020-SC1-2016-RTD, proposal 731944).
Competing interests None declared.
Ethics approval This study was conducted in accordance with the Medical Ethics Committee of the UMCG; waiver 016/120.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement No additional data available.
Presented at Part of this study has been presented at the ACR 2017 in San Diego and the corresponding abstract has been published as a conference abstract.