Discordance between Patient and Physician Assessments of Disease Severity in Systemic Sclerosis

MARIE HUDSON; ANN IMPENS; MURRAY BARON; JAMES R. SEIBOLD; BRETT D. THOMBS; JENNIFER G. WALKER; the Canadian Scleroderma Research Group; RUSSELL STEELE

doi:10.3899/jrheum.100354

Abstract

Objective. To describe the magnitude and correlates of discordance between patient and physician assessments of disease severity in patients with systemic sclerosis (SSc).

Methods. Subjects were patients enrolled in the Canadian Scleroderma Research Group Registry. The outcomes of interest were patient and physician global assessments of disease severity (scales ranging from 0–10). Predictors of disease severity represented the spectrum of disease in SSc (skin involvement, severity of Raynaud’s phenomenon, shortness of breath, gastrointestinal symptoms and pain, number of fingertip ulcers, tender and swollen joints, creatinine, and fatigue). The results of the analysis were validated in an independent sample of patients with SSc from the United States.

Results. Patients perceived greater disease severity than physicians (mean difference 0.78 ± 2.65). The agreement between patient and physician assessments of disease severity was, at best, modest (intraclass correlation 0.3774; weighted κ 0.3771). Although both patients and physicians were influenced by skin scores, breathlessness, and pain, the relative importance of these predictors differed. Patients were also influenced by other subjective symptoms, while physicians were also influenced by disease duration and creatinine. The predictors explained 56% of the deviance in the patient global assessments and 29% in the physician assessments. These findings were confirmed in the US dataset.

Conclusion. Patients and physicians rate SSc disease severity differently in magnitude and are influenced by different factors. Patient-assessed and physician-assessed measures of severity should be considered as complementary and used together in future studies of SSc.

Discordance of assessments between patients and physicians occurs when patients and physicians assign different values to a health trait¹. Discordance between patient and physician assessments of disease activity has been described in several rheumatic diseases, including rheumatoid arthritis (RA)²,³, systemic lupus erythematosus¹,⁴,⁵, and ankylosing spondylitis⁶. In those studies, when rating disease activity, patient assessments were more strongly associated with subjective symptoms, such as pain, psychological well-being, and function, while physician assessments were more strongly associated with objective findings, including laboratory tests. Discordance has the potential to impede patient care; patients may fail to comply with medical instructions if they are poorly informed of their condition or if physicians fail to appreciate the full effect of disease on their patients.

Little is known about the presence and magnitude of possible discordance in the assessment of disease activity in systemic sclerosis (SSc), probably in part because measuring disease activity in SSc is particularly difficult. Unlike systemic lupus erythematosus and RA, SSc is not characterized by episodes of acute inflammation (manifested by synovitis, pleuritis, dermatitis, and nephritis) that can be easily differentiated from quiescent phases. Instead, the clinical features of SSc are attributable to vascular and connective tissue fibrosis that is more difficult to appreciate and quantify than inflammation and that, by the time it becomes measurable, has often progressed to permanent damage. Many patients, especially those with limited skin involvement, have an indolent course without clear signs of inflammation. Further, elevated acute-phase proteins are inconsistently associated with early SSc, leading some to argue that patients with SSc may have an impaired acute-phase response⁷,⁸.

Given the difficulty of measuring disease activity in SSc and of separating it from disease damage, disease severity has been proposed as an appropriate measure of disease status in SSc. Indeed, Medsger defines disease severity in SSc as the total effect of disease on organ function at a given point in time, including both reversible (activity) and irreversible (damage) components⁹. Severity is therefore likely to be a better measure than activity for assessing disease status, and possible discordance, in SSc.

Thus, we undertook this study to (1) identify the extent to which patient and physician assessments of disease severity differed, and (2) identify and compare the predictors of patient and physician assessments of disease severity in patients with SSc.

MATERIALS AND METHODS

Design

We performed a cross-sectional study of a Canadian sample of patients with SSc and confirmed the results using a sample of patients with SSc from the United States.

Study subjects

The Canadian subjects were patients enrolled in the Canadian Scleroderma Research Group (CSRG) Registry. Patients in this registry are recruited from the practices of rheumatologists across Canada. They must have a diagnosis of SSc made by the referring rheumatologist, be > 18 years of age, be fluent in English or French, and likely to be compliant with study procedures and visits. The patients included were those whose baseline visit was between September 2004 and 2008. The US patients were recruited from the University of Michigan Scleroderma Program between December 2005 and April 2006. A total of 105 sequential ambulatory patients with SSc were recruited and consented to participate in a study on hand functioning. Four subjects did not complete the study.

Outcome measures

The patient and physician global assessments of disease severity in the Canadian patients were done using numerical rating scales (NRS) ranging from 0–10. The NRS is simple to complete and score and has been shown to be as reliable and responsive as visual analog scales (VAS) for measuring disease activity and function in ankylosing spondylitis¹⁰ and more reliable for assessing pain in patients with RA¹¹. Physicians were asked to “rate the patient’s overall health for the past week” and the NRS was anchored by the descriptors “no disease” and “very severe disease.” Patients were asked to “rate your disease in the past week” and the NRS was anchored by “no disease” and “very severe limitation.” The patient and physician global assessments of disease severity in the US patients were assessed using a VAS ranging from 0–100 mm, anchored by the descriptors “no severity” and “extremely severe.” The scores on the VAS of 0–100 were divided by 10 to be comparable to the NRS ratings ranging from 0–10. Although the wording of the anchors on the global assessments differed slightly, the scores ranging from 0–10 were assumed to be equivalent.

Predictor variables

Potential predictors of disease severity were chosen to represent the spectrum of disease in SSc, and included severity of Raynaud’s phenomenon (RP), skin involvement, fingertip ulcers, shortness of breath, joint symptoms, gastrointestinal (GI) symptoms, kidney involvement, pain, and fatigue. In both samples, the methods for data collection were similar. The extent of skin involvement was recorded using the modified Rodnan skin score. Similarly, the number of fingertip ulcers and a simplified 28 swollen and tender joint count¹² were recorded by physical examination by a well-trained health professional using standardized definitions. Creatinine was documented by laboratory testing.

Data on RP, GI symptoms, shortness of breath, and pain were collected using a self-report measure, the Scleroderma-Health Assessment Questionnaire (S-HAQ)¹³. The S-HAQ consists of the Disability Index of the HAQ (HAQ-DI) and items that measure symptoms specific to SSc using VAS. The HAQ-DI is a self-administered measure intended to assess functional ability in arthritis¹⁴. The disease-specific questions in the S-HAQ relate to the severity of various symptoms, including RP, GI symptoms, shortness of breath, and pain in the past week. Each item is anchored by the adjectives “does not interfere” and “very severe limitation” and scored separately. The Canadian patients answered the disease-specific questions on the S-HAQ using an 11-point NRS, while a 0–100 mm VAS was used by the US patients.

Finally, fatigue was measured using the Vitality subscale of the Medical Outcomes Study Short Form-36 (SF-36) questionnaire¹⁵,¹⁶. The SF-36 Vitality subscale includes 4 Likert items with 5 response options each (all of the time to none of the time) that assess patients’ level of fatigue during the previous 4 weeks. Scores are normalized with a mean of 50 and SD of 10. Scores below 50 represent worse fatigue and above 50, less. The SF-36 Vitality subscale has been used to measure fatigue in general population samples and in patients with medical illness and injury. A recent systematic review concluded that the SF-36 Vitality subscale has good evidence for validity, reliability, sensitivity to change, and feasibility in RA¹⁷.

Statistical analysis

The initial analyses were done using the Canadian data. The standard measure of agreement for quantitative measures is the intraclass correlation coefficient (ICC) and, for ordered categorical variables, the weighted κ statistic. Treating the disease severity scores ranging from 0–10 in turn as continuous and as ordinal variables, we calculated the ICC and the weighted κ statistic. We also fit a linear mixed model that isolated heterogeneity due to the physicians from overall disagreement to determine whether physician heterogeneity was responsible for disagreement between patient and physician assessments.
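To illustrate the two agreement statistics, the following minimal Python sketch computes an ICC and a weighted κ for paired 0–10 scores. The text does not state which form of the ICC or which κ weights were used, so the one-way random-effects ICC and the quadratic weights below are assumptions, as are the function names and the use of NumPy. This is not the authors' code.

```python
import numpy as np

def icc_oneway(x, y):
    """One-way random-effects ICC for two ratings per subject (assumed form)."""
    ratings = np.column_stack([x, y]).astype(float)
    n, k = ratings.shape
    subject_means = ratings.mean(axis=1)
    grand_mean = ratings.mean()
    # Between-subject and within-subject mean squares from a one-way ANOVA
    ms_between = k * np.sum((subject_means - grand_mean) ** 2) / (n - 1)
    ms_within = np.sum((ratings - subject_means[:, None]) ** 2) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

def weighted_kappa(x, y, n_categories=11):
    """Cohen's kappa with quadratic weights (assumed) for integer scores 0..n_categories-1."""
    x = np.asarray(x, dtype=int)
    y = np.asarray(y, dtype=int)
    # Observed joint proportions of (patient score, physician score)
    observed = np.zeros((n_categories, n_categories))
    np.add.at(observed, (x, y), 1)
    observed /= observed.sum()
    # Proportions expected by chance from the two marginal distributions
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0))
    idx = np.arange(n_categories)
    weights = (idx[:, None] - idx[None, :]) ** 2  # squared disagreement
    return 1.0 - (weights * observed).sum() / (weights * expected).sum()

# Hypothetical scores for five patients, for illustration only
patient_scores = [4, 6, 2, 8, 5]
physician_scores = [3, 4, 2, 5, 5]
icc_value = icc_oneway(patient_scores, physician_scores)
kappa_value = weighted_kappa(patient_scores, physician_scores)
```

With quadratic weights, the weighted κ is known to approximate an ICC in large samples, so similar values for the two statistics would not be surprising under these assumptions.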

We undertook subsequent analyses to identify the predictors of patient and physician global assessments of disease severity, using generalized linear models (in particular, normal, Poisson, and negative binomial regression models). We fit 3 separate sets of models for each of the patient and physician global assessments of disease severity. The first set included all the selected covariates of severity. The second set included only the physician-recorded correlates. The third set included only the patient-recorded correlates of severity. In all regression models, we adjusted for demographic variables (age, gender, ethnicity, education) as well as disease duration. In multivariate analyses using generalized linear models, we found that a negative binomial regression model fit the data well for 4 of the 6 regression models. We observed underdispersion rather than overdispersion in the 2 other models (both of which used the patient-reported variables), so the negative binomial and Poisson models yielded very similar results. Model fit was assessed using percentage deviance explained, which is analogous to R² in standard linear regression models. Finally, because we identified differences in predictors of patient and physician global assessments of disease severity, we undertook a regression analysis to identify the predictors of the differences. We used normal linear regression to predict the difference between patient and physician severity scores, as neither model selection criteria nor diagnostics suggested that a normal assumption was inappropriate.
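The percentage deviance explained can be illustrated with a short Python sketch. For simplicity it fits a Poisson model with a log link rather than a negative binomial model, uses a hand-written iteratively reweighted least squares routine, and runs on simulated data. All of these choices, and every name in the code, are assumptions made for illustration; the published analysis was run in SPSS and R.

```python
import numpy as np

def fit_poisson(X, y, n_iter=50):
    """Fit a log-link Poisson regression by iteratively reweighted least squares.

    Assumes column 0 of X is an intercept column of ones.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    beta = np.zeros(X.shape[1])
    beta[0] = np.log(y.mean())  # start from the intercept-only fit
    for _ in range(n_iter):
        eta = X @ beta
        mu = np.exp(eta)
        z = eta + (y - mu) / mu       # working response
        XtW = X.T * mu                # working weights equal mu for Poisson
        beta = np.linalg.solve(XtW @ X, XtW @ z)
    return beta

def poisson_deviance(y, mu):
    """Poisson deviance; the y*log(y/mu) term is taken as 0 when y is 0."""
    y = np.asarray(y, dtype=float)
    safe_y = np.where(y > 0, y, 1.0)
    log_term = np.where(y > 0, y * np.log(safe_y / mu), 0.0)
    return 2.0 * np.sum(log_term - (y - mu))

def deviance_explained(X, y):
    """1 - (model deviance / null deviance), analogous to R-squared."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    mu = np.exp(X @ fit_poisson(X, y))
    null_mu = np.full_like(mu, y.mean())
    return 1.0 - poisson_deviance(y, mu) / poisson_deviance(y, null_mu)

# Simulated example: one hypothetical covariate on a 0-10 scale
rng = np.random.default_rng(0)
covariate = rng.uniform(0, 10, 2000)
design = np.column_stack([np.ones_like(covariate), covariate])
severity = rng.poisson(np.exp(1.0 + 0.1 * covariate))
share_explained = deviance_explained(design, severity)
```

A value near 0 means the covariates add little beyond the overall mean, and a value near 1 means they account for almost all of the deviance.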

Lastly, we sought to confirm our findings by applying our models to the US data. We used the estimated regression coefficients from the Canadian data to calculate predicted physician and patient severity assessments for the US data and estimated the association between the predicted assessments and the observed assessments using simple linear regression.
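The confirmation step can be sketched as follows: coefficients estimated in one sample are used, unchanged, to predict scores in a second sample, and the observed scores are then regressed on the predictions. The sketch assumes log-link coefficients (so predictions are exponentiated) and uses hypothetical names and toy numbers. It is an illustration of the general approach, not the authors' code.

```python
import numpy as np

def external_r2(X_new, y_new, beta):
    """R-squared from regressing observed scores on externally predicted scores.

    beta holds coefficients estimated in a different sample; a log link is assumed.
    """
    X_new = np.asarray(X_new, dtype=float)
    y_new = np.asarray(y_new, dtype=float)
    predicted = np.exp(X_new @ np.asarray(beta, dtype=float))
    # Simple linear regression of observed on predicted; the fitted intercept
    # lets the new sample have a different overall mean
    slope, intercept = np.polyfit(predicted, y_new, 1)
    fitted = intercept + slope * predicted
    ss_res = np.sum((y_new - fitted) ** 2)
    ss_tot = np.sum((y_new - y_new.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

# Toy validation sample: intercept column plus one hypothetical covariate
X_validation = np.array([[1, 0], [1, 2], [1, 4], [1, 6], [1, 8]])
y_validation = np.array([2.0, 1.5, 3.0, 4.5, 4.0])
beta_from_first_sample = [0.5, 0.15]  # hypothetical coefficients
r2 = external_r2(X_validation, y_validation, beta_from_first_sample)
```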

At the time of analysis, the CSRG had 936 patients entered in its registry, of whom 742 had complete data for the variables of interest in this study. The US sample had 101 subjects, of whom 61 had complete data. Patients included in and excluded from the analyses were compared, and there were no systematic differences. Therefore, only patients with complete data were included in the analyses. All statistical analyses were performed with SPSS v. 13 and the R statistical package¹⁸.

Ethical considerations

Each patient provided informed written consent to participate in the data collection process and ethics committee approval for our study was obtained at each site.

RESULTS

There were 803 patients included in this study, of whom 742 were Canadian and 61 were from the United States (Table 1). In the Canadian sample, 87% were women, mean age was 55.5 (± 12.4) years, and mean disease duration since the onset of the first non-RP disease manifestation was 10.7 (± 9.0) years. In the US sample, 86% were women, mean age was 51.4 (± 11.4) years, and mean disease duration since the onset of the first non-RP disease manifestation was 7.5 (± 8.4) years. On a scale ranging from 0 to 10, with 0 being the lowest and 10 being the greatest disease severity, the mean patient and physician global assessments of disease severity were 3.63 (± 2.54) and 2.85 (± 2.27), respectively, in the Canadian sample and 4.25 (± 2.59) and 2.04 (± 1.78), respectively, in the US sample. The mean difference between patient and physician assessment was 0.78 (± 2.65) in the Canadian sample and 2.21 (± 2.65) in the US sample. The positive values suggest that, on average, patients perceived greater disease severity than physicians. Of note, the difference in patient and physician ratings of disease severity was 0.53 (95% CI 0.23, 0.84) in patients with diffuse disease and 0.92 (95% CI 0.66, 1.17) in patients with limited disease. The difference between the 2 subsets was not statistically significant.

Table 1.

Baseline characteristics of study subjects. Fatigue was measured using the Medical Outcomes Study Short Form-36 questionnaire vitality subscale. Scores are normalized with a mean of 50 and standard deviation of 10. Scores below 50 represent worse fatigue and above 50, more vitality. Values are mean (SD) unless otherwise indicated.

Agreement between patient and physician global assessments of disease severity in the Canadian data

Using the disease severity scores ranging from 0–10 either as continuous or ordinal variables, we observed very similar ICC and weighted κ statistics (0.3774 and 0.3771, respectively). The values for these statistics indicate at best only fair agreement between patient and physician assessments of disease severity. We observed a slight difference in the extent of agreement in the 2 disease subsets [ICC of 0.29 (95% CI 0.19, 0.39) in the limited subset and ICC of 0.41 (95% CI 0.31, 0.50) in the diffuse subset], although this was not statistically significant.

A linear mixed model was used to assess the extent to which interphysician variability was responsible for the lack of agreement between the patient and physician severity scores. We did observe statistically significant variability between physicians in their assessments [Bayesian Information Criterion (BIC) of 3202 for a model that accounted for physician heterogeneity vs 3215 for a model that did not]. A difference of 6–10 in the value of the BIC indicates strong evidence against the null hypothesis and a difference of more than 10 indicates very strong evidence¹⁹. Thus, a difference of 13 suggests very strong evidence against the model that assumes no between-physician heterogeneity in assessments of disease severity. Nevertheless, only about 5% of the overall variability in patient severity scores could be explained by the differences among the physicians themselves.
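The BIC comparison above amounts to a subtraction followed by a threshold check, which the following small Python sketch makes explicit. Only the two thresholds quoted in the text are encoded; the function name and the label returned for smaller differences are hypothetical.

```python
def bic_evidence(bic_null, bic_alternative):
    """Classify evidence against the null model from a BIC difference.

    Uses only the thresholds quoted in the text: 6-10 is strong,
    more than 10 is very strong.
    """
    delta = bic_null - bic_alternative
    if delta > 10:
        return "very strong"
    if delta >= 6:
        return "strong"
    return "below the thresholds quoted in the text"

# Values reported above: 3215 without and 3202 with physician heterogeneity
label = bic_evidence(3215, 3202)
```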

Thus, based on these analyses, we concluded that agreement between patient and physician assessments of disease severity was, at best, modest. Interphysician variability in assessments accounted for only a small part of the differences in assessments.

Predictors of patient and physician global assessments of disease severity in the Canadian sample

We identified similarities and differences in the predictors of patient and physician global assessments of disease severity (Table 2). The OR reported in Table 2 represent the relative increase in the response (i.e., the patient or physician assessments of severity) for a 1-unit increase in the covariate of interest (e.g., skin score, shortness of breath, etc.). Although skin scores, shortness of breath, and pain were significant predictors of both patient and physician global assessments of disease severity when all covariates were included in the models, their relative effects on physician and patient assessments differed. For example, an increase of 1 unit in skin score was associated with about a 3% increase in the physician assessment of severity, controlling for all other variables (i.e., about a 15% increase for a 5-unit increase in skin score). In contrast, we estimated only a corresponding 0.9% increase in patient severity assessment for a 1-unit increase in skin score (again controlling for all other variables) or a 4.5% increase in mean patient assessment for a 5-unit increase in skin score. The OR estimates for shortness of breath were fairly similar in the models predicting patient (1.062) and physician (1.094) assessments of severity separately. However, pain had a larger effect in the model predicting patient-assessed severity (1.121) than in the model predicting physician-assessed severity (1.032).
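Because these estimates act multiplicatively, the effect of a multi-unit change is obtained by raising the per-unit ratio to the number of units. The sketch below shows the arithmetic in Python. The ratios 1.03 and 1.009 are assumptions inferred from the "about 3%" and "0.9%" figures in the text, and the function name is hypothetical.

```python
def percent_change(ratio_per_unit, units):
    """Percentage change in the expected score for a given covariate increase,
    assuming a multiplicative (log-link) model."""
    return (ratio_per_unit ** units - 1.0) * 100.0

# Skin score, 5-unit increase (ratios are assumptions, see above)
physician_change = percent_change(1.03, 5)   # roughly 16%
patient_change = percent_change(1.009, 5)    # roughly 4.6%
```

Multiplying the per-unit percentage by 5, as in the text, gives a close approximation (about 15% and 4.5%) because the per-unit effects are small.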

Table 2.

Negative binomial regression results to identify predictors of the physician (MD) and patient (Pt) global assessments of disease severity in the Canadian data. This table contains the estimated OR with 95% CI for the 6 different models. Values in bold type indicate CI that exclude the null value (1 on the OR scale, equivalent to 0 on the log scale). Note that creatinine was transformed by taking the square root in order to improve model assumptions and decrease the influence of outlying points. Results are given in terms of the square root of creatinine. The coefficient < 1 for fatigue reflects the fact that for the measurement of fatigue, lower scores represent worse fatigue.

In addition, significant predictors of patient assessments included severity of RP, GI symptoms, and fatigue. The coefficient < 1 for fatigue reflects the fact that for the measurement of fatigue, lower scores represent worse fatigue, while for the global assessment, lower scores represent less-severe disease. In turn, other significant predictors of physician assessments included disease duration, with early disease being considered worse, and creatinine. The regression models using all patient-reported and clinical covariates explained 56% of the deviance in the patient global assessments and 29% in the physician assessments, respectively. As expected, the patient-reported variables by themselves explained much more deviance in the patient assessment than the physician assessment (54% vs 14%) and the clinical variables by themselves explained more deviance in the physician assessment than the patient assessment (18% vs 5%). We also noted (but do not show) a significant interaction between disease duration and skin score in the models for the physician assessments (p < 0.001) that indicated that the amount by which the physician score would increase for high skin scores would be smaller for patients with longer disease duration.

Finally, given that we found differences in the predictors of patient-assessed and physician-assessed severity, we regressed the difference between the patient and physician assessments to determine what was most associated with the discordance between them (Table 3). Pain, GI symptoms, RP, and fatigue were associated with significantly higher values for the difference (i.e., contributed more to the patient assessment than the physician assessment). Increased skin score and creatinine were associated with significantly lower values for the difference (i.e., contributed more to the physician assessment than the patient assessment). Further, we again found a significant interaction between skin score and duration in this model that suggested that the longer the disease duration, the less an increased skin score would be associated with the difference (data not shown).
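A minimal Python sketch of this kind of difference regression is given below. It uses ordinary least squares on toy data with two hypothetical covariates, pain and skin score. The names and numbers are assumptions for illustration and do not reproduce Table 3.

```python
import numpy as np

def fit_difference_model(patient, physician, covariates):
    """Ordinary least squares for (patient score - physician score).

    Returns coefficients with the intercept first. A positive coefficient
    means the covariate raises the patient score relative to the physician
    score; a negative coefficient means the reverse.
    """
    difference = np.asarray(patient, dtype=float) - np.asarray(physician, dtype=float)
    X = np.column_stack([np.ones(len(difference)), np.asarray(covariates, dtype=float)])
    coef, *_ = np.linalg.lstsq(X, difference, rcond=None)
    return coef

# Toy data: columns are pain (0-10) and skin score (hypothetical values)
pain = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
skin = np.array([10.0, 0.0, 5.0, 20.0, 15.0, 2.0])
physician_toy = 2.0 + 0.1 * skin
patient_toy = physician_toy + 0.5 * pain - 0.1 * skin
coefficients = fit_difference_model(patient_toy, physician_toy,
                                    np.column_stack([pain, skin]))
```

In this toy example the pain coefficient is positive and the skin coefficient is negative, which mirrors the direction of the associations described above.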

Table 3.

Linear regression results to identify the predictors of the difference between patient and physician global assessments of severity in the Canadian data. Values in bold indicate CI that do not include 0. Note that creatinine was transformed by taking the square root in order to improve model assumptions and decrease the influence of outlying points on the results. Results are given in terms of the square root of creatinine.

Confirmation of the models in the US sample

To confirm our findings, we used the regression coefficients obtained from the Canadian data to predict physician assessments of severity, patient assessments of severity, and the difference between patient and physician assessments in the US patients. In these analyses, we allowed for the US and Canadian data to have different overall means, so as to examine the relationship of severity with the covariates, rather than the overall population mean. We found that the regression coefficients derived from the Canadian data explained 15.7% of the variability in the physician global assessments in the US data. This can be compared to an estimated prediction R² of 25.1% in the Canadian data. Similarly, regression coefficients from the Canadian data explained 43.4% of the variability in the patient assessment scores in the US data, compared to a prediction R² of 54.8% on the Canadian patient assessments. Finally, the Canadian model for the differences in assessments explained 22.3% of the variability in the difference in assessments in the US data, compared to a prediction R² of 33.3% for the Canadian data. Thus, prediction in the US data using the Canadian models was reasonably good.

We also investigated whether individual variables had a different relationship with disease severity in the Canadian and US samples. We found no strong evidence that the relationship between any of the covariates and the patient or physician assessments depended on the sample (data not shown).

DISCUSSION

We found some similarities but also important differences in how patients and physicians rate disease severity in SSc. On average, patients rated disease severity as worse than physicians did. Both patient and physician severity ratings were associated with physician-rated skin scores and with patient-reported shortness of breath and pain, although skin score was more strongly associated with physician ratings than with patient ratings and pain was a more robust correlate for patients than for physicians. Patient severity assessments were also significantly influenced by self-reported estimates of the severity of RP, GI symptoms, and fatigue, while physician global severity ratings were influenced by disease duration and creatinine.

Our report demonstrated that, using global assessments, patients and physicians rate disease severity differently in magnitude and are influenced by different factors. The implications of our findings are 2-fold. First, our findings suggest that traditional biomedical assessments of disease status in SSc (e.g., physician assessments of skin involvement or laboratory tests such as creatinine) may be supplemented by patient-derived information. In other words, patient-reported severity allows for more aspects of the disease to be captured than physician-reported assessments. In fact, it is striking that the predictors of importance for patients but not physicians were indeed in relation to symptoms for which good outcome measures in SSc are currently lacking (in particular GI symptoms and fatigue) or where patient reports are the only means of obtaining the information (in particular RP).

Second, in the absence of a gold standard to measure disease severity in SSc, both patient and physician global assessments of disease severity could be used together, to better approximate “true” disease severity. Indeed, in a study of RP in patients with SSc, both physician and patient assessments of RP activity were found to be valid and reliable and the authors recommended that both be included in the core set of measures for use in future clinical trials in this area²⁰. Similarly, although definitive validation of patient and physician global assessments of disease severity in SSc has yet to be done, our data suggest that the 2 measures may provide complementary data and both should be considered as outcome measures in this highly heterogeneous disease.

There are limitations that should be considered in interpreting the results of our study. First, patients in the CSRG registry are a convenience sample of patients with SSc. Their median disease duration since the onset of non-RP symptoms was 10 years, suggesting a sample of patients with generally stable disease. Moreover, patients with very severe SSc who were too sick to participate or who died earlier in their disease course were not included. This may have resulted in an overrepresentation of healthier patients in our SSc sample (survival cohort), and results may therefore not be generalizable to the full spectrum of SSc. Despite these limitations, the demographic and clinical characteristics of the CSRG Registry patients in this study were consistent with other outpatient SSc samples that have been reported in the research literature²¹.

Second, it is possible that the strong association between patient-assessed severity and symptoms (e.g., pain, fatigue, severity of RP) occurred because both outcome and predictors were self-reported and the relationship reflects, to some degree, characteristics of the patient that influence how distress is reported on self-report questionnaires²². As a result, the relationships between outcome and predictors may be overstated in the models for patient-assessed severity reported in our study. On the other hand, there are currently no good substitutes for patient-reported symptoms such as pain, fatigue, and severity of RP, and this limitation is thus largely inevitable.

Finally, both samples were composed predominantly of white, female patients with SSc. This limits the generalizability of our results to men and to patients with SSc from other ethnic groups.

The strength of our study lies in its large, multicenter sample of Canadian patients and validation of the results in an independent sample of patients with SSc.

We showed that patients and physicians rate SSc disease severity differently in magnitude and are influenced by different factors. Thus, patient-assessed and physician-assessed measures of severity should be considered as complementary and should be used together in future studies of SSc.

APPENDIX

Investigators of the Canadian Scleroderma Research Group: M. Baron, Montreal, Quebec; J. Pope, London, Ontario; J. Markland, Saskatoon, Saskatchewan; D. Robinson, Winnipeg, Manitoba; N. Jones, Edmonton, Alberta; N. Khalidi, Hamilton, Ontario; P. Docherty, Moncton, New Brunswick; E. Kaminska, Hamilton, Ontario; A. Masetto, Sherbrooke, Quebec; D. Smith, Ottawa, Ontario; E. Sutton, Halifax, Nova Scotia; J-P. Mathieu, Montreal, Quebec; M. Hudson, Montreal, Quebec; S. Ligier, Montreal, Quebec; T. Grodzicky, Montreal, Quebec; S. Mittoo, Winnipeg, Manitoba; M. Fritzler, Advanced Diagnostics Laboratory, Calgary, Alberta.

Footnotes

Supported in part by the Canadian Institutes of Health Research, the Scleroderma Society of Canada, the Cure Scleroderma Foundation, and educational grants from Actelion Pharmaceuticals and Pfizer Inc. Dr. Hudson is funded by a New Investigator Award from the Canadian Institutes of Health Research. Additional funding was provided by the Jonathan and Lisa Rye Scleroderma Research Fund and the Marvin and Betty Danto Research Fund at the University of Michigan.

Accepted for publication July 16, 2010.

REFERENCES

1.↵
1. Yen JC,
2. Neville C,
3. Fortin PR
. Discordance between patients and their physicians in the assessment of lupus disease activity: relevance for clinical trials. Lupus 1999;8:660–70.
OpenUrl Abstract/FREE Full Text
2.↵
1. Hanly JG,
2. Mosher D,
3. Sutton E,
4. Weerasinghe S,
5. Theriault D
. Self-assessment of disease activity by patients with rheumatoid arthritis. J Rheumatol 1996;23:1531–8.
OpenUrl PubMed
3.↵
1. Nicolau G,
2. Yogui MM,
3. Vallochi TL,
4. Gianini RJ,
5. Laurindo IM,
6. Novaes GS
. Sources of discrepancy in patient and physician global assessments of rheumatoid arthritis disease activity. J Rheumatol 2004;31:1293–6.
OpenUrl Abstract/FREE Full Text
4.↵
1. Neville C,
2. Clarke AE,
3. Joseph L,
4. Belisle P,
5. Ferland D,
6. Fortin PR
. Learning from discordance in patient and physician global assessments of systemic lupus erythematosus disease activity. J Rheumatol 2000;27:675–9.
OpenUrl PubMed
5.↵
1. Alarcon GS,
2. McGwin G Jr.,
3. Brooks K,
4. Roseman JM,
5. Fessler BJ,
6. Sanchez ML,
7. et al.
Systemic lupus erythematosus in three ethnic groups. XI. Sources of discrepancy in perception of disease activity: a comparison of physician and patient visual analog scale scores. Arthritis Rheum 2002;47:408–13.
OpenUrl CrossRef PubMed
6.↵
1. Spoorenberg A,
2. van Tubergen A,
3. Landewe R,
4. Dougados M,
5. van der Linden S,
6. Mielants H,
7. et al.
Measuring disease activity in ankylosing spondylitis: patient and physician have different perspectives. Rheumatology 2005;44:789–95.
OpenUrl Abstract/FREE Full Text
7.↵
1. Kucharz EJ,
2. Grucka-Mamczar E,
3. Mamczar A,
4. Brzezinska-Wcislo L
. Acute-phase proteins in patients with systemic sclerosis. Clin Rheumatol 2000;19:165–6.
OpenUrl CrossRef PubMed
8.↵
1. Medsger TA Jr.
Assessment of damage and activity in systemic sclerosis. Curr Opin Rheumatol 2000;12:545–8.
OpenUrl CrossRef PubMed
9.↵
1. Medsger TA Jr.,
2. Silman AJ,
3. Steen VD,
4. Black CM,
5. Akesson A,
6. Bacon PA,
7. et al.
A disease severity scale for systemic sclerosis: development and testing. J Rheumatol 1999;26:2159–67.
OpenUrl PubMed
10.↵
1. Van Tubergen A,
2. Debats I,
3. Ryser L,
4. Londoño J,
5. Burgos-Vargas R,
6. Cardiel MH,
7. et al.
Use of a numerical rating scale as an answer modality in ankylosing spondylitis-specific questionnaires. Arthritis Rheum 2002;47:242–8.
OpenUrl CrossRef PubMed
11.↵
1. Ferraz MB,
2. Quaresma MR,
3. Aquino LR,
4. Atra E,
5. Tugwell P,
6. Goldsmith CH
. Reliability of pain scales in the assessment of literate and illiterate patients with rheumatoid arthritis. J Rheumatol 1990;17:1022–4.
OpenUrl PubMed
12.↵
1. van Gestel A,
2. Haagsma C,
3. van Riel P
. Validation of rheumatoid arthritis improvement criteria that include simplified joint counts. Arthritis Rheum 1998;41:1845–50.
OpenUrl CrossRef PubMed
13.↵
1. Steen VD,
2. Medsger TA Jr.
The value of the Health Assessment Questionnaire and special patient-generated scales to demonstrate change in systemic sclerosis patients over time. Arthritis Rheum 1997;40:1984–91.
OpenUrl PubMed
14.↵
1. Fries JF,
2. Spitz P,
3. Kraines RG,
4. Holman HR
. Measurement of patient outcome in arthritis. Arthritis Rheum 1980;23:137–45.
OpenUrl PubMed
15.↵
1. Ware JE Jr.,
2. Sherbourne CD
. The MOS 36 item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care 1992;30:473–83.
OpenUrl PubMed
16.↵
1. Ware J,
2. Kosinski M,
3. Bjorner J,
4. Turner-Bowker D,
5. Gandek B,
6. Maruish M
. User’s manual for the SF-36v2 health survey. 2nd ed. Lincoln, RI, USA: QualityMetric Inc.; 2007.
17.↵
1. Hewlett S,
2. Hehir M,
3. Kirwan JR
. Measuring fatigue in rheumatoid arthritis: a systematic review of scales in use. Arthritis Rheum 2007;57:429–39.
OpenUrl CrossRef PubMed
18.↵
R Development Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2008.
19.↵
1. Kass RE,
2. Raftery AE
. Bayes factors. J Am Stat Assoc 1995; 90:773–95.
OpenUrl CrossRef
20. Merkel PA, Herlyn K, Martin RW, Anderson JJ, Mayes MD, Bell P, et al. Measuring disease activity and functional status in patients with scleroderma and Raynaud’s phenomenon. Arthritis Rheum 2002;46:2410–20.
21. Chifflot H, Fautrel B, Sordet C, Chatelus E, Sibilia J. Incidence and prevalence of systemic sclerosis: a systematic literature review. Semin Arthritis Rheum 2008;37:223–35.
22. Meehl P. Why summaries of research on psychological theories are often uninterpretable. Psychol Rep 1990;66:195–244.
