Three methods to construct predictive models using logistic regression and likelihood ratios to facilitate adjustment for pretest probability give similar results

doi:10.1016/j.jclinepi.2007.02.012

Journal of Clinical Epidemiology

Volume 61, Issue 1, January 2008, Pages 52-63

https://doi.org/10.1016/j.jclinepi.2007.02.012

Abstract

Objective

To compare three predictive models based on logistic regression to estimate adjusted likelihood ratios allowing for interdependency between diagnostic variables (tests).

Study Design and Setting

This study was a review of the theoretical basis, assumptions, and limitations of published models; and a statistical extension of methods and application to a case study of the diagnosis of obstructive airways disease based on history and clinical examination.

Results

Albert's method includes an offset term to estimate an adjusted likelihood ratio for combinations of tests. The Spiegelhalter and Knill-Jones (SKJ) method uses the unadjusted likelihood ratio for each test as a predictor and computes shrinkage factors to allow for interdependence. Knottnerus' method differs from the other two because it requires the tests to be sequenced, which limits its application to situations where there are few tests and substantial data. Although parameter estimates differed between the models, predicted “posttest” probabilities were generally similar.
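As a rough sketch of how the first two methods are commonly written (the notation below, including $n_D$, $n_{\bar D}$, $\beta_j$, $\alpha$, and $b_j$, is an assumption for illustration and is not taken from the article), the offset in Albert's method can be read as the log pretest odds in the study sample, so that the remaining linear predictor estimates a log likelihood ratio. In the SKJ method each fitted coefficient acts as a shrinkage factor on the corresponding unadjusted log likelihood ratio.

```latex
% Albert's method (sketch): the offset is fixed with coefficient 1
\operatorname{logit} P(D \mid x)
  = \underbrace{\log\frac{n_D}{n_{\bar D}}}_{\text{offset: log pretest odds}}
  + \beta_0 + \sum_j \beta_j x_j ,
\qquad
\log \mathrm{LR}(x) \approx \beta_0 + \sum_j \beta_j x_j .

% SKJ method (sketch): unadjusted log likelihood ratios as predictors
\operatorname{logit} P(D \mid x)
  = \alpha + \sum_j b_j \,\log \mathrm{LR}_j(x_j) ,
\qquad
\log \mathrm{LR}_j^{\text{adj}}(x_j) = b_j \,\log \mathrm{LR}_j(x_j) .
```

Under this reading, a shrinkage factor $b_j$ close to 1 would suggest that test $j$ contributes nearly independent information, while a value well below 1 would suggest overlap with the other tests.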

Conclusion

Construction of predictive models using logistic regression is preferred to the independence Bayes' approach when it is important to adjust for dependency of test errors. Methods to estimate adjusted likelihood ratios from predictive models should be considered in preference to a standard logistic regression model to facilitate ease of interpretation and application. Albert's method provides the most straightforward approach.

Section snippets

Background

Evaluation of the diagnostic value of clinical history, examination, and subsequent tests relies on the ability to combine multiple items of diagnostic information. It is through combining several items that good predictive accuracy is achieved. Application to individual patients requires tailoring results according to pretest probabilities of disease [1].

Studies that evaluate combinations of items usually create a diagnostic rule, scoring system or predictive model, which can be used to rate

Illustrative example

The Clinical Assessment of the Reliability of the Examination-Chronic Obstructive Airway Disease (CARE-COAD) study group designed a series of multinational studies to obtain reliable information on the accuracy of the history and physical examination in diagnosing obstructive airways disease (OAD) [12]. The CARE-COAD1 study [13] recruited 309 consecutive patients and noted four items from the history (age, sex, chronic OAD history, smoking history) and two from the clinical examination (wheeze,

Methods

For illustration, we demonstrate the models using only two binary tests, OAD history and age group (Table 2). We first apply the independence Bayes' approach, and then the conventional and alternative logistic regression approaches.
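The independence Bayes' approach mentioned above can be sketched in a few lines: pretest odds are multiplied by the likelihood ratio of each test result, as if the tests were conditionally independent. This is a minimal sketch, not the authors' code; the function name and the numbers in the example (a pretest probability of 0.2 and likelihood ratios of 8 and 2) are hypothetical and are not taken from Table 2.

```python
import math


def posttest_probability(pretest_probability, likelihood_ratios):
    """Independence Bayes update (illustrative sketch).

    Converts the pretest probability to odds, multiplies by the
    likelihood ratio of every observed test result (assuming the
    tests are conditionally independent), and converts back.
    """
    if not 0.0 < pretest_probability < 1.0:
        raise ValueError("pretest probability must lie strictly between 0 and 1")
    pretest_odds = pretest_probability / (1.0 - pretest_probability)
    posttest_odds = pretest_odds * math.prod(likelihood_ratios)
    return posttest_odds / (1.0 + posttest_odds)


# Hypothetical example: two positive test results with LRs of 8 and 2.
example = posttest_probability(0.2, [8.0, 2.0])
print(round(example, 3))
```

Because the same likelihood ratios can be combined with any pretest probability, this form makes adjustment for a different clinical setting straightforward; the cost is that dependence between the tests is ignored.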

Results for full illustrative example

Parameter estimates for analyses of the COAD1 data sets including all four tests are shown in Table 3 for the independence Bayes model, the conventional and Albert's logistic regression models, and the Spiegelhalter and Knill-Jones (SKJ) approach. Likelihood ratios are estimable only from the independence Bayes and SKJ models; they are converted into odds ratios (by computing ratios of likelihood ratios) solely for comparison with the logistic regression models.
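The conversion described above, an odds ratio computed as a ratio of likelihood ratios, can be illustrated for a single binary test. This is a minimal sketch with hypothetical function names and made-up sensitivity and specificity values; it is not the calculation behind Table 3.

```python
def likelihood_ratios(sensitivity, specificity):
    """Return (LR+, LR-) for a binary test from its sensitivity and specificity."""
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive, lr_negative


def odds_ratio_from_likelihood_ratios(lr_positive, lr_negative):
    """Diagnostic odds ratio as the ratio of the two likelihood ratios."""
    return lr_positive / lr_negative


# Hypothetical test with sensitivity 0.8 and specificity 0.9.
lr_pos, lr_neg = likelihood_ratios(0.8, 0.9)
print(odds_ratio_from_likelihood_ratios(lr_pos, lr_neg))
```

The odds ratio obtained this way is on the same scale as the exponentiated coefficient of a binary predictor in a logistic regression model, which is what makes the comparison between models possible.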

Likelihood ratios adjusted for dependence using the SKJ approach are all

Discussion

Predictive models are frequently published in the medical literature, both for diagnostic and prognostic applications. Although some models are constructed using Bayesian reasoning, logistic regression is frequently used to take account of dependence between tests. Logistic regression estimates a log odds ratio for each test, simultaneously taking account of other tests included in the model [19]. Although the log odds ratio provides a measure of test performance, it is difficult for clinicians

Acknowledgments

We are grateful to Sharon Straus for providing the CARE-COAD data sets. The work was supported by National Health and Medical Research Council (NHMRC) Grants No. 211205 and No. 402764 to the Screening and Test Evaluation Program. Jon Deeks is supported by a UK Department of Health NCCRCD Senior Research Scientist in Evidence Synthesis award. This work was undertaken as a Master's thesis by the first author.

References (22)

F.A. McAlister et al.
Why we need large, simple studies of the clinical examination: the problem and a proposed solution
Lancet
(1999)
L. Irwig
Modelling result-specific likelihood ratios
J Clin Epidemiol
(1992)
J. Hilden
Statistical diagnosis based on conditional independence does not require it
Comput Biol Med
(1984)
D.G. Fryback
Bayes' Theorem and conditional nonindependence of data in medical diagnosis
Comput Biomed Res
(1978)
D.L. Sackett et al.
Clinical epidemiology: a basic science for clinical medicine
(1991)
T. McGinn et al.
Diagnosis: Clinical prediction rules
A. Laupacis et al.
Clinical prediction rules. A review and suggested modification of methodological standards
JAMA
(1997)
K.G.M. Moons et al.
Test research versus diagnostic research
Clin Chem
(2004)
J.A. Ingelfinger et al.
Biostatistics in clinical medicine
(1994)
D.G. Kleinbaum et al.
Logistic regression: a self-learning text
(2002)

J.J. Deeks et al.
Diagnostic tests 4: likelihood ratios
BMJ
(2004)

