Email updates

Keep up to date with the latest news and content from Critical Care and BioMed Central.


Delirium and mortality risk prediction: a story in evolution

Eduard E Vasilevskis12, Jin H Han12, Ayumi Shintani12, Timothy D Girard12 and E Wesley Ely12*

Author Affiliations

1 Division of Allergy, Pulmonary, Critical Care Medicine, Center for Health Services Research, 6th Floor Medical Center East, #6109, Vanderbilt University Medical Center, Nashville, TN 37232-8300, USA

2 Geriatric Research, Education and Clinical Center (GRECC), 1310 24th Avenue S., VA Tennessee Valley, Nashville, TN 37212, USA

For all author emails, please log on.

Critical Care 2010, 14:449  doi:10.1186/cc9282

See related research by van den Boogaard et al.,

The electronic version of this article is the complete one and can be found online at:

Published:29 October 2010

© 2010 BioMed Central Ltd


We read with interest the article by van den Boogaard and colleagues, which proposed that delirium measured within 24 hours of admission did not improve the Acute Physiology and Chronic Health Evaluation (APACHE) II in-hospital mortality prediction [1]. Their data should be interpreted after considering the study design and statistical limitations.

First, the Confusion Assessment Method for the Intensive Care Unit (CAM-ICU) measurements include assessing the level of consciousness (using any valid sedation scale), which is highly correlated with the Glasgow Coma Score. Therefore it is not surprising that addition of delirium to the APACHE score (which includes the Glasgow Coma Score) on the first intensive care unit day does not alter predictions; however, earlier detection of delirium at the initial evaluation of Emergency Department patients is an important predictor of death [2]. We have found that the level of consciousness (via the Richmond Agitation-Sedation Scale) has been predictive of in-hospital mortality, but this relationship is not as strong as the independent value of delirium duration (via the CAM-ICU) for predicting long-term survival, even after adjusting for APACHE II score and sedatives [3,4].

Second, the authors base their conclusions upon comparisons of areas under the curve using the c statistic. Recent insights suggest that this analytic method is insensitive and open to type II error [5]. A more sensitive method to assess additive predictive ability applies likelihood ratio testing between models with and without additional risk factors. In addition, substantial improvements in risk reclassification may be apparent despite limited increases in the c statistic.

In sum, it may be true (but confirmation is required) that adding delirium to a measurement such as the APACHE score is not of value. Clinicians and hospital quality officers should continue to consider early detection of delirium and ongoing delirium detection as an important prognostic tool.

Authors' response

Mark van den Boogaard, Pieter Leffers and Lisette Schoonhoven

We thank Dr Vasilevskis and coworkers for their interest in our publication [1]. We are fully aware of the limitations of the c statistic as a measure for clinical usefulness of a predictive model - that is why we did not base our conclusions only on the lack of improvement of the c statistic, but also on the deteriorating ability to predict mortality [1].

As Cook pointed out in her publication, the evaluation of the clinical usefulness of risk-stratification models is not at all straightforward [5]; others make it clear that the last word about proper analysis and its interpretations has not yet been written [6,7]. This complicated issue needs further methodological development and thorough discussion. In addition to this, we would like to stress that showing the independent contribution of delirium after controlling for covariables in a Cox regression model is not a valid method to show the clinical usefulness of delirium as a predictor of mortality, not even when the corrected hazard ratio is high [5,8]. Also, showing the improved model fit from adding a variable to a model with the log-likelihood test does not serve that purpose [8].

As Vasilevskis and colleagues correctly point out, the probable reason why delirium does not add to the predictive properties of the APACHE score is that the latter already contains variables that essentially measure the same information about the clinical state of the patient. The predictive validity of a model is usually and mainly determined by its power to discriminate and/or by its ability to predict outcome (calibration). The reclassification index is a potentially interesting tool for evaluation of predictive models. Unfortunately this index is highly dependent on the width of the chosen categories of predicted risk. We do not know of category boundaries that would have a direct meaning for clinical decision-making [5,8]. Because proper interpretation of the index will not be possible, we have chosen not to include such an analysis.

In summary, despite shortcomings of various methods to determine the predictive value, our conclusion remains that delirium does not improve the predictive value of the APACHE score.


APCHE: Acute Physiology and Chronic Health Evaluation; CAM-ICU: Confusion Assessment Method for the Intensive Care Unit.

Competing interests

The authors declare that they have no competing interests.


  1. van den Boogaard M, Peters SAE, van der Hoeven JG, Dagnelie PC, Leffers P, Pickkers P, Schoonhoven L: The impact of delirium on the prediction of in-hospital mortality in intensive care patients.

    Crit Care 2010, 14:R146. PubMed Abstract | BioMed Central Full Text | PubMed Central Full Text OpenURL

  2. Han JH, Shintani A, Eden S, Morandi A, Solberg LM, Schnelle J, Dittus RS, Storrow AB, Ely EW: Delirium in the emergency department: an independent predictor of death within 6 months.

    Ann Emerg Med 2010, 56:244-252. PubMed Abstract | Publisher Full Text OpenURL

  3. Ely EW, Shintani A, Truman B, Speroff T, Gordon SM, Harrell FE Jr, Inouye SK, Bernard GR, Dittus RS: Delirium as a predictor of mortality in mechanically ventilated patients in the intensive care unit.

    JAMA 2004, 291:1753-1762. PubMed Abstract | Publisher Full Text OpenURL

  4. Pisani MA, Kong SYJ, Kasl SV, Murphy TE, Araujo KLB, Van Ness PH: Days of delirium are associated with 1-year mortality in an older intensive care unit population.

    Am J Respir Crit Care Med 2009, 180:1092-1097. PubMed Abstract | Publisher Full Text OpenURL

  5. Cook NR: Use and misuse of the receiver operating characteristic curve in risk prediction.

    Circulation 2007, 115:928-935. PubMed Abstract | Publisher Full Text OpenURL

  6. Janket SJ, Shen Y, Meurman JH: Letter by Janket et al regarding article, 'Use and misuse of the receiver operating characteristic curve in risk prediction'.

    Circulation 2007, 116:e133. PubMed Abstract | Publisher Full Text OpenURL

  7. Pepe MS, Janes H, Gu JW: Letter by Pepe et al regarding article, 'Use and misuse of the receiver operating characteristic curve in risk prediction'.

    Circulation 2007, 116:e132. PubMed Abstract | Publisher Full Text OpenURL

  8. Steyerberg EW: Clinical Prediction Models; A Practical Approach to Development, Validation, and Updating. 1st edition. Rotterdam: Springer Science+Business Media, LCC; 2009. OpenURL