Advanced Search
Zhang D, Liu Y Y, Zhang J, et al. Development and validation of a machine learning model for predicting in-hospital recurrent intensive care unit admission in critically ill patients with ischemic stroke based on the MIMIC-Ⅳ databaseJ. Chin J Clin Med, 2026, 33(3): 461-470. DOI: 10.12025/j.issn.1008-6358.2026.20260149
Citation: Zhang D, Liu Y Y, Zhang J, et al. Development and validation of a machine learning model for predicting in-hospital recurrent intensive care unit admission in critically ill patients with ischemic stroke based on the MIMIC-Ⅳ databaseJ. Chin J Clin Med, 2026, 33(3): 461-470. DOI: 10.12025/j.issn.1008-6358.2026.20260149

Development and validation of a machine learning model for predicting in-hospital recurrent intensive care unit admission in critically ill patients with ischemic stroke based on the MIMIC-Ⅳ database

  • Objective To develop and validate a prediction model for in-hospital recurrent intensive care unit (ICU) admission in critically ill patients with ischemic stroke (IS) based on machine learning (ML) algorithms.
    Methods Clinical data from 2 929 IS patients were included from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database. Least absolute shrinkage and selection operator (LASSO) regression was used to identify predictive factors, and the synthetic minority over-sampling technique (SMOTE) was employed to create a derivation cohort comprising 2 583 patients. These patients were randomly divided into a training set (n=2 066) and a test set (n=517) at an 8:2 ratio. Five ML algorithms, including decision tree, random forest, adaptive boosting (AdaBoost), gradient boosting decision tree (GBDT), and support vector machine (SVM), were performed to construct prediction models. Five-fold cross-validation was used to evaluate the performance of the model in the training set. The area under the receiver operating characteristic curve (ROC-AUC) and decision curve analysis (DCA) were used to assess and compare the models in the testing set. The best-performing model was interpreted by shapley additive explanations (SHAP).
    Results Among the 2 929 patients included, 704 (24.0%) experienced in-hospital recurrent ICU admission. Among the five ML models, the random forest model demonstrated the best predictive performance, with an AUC of 0.839 (95%CI 0.801–0.877). Feature importance analysis identified five most significant features affecting model prediction, including APS Ⅲ score, albumin, age, heart rate, and SOFA score.
    Conclusions ML-based models can effectively predict the risk of in-hospital recurrent ICU admission in critically ill patients with IS. The random forest model showed superior predictive performance, which may have potential applications in early clinical risk stratification and intervention.
  • loading

Catalog

    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return