Early prediction of acute respiratory distress syndrome complicated by acute pancreatitis based on four machine learning models

Highlights

  • ML can be a practical and effective method for the early prediction of AP complicated by ARDS.

  • PaO2, CRP, PCT, LA, Ca2+, NLR, WBC, and AMY formed the optimal feature subset for the early ML-based identification of AP patients at high risk of developing ARDS.

  • BC was the best-performing predictive model, and EDTs could be promising for prediction in larger samples.

Abstract

Background

Acute Respiratory Distress Syndrome (ARDS) is a common complication of Acute Pancreatitis (AP) and is associated with high mortality. This study used Machine Learning (ML) to predict ARDS in patients with AP at admission.

Methods

The authors retrospectively analyzed the data from patients with AP from January 2017 to August 2022. Clinical and laboratory parameters with significant differences between patients with and without ARDS were screened by univariate analysis. Then, Support Vector Machine (SVM), Ensembles of Decision Trees (EDTs), Bayesian Classifier (BC), and nomogram models were constructed and optimized after feature screening based on these parameters. Five-fold cross-validation was used to train each model. A test set was used to evaluate the predictive performance of the four models.
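The five-fold cross-validation step mentioned above can be illustrated with a minimal sketch. It uses only the Python standard library and shows how sample indices are partitioned into five training/validation splits; the function name `k_fold_indices`, the sample count of 100, and the random seed are assumptions made for illustration, since the abstract does not report the training-set size or the software used.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Spread any remainder so that fold sizes differ by at most one sample.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, val_idx
        start += size


# Hypothetical training-set size; the abstract does not report the actual split.
folds = list(k_fold_indices(n_samples=100, k=5))
for i, (train_idx, val_idx) in enumerate(folds, start=1):
    print(f"fold {i}: train={len(train_idx)} val={len(val_idx)}")
```

Each sample serves as validation data exactly once, and each candidate model would be fitted on the remaining four folds in turn. The held-out test set is kept outside this loop.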

Results

A total of 83 (18.04%) of 460 patients with AP developed ARDS. Thirty-one features with significant differences between the groups with and without ARDS in the training set were used for modeling. The Partial Pressure of Oxygen (PaO2), C-reactive protein, procalcitonin, lactic acid, Ca2+, the neutrophil:lymphocyte ratio, white blood cell count, and amylase were identified as the optimal subset of features. In the test set, the BC algorithm had the best predictive performance, with a higher Area Under the Curve (AUC) value (0.891) than SVM (0.870), EDTs (0.813), and the nomogram (0.874). The EDT algorithm achieved the highest accuracy (0.891), precision (0.800), and F1 score (0.615), as well as the lowest False Discovery Rate (FDR) (0.200) and the second-highest Negative Predictive Value (NPV) (0.902).
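The metrics quoted above are all functions of the same confusion matrix, and the FDR is simply the complement of precision (1 − 0.800 = 0.200). The sketch below shows the standard formulas. The counts passed to `classification_metrics` are hypothetical: they were chosen only because they reproduce the rounded EDT figures quoted above, and the abstract reports neither the confusion matrix nor the test-set size.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute common binary-classification metrics from confusion-matrix counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return {
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "precision": precision,
        "recall": recall,
        "f1": 2 * precision * recall / (precision + recall),
        "fdr": fp / (tp + fp),  # false discovery rate = 1 - precision
        "npv": tn / (tn + fn),  # negative predictive value
    }


# HYPOTHETICAL counts, not taken from the paper: selected so that the formulas
# reproduce the rounded EDT values reported in the abstract.
metrics = classification_metrics(tp=8, fp=2, fn=8, tn=74)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Under these assumed counts the recall would be 0.500, which illustrates how a model can combine high precision with a modest F1 score: when positives are a minority, missed cases pull the F1 score down even if few predicted positives are wrong.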

Conclusions

A predictive model for ARDS complicating AP was successfully developed based on ML. On the test set, BC showed superior predictive performance, and EDTs could be a more promising prediction tool for larger samples.

Keywords
Acute respiratory distress syndrome; Acute pancreatitis; Machine learning; Prediction model

Faculdade de Medicina / USP Rua Dr Ovídio Pires de Campos, 225 - 6 and., 05403-010 São Paulo SP - Brazil, Tel.: (55 11) 2661-6235 - São Paulo - SP - Brazil
E-mail: clinics@hc.fm.usp.br