Prediction of death status on the course of treatment in SARS-COV-2 patients with deep learning and machine learning methods

dc.authoridÇOLAK, CEMİL/0000-0001-5406-098X
dc.authoridGÜLDOĞAN, Emek/0000-0002-5436-8164
dc.authorwosidÇOLAK, CEMİL/ABI-3261-2020
dc.authorwosidGÜLDOĞAN, Emek/ABH-5460-2020
dc.contributor.authorKivrak, Mehmet
dc.contributor.authorGuldogan, Emek
dc.contributor.authorColak, Cemil
dc.date.accessioned2024-08-04T20:49:14Z
dc.date.available2024-08-04T20:49:14Z
dc.date.issued2021
dc.departmentİnönü Üniversitesien_US
dc.description.abstractBackground and Objective: The new type of Coronavirus (2019-nCov) epidemic spread rapidly, causing more than 250 thousand deaths worldwide. The virus, which first appeared as a sign of pneumonia, was later called the SARS-COV-2 with Severe Acute Respiratory Syndrome by the World Health Organization. The SARS-COV-2 virus is triggered by binding to the Angiotensin-Converting Enzyme 2 (ACE 2) inhibitor, which is vital in cardiovascular diseases and the immune system, especially in conditions such as cerebrovascular, hypertension, and diabetes. This study aims to evaluate the prediction performance of death status based on the demographic/clinical factors (including COVID-19 severity) by data mining methods. Methods: The dataset consists of 1603 SARS-COV-2 patients and 13 variables obtained from an open source web address. The current dataset contains age, gender, chronic disease (hypertension, diabetes, renal, cardiovascular, etc.), some enzymes (ACE, angiotensin II receptor blockers), and COVID-19 severity, which are used to predict death status using deep learning and machine learning approaches (random forest, k-nearest neighbor, extreme gradient boosting [XGBoost]). A grid search algorithm tunes hyperparameters of the models, and predictions are assessed through performance metrics. Steps of knowledge discovery in databases are applied to obtain the relevant information. Results: The accuracy rate of deep learning (97.15%) was more successful than the accuracy rate based on classical machine learning (92.15% for RF and 93.4% for k-NN), but the ensemble classifier XGBoost method gave the highest accuracy (99.7%). While COVID-19 severity and age calculated from XGBoost were the two most important factors associated with death status, the most determining variables for death status estimated from deep learning were COVID-19 severity and hypertension. Conclusions: The proposed model (XGBoost) achieved the best prediction of death status based on the factors as compared to the other algorithms. The results of this study can guide patients with certain variables to take early measures and access preventive health care services before they become infected with the virus. (c) 2021 Elsevier B.V. All rights reserved.en_US
dc.identifier.doi10.1016/j.cmpb.2021.105951
dc.identifier.issn0169-2607
dc.identifier.issn1872-7565
dc.identifier.pmid33513487en_US
dc.identifier.scopus2-s2.0-85099791734en_US
dc.identifier.scopusqualityQ1en_US
dc.identifier.urihttps://doi.org/10.1016/j.cmpb.2021.105951
dc.identifier.urihttps://hdl.handle.net/11616/99728
dc.identifier.volume201en_US
dc.identifier.wosWOS:000632520100013en_US
dc.identifier.wosqualityQ1en_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.indekslendigikaynakPubMeden_US
dc.language.isoenen_US
dc.publisherElsevier Ireland Ltden_US
dc.relation.ispartofComputer Methods and Programs in Biomedicineen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectSARS-COV-2en_US
dc.subjectData Miningen_US
dc.subjectDeep Learningen_US
dc.subjectExtreme Gradient Boostingen_US
dc.subjectMachine Learningen_US
dc.titlePrediction of death status on the course of treatment in SARS-COV-2 patients with deep learning and machine learning methodsen_US
dc.typeArticleen_US

Dosyalar