Classification of stroke with gradient boosting tree using smote-based oversampling method

dc.contributor.authorYağın, Fatma Hilal
dc.contributor.authorCicek, Ipek Balikci
dc.contributor.authorTunç, Zeynep
dc.date.accessioned2024-08-04T19:53:18Z
dc.date.available2024-08-04T19:53:18Z
dc.date.issued2021
dc.departmentİnönü Üniversitesien_US
dc.description.abstractThe aim of this study is to classify the disease with the gradient increasing tree classification method in an open access dataset containing data from patients with and without stroke disease. In addition, it is aimed to compare the results by balancing the data with the oversampling method Synthetic Minority Over-sampling Technique (SMOTE) which is one of the data balancing methods in the study. In this study, a dataset containing information about patients with and without stroke disease obtained from the address "https://www.kaggle.com/asaumya/healthcare-problem-prediction-stroke-patients" was used. In the study, SMOTE was used as the data balancing method, and the gradient boosting tree method was used in the modeling. The performance of the model was evaluated by Specificity, sensitivity, accuracy, positive predictive value and negative predictive values. Specificity, sensitivity, accuracy, positive predictive value and negative predictive values were obtained as 0.0887, 0.9772, 0.9339, 0.9544 and 0.1679, respectively, according to the modeling result using the gardient boosting tree method using the original version of the dataset. Specificity, sensitivity, accuracy, positive predictive value and negative predictive values were obtained as 0.0887, 0.9772, 0.9339, 0.9544 and 0.1679, respectively, according to the modeling result using the gardient boosting tree method using the SMOTE applied version of the dataset. When the results obtained from the study were examined, the modeling results made with the SMOTE applied dataset were obtained more consistently and realistically. As a result, it is suggested that researchers use dataset balancing methods to acquire more accurate results whenever they come across an unbalanced dataset problem.en_US
dc.identifier.doi10.5455/medscience.2021.09.322
dc.identifier.endpage1515en_US
dc.identifier.issn2147-0634
dc.identifier.issue4en_US
dc.identifier.startpage1510en_US
dc.identifier.trdizinid504840en_US
dc.identifier.urihttps://doi.org/10.5455/medscience.2021.09.322
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/504840
dc.identifier.urihttps://hdl.handle.net/11616/89666
dc.identifier.volume10en_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.relation.ispartofMedicine Scienceen_US
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.titleClassification of stroke with gradient boosting tree using smote-based oversampling methoden_US
dc.typeArticleen_US

Dosyalar