TwoClsBalancer: An Interactive Web Application for Handling the Class Imbalance Problem Based on Machine Learning

dc.contributor.authorArslan, Ahmet Kadir
dc.contributor.authorÇolak, Cemil
dc.contributor.authorÇolak, Mehmet Cengiz
dc.date.accessioned2024-08-04T19:54:44Z
dc.date.available2024-08-04T19:54:44Z
dc.date.issued2022
dc.departmentİnönü Üniversitesien_US
dc.description.abstractObjective: The main purpose of this research is to develop a novel user-friendly web tool based on machine learning approaches, which applies a variety of techniques to address the class imbalance problem. Material and Methods: Shiny, an open- source R package, was used to develop the proposed web tool. The interactive tool can handle the class imbalance problem for binary classification dataset(s) by implementing sampling-based methods. As a clinical application, the dataset retrospectively obtained from the database of the Cardiovascular Surgery Department of Turgut Özal Medical Center, İnönü University, Malatya, Türkiye was used in this web-based software. To overcome the class imbalance prob- lem, sampling-based methods were implemented on the original dataset. After this process, the classification of hypertension in patients with coronary artery disease was achieved by three clas- sification models. Results: According to the outputs of the devel- oped web application, the best classification performance was obtained by the support vector machines with radial basis func- tion kernel (SVM-RBF) model after applying the density-based synthetic minority over-sampling technique oversampling meth- od. The accuracy, sensitivity, specificity, precision, f-measure, and g-mean metrics of the relevant model were calculated as 0.99, 0.99, 0.99, 0.95, 0.97, and 0.97, respectively. Conclusion: The oversampling methods used in this research indicated a more positive contribution to the classification performance of the models as compared to the undersampling methods. When the undersampling methods were applied, the three classification models did not demonstrate successful classification perfor- mance, whereas the SVM-RBF model outperformed the other two models when the oversampling methods were implemented. The designed interactive web application is freely accessible through http://biostatapps.inonu.edu.tr/twoclsbalancer.en_US
dc.identifier.doi10.5336/biostatic.2022-88932
dc.identifier.endpage179en_US
dc.identifier.issn1308-7894
dc.identifier.issn2146-8877
dc.identifier.issue3en_US
dc.identifier.startpage168en_US
dc.identifier.trdizinid1170268en_US
dc.identifier.urihttps://doi.org/10.5336/biostatic.2022-88932
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/1170268
dc.identifier.urihttps://hdl.handle.net/11616/90092
dc.identifier.volume14en_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.relation.ispartofTürkiye Klinikleri Biyoistatistik Dergisien_US
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.titleTwoClsBalancer: An Interactive Web Application for Handling the Class Imbalance Problem Based on Machine Learningen_US
dc.typeArticleen_US

Dosyalar