Machine learning-based forecasting of coronary artery disease risk factors and diagnostic insights

dc.contributor.authorDogan, Zekeriya
dc.contributor.authorCicek, İpek Balıkcı
dc.date.accessioned2024-08-04T19:42:44Z
dc.date.available2024-08-04T19:42:44Z
dc.date.issued2023
dc.departmentİnönü Üniversitesien_US
dc.description.abstractAim: As people’s quality of life and habits have changed, Coronary Artery Disease (CAD) has become the leading cause of death globally. It is a complicated cardiac disease with various risk factors and a wide range of symptoms. An early and accurate diagnosis of CAD allows for the quick administration of appropriate treatment, which contributes to a decreased mortality rate. Machine learning (ML) algorithms for CAD prediction and treatment decisions are quickly being developed and implemented in clinical practice. Predictive models based on machine learning algorithms may aid health personnel in the early diagnosis of CAD, lowering mortality. Thus, this study goal is to forecast the elements that may be connected with CAD using tree-based approaches, which are one of the machine learning methods, and to discover which factor is more effective on CAD. Materials and Methods: The open-access heart disease dataset was used within the scope of the study to investigate the risk factors related with CAD. The data set used contains the values of 333 patients, as well as 20 input and 1 target variables. The 10-fold cross validation approach was employed in the modeling, and the data set was divided as 80%: 20% as training and test datasets. For model assessment, the measures of accuracy (ACC), balanced accuracy (b-ACC), sensitivity (SE), specificity (SP), positive predictive value (ppv), negative predictive value (npv), and F1-score were utilized. Results: The values of ACC, b-ACC, SE, SP, ppv, npv, and F1-score performance metrics were 9 98.5%, 98.8%, 97.7%, 100%, 100%, 95.8% and 98.8%, respectively, as a consequence of the estimate model results created with the XGBoost approach, which has the best performance among tree-based models. When the groups with or without CAD were compared, a statistically significant difference was found in terms of the age variable. There is also a significant relationship between the active, lifestyle, ihd, dm, ecgpatt, qwave variables and the presence/absence of the CAD variable. When the variable significance values obtained as a result of modeling with the highest performing XGBoost are examined, it is seen that the variables that most associated with CAD are ekgpatt: normal, ekgpatt: ST-depression, ekgpatt: T-inversion, qwave: yes, age, bpdias, height, LDL, HR, IVSD: with LVH, bpsyDM. Conclusion: According to the performance criteria of the forecasting models used, CAD gave distinctively successful results in forecasting. By identifying risk factors associated with CAD, the proposed machine learning models can provide clinicians with practical, cost-effective and beneficial assistance in making accurate predictive decisions.en_US
dc.identifier.doi10.5455/annalsmedres.2023.08.204
dc.identifier.endpage1118en_US
dc.identifier.issn2636-7688
dc.identifier.issue9en_US
dc.identifier.startpage1112en_US
dc.identifier.trdizinid1205590en_US
dc.identifier.urihttps://doi.org/10.5455/annalsmedres.2023.08.204
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/1205590
dc.identifier.urihttps://hdl.handle.net/11616/88628
dc.identifier.volume30en_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.relation.ispartofAnnals of Medical Researchen_US
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.titleMachine learning-based forecasting of coronary artery disease risk factors and diagnostic insightsen_US
dc.typeArticleen_US

Dosyalar