Pilot-Study to Explore Metabolic Signature of Type 2 Diabetes: A Pipeline of Tree-Based Machine Learning and Bioinformatics Techniques for Biomarkers Discovery
dc.authorid | AL-Hashem, Fahaid/0000-0001-5795-9966 | |
dc.authorid | AHMAD, IRSHAD/0000-0003-0077-3065 | |
dc.authorid | Yagin, Fatma Hilal/0000-0002-9848-7958 | |
dc.authorid | Ahmad, Fuzail/0000-0002-1189-2206 | |
dc.authorid | Alkhateeb, Abedalrhman/0000-0002-1751-7570 | |
dc.authorid | Ahmad, Irshad/0000-0002-6012-9207 | |
dc.authorwosid | AL-Hashem, Fahaid/GPS-8057-2022 | |
dc.authorwosid | AHMAD, IRSHAD/R-4469-2018 | |
dc.authorwosid | Yagin, Fatma Hilal/ABI-8066-2020 | |
dc.contributor.author | Yagin, Fatma Hilal | |
dc.contributor.author | Al-Hashem, Fahaid | |
dc.contributor.author | Ahmad, Irshad | |
dc.contributor.author | Ahmad, Fuzail | |
dc.contributor.author | Alkhateeb, Abedalrhman | |
dc.date.accessioned | 2024-08-04T20:56:04Z | |
dc.date.available | 2024-08-04T20:56:04Z | |
dc.date.issued | 2024 | |
dc.department | İnönü Üniversitesi | en_US |
dc.description.abstract | Background: This study aims to identify unique metabolomics biomarkers associated with Type 2 Diabetes (T2D) and develop an accurate diagnostics model using tree-based machine learning (ML) algorithms integrated with bioinformatics techniques. Methods: Univariate and multivariate analyses such as fold change, a receiver operating characteristic curve (ROC), and Partial Least-Squares Discriminant Analysis (PLS-DA) were used to identify biomarker metabolites that showed significant concentration in T2D patients. Three tree-based algorithms [eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Adaptive Boosting (AdaBoost)] that demonstrated robustness in high-dimensional data analysis were used to create a diagnostic model for T2D. Results: As a result of the biomarker discovery process validated with three different approaches, Pyruvate, D-Rhamnose, AMP, pipecolate, Tetradecenoic acid, Tetradecanoic acid, Dodecanediothioic acid, Prostaglandin E3/D3 (isobars), ADP and Hexadecenoic acid were determined as potential biomarkers for T2D. Our results showed that the XGBoost model [accuracy = 0.831, F1-score = 0.845, sensitivity = 0.882, specificity = 0.774, positive predictive value (PPV) = 0.811, negative-PV (NPV) = 0.857 and Area under the ROC curve (AUC) = 0.887] had the slight highest performance measures. Conclusions: ML integrated with bioinformatics techniques offers accurate and positive T2D candidate biomarker discovery. The XGBoost model can successfully distinguish T2D based on metabolites. | en_US |
dc.description.sponsorship | Deanship of Scientific Research, King Khalid University, Kingdom of Saudi Arabia | en_US |
dc.description.sponsorship | No Statement Available | en_US |
dc.identifier.doi | 10.3390/nu16101537 | |
dc.identifier.issn | 2072-6643 | |
dc.identifier.issue | 10 | en_US |
dc.identifier.pmid | 38794775 | en_US |
dc.identifier.scopus | 2-s2.0-85194219122 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.uri | https://doi.org/10.3390/nu16101537 | |
dc.identifier.uri | https://hdl.handle.net/11616/102017 | |
dc.identifier.volume | 16 | en_US |
dc.identifier.wos | WOS:001231595400001 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.indekslendigikaynak | PubMed | en_US |
dc.language.iso | en | en_US |
dc.publisher | Mdpi | en_US |
dc.relation.ispartof | Nutrients | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | type 2 diabetes | en_US |
dc.subject | biomarker discovery | en_US |
dc.subject | metabolomics | en_US |
dc.subject | machine learning | en_US |
dc.subject | bioinformatics | en_US |
dc.title | Pilot-Study to Explore Metabolic Signature of Type 2 Diabetes: A Pipeline of Tree-Based Machine Learning and Bioinformatics Techniques for Biomarkers Discovery | en_US |
dc.type | Article | en_US |