Colak, CemilKucukakcali, ZeynepAkbulut, Sami2024-08-042024-08-0420232049-0801https://doi.org/10.1097/MS9.0000000000001210https://hdl.handle.net/11616/102402Background:Hepatocellular carcinoma (HCC) is the main cause of mortality from cancer globally. This paper intends to classify public gene expression data of patients with Hepatitis C virus-related HCC (HCV+HCC) and chronic HCV without HCC (HCV alone) through the XGboost approach and to identify key genes that may be responsible for HCC.Methods:The current research is a retrospective case-control study. Public data from 17 patients with HCV+HCC and 35 patients with HCV-alone samples were used in this study. An XGboost model was established for the classification by 10-fold cross-validation. Accuracy (AC), balanced accuracy (BAC), sensitivity, specificity, positive predictive value, negative predictive value, and F1 score were utilized for performance assessment.Results:AC, BAC, sensitivity, specificity, positive predictive value, negative predictive value, and F1 scores from the XGboost model were 98.1, 97.1, 100, 94.1, 97.2, 100, and 98.6%, respectively. According to the variable importance values from the XGboost, the HAO2, TOMM20, GPC3, and PSMB4 genes can be considered potential biomarkers for HCV-related HCC.Conclusion:A machine learning-based prediction method discovered genes that potentially serve as biomarkers for HCV-related HCC. After clinical confirmation of the acquired genes in the following medical study, their therapeutic use can be established. Additionally, more detailed clinical works are needed to substantiate the significant conclusions in the current study.eninfo:eu-repo/semantics/openAccessartificial Intelligencechronic liver diseasegenetic markershepatitis C infectionhepatocellular carcinomaArtificial intelligence-based prediction of molecular and genetic markers for hepatitis C-related hepatocellular carcinomaArticle8510467446823781106710.1097/MS9.0000000000001210WOS:001079998200006Q2