Prediction of breast cancer subtypes based on proteomic data with deep learning

dc.contributor.authorYaşar, Şeyma
dc.contributor.authorÇolak, Cemil
dc.contributor.authorYoloğlu, Saim
dc.date.accessioned2024-08-04T19:42:38Z
dc.date.available2024-08-04T19:42:38Z
dc.date.issued2020
dc.departmentİnönü Üniversitesien_US
dc.description.abstractAim: Although new advances in diagnosis and treatment have increased, breast cancer is still an important cause of morbidity and mortality today. Proteomics, which collectively deals with relevant information about proteins, is one of the important areas of study that has been emphasized recently. It is a machine learning class that uses many layers of nonlinear processing units for deep learning, feature extraction and conversion. The aim of this study is to classify the molecular subtypes (Basal-like, human epidermal growth factor receptor 2 (HER2)-enriched, Luminal A, Luminal B) of breast cancer with the deep learning algorithm designed by using proteomic data.Material and Methods: The data set used in this study consists of published Isobaric tags for relative and absolute quantitation (iTRAQ) proteome profiling of 77 breast cancer samples by the Clinical Proteomic Tumor Analysis Consortium (NCI/NIH). The missing values in the data were completed with the mean substitution method. “Lasso Regression Model” was used in the selection of variables and after repeating 100 times with 10 times cross-validation method. Finally, the deep learning algorithm has been used to classify the molecular subtypes of breast cancer.Results: The overall accuracy rate of the proposed model in classifying breast cancer are found to be 91.53%. The performance of this model for classifying molecular subtypes of breast cancer was calculated as accuracy %96.43, F-score %93.33, MCC %91.29, G-mean %93.54 for Basal-like, accuracy %94.74, F-score %84.21, MCC %81.23, G-mean %92.30 for HER2-enriched, accuracy %98.18, F-score %96.97, MCC %95.76, G-mean %98.71 for Luminal A and accuracy 93.10%, F-score 88.89%, MCC 83.89%, G-mean 91.89% for Luminal B, respectively.Conclusion: The model designed using the deep learning algorithm has been found to perform quite well in classifying the molecular subtypes of breast cancer. In further studies, different deep learning architectures can be used to classify the molecular subtypes of breast cancer with higher accuracy.en_US
dc.identifier.doi10.5455/annalsmedres.2020.02.165
dc.identifier.endpage2806en_US
dc.identifier.issn2636-7688
dc.identifier.issue10en_US
dc.identifier.startpage2803en_US
dc.identifier.trdizinid414171en_US
dc.identifier.urihttps://doi.org/10.5455/annalsmedres.2020.02.165
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/414171
dc.identifier.urihttps://hdl.handle.net/11616/88526
dc.identifier.volume27en_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.relation.ispartofAnnals of Medical Researchen_US
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.titlePrediction of breast cancer subtypes based on proteomic data with deep learningen_US
dc.typeArticleen_US

Dosyalar