Derin öğrenme ve topluluk öğrenme yöntemlerine dayalı bilgisayar destekli tanı sisteminin geliştirilmesi: Omik teknolojileri üzerine uygulaması

Arslan, Ahmet Kadir

DSpace Home
→
Tezler
→
Sağlık Bilimleri Enstitüsü
→
Sağlık Bilimleri Enstitüsü Doktora Tezleri
→
View Item

dc.contributor.author	Arslan, Ahmet Kadir
dc.date.accessioned	2021-10-27T06:44:30Z
dc.date.available	2021-10-27T06:44:30Z
dc.date.issued	2021
dc.identifier.citation	Arslan, A. K. (2021). Derin öğrenme ve topluluk öğrenme yöntemlerine dayalı bilgisayar destekli tanı sisteminin geliştirilmesi: Omik teknolojileri üzerine uygulaması. Yayınlanmış Doktora Tezi, İnönü Üniversitesi.	en_US
dc.identifier.uri	http://hdl.handle.net/11616/42496
dc.description	İnönü Üniversitesi / Sağlık Bilimleri Enstitüsü / Biyoistatistik ve Tıp Bilişimi Ana Bilim Dalı	en_US
dc.description.abstract	Amaç: Bu tez çalışmasında, kolorektal kanser hastalığına ilişkin açık erişimli deneysel metabolomik veri seti kullanılarak, çeşitli topluluk öğrenme ve derin öğrenme modelleri ile ilgili hastalığın sınıflandırılmasını sağlayabilecek bir ardışık kod sisteminin (pipeline) tasarlanması ve bu kapsamda yüksek çıktılı bir karar destek sisteminin geliştirilmesi amaçlanmıştır. Materyal ve Metot: Bu tez çalışmasında, Washington Üniversitesi, Anesteziyoloji ve Algoloji Bölümü, Northwest Metabolomik Araştırma Merkezi'nde yürütülen PR000226 numaralı proje kapsamında üretilen veri seti kullanılmıştır. İlgili veri seti, iki denek grubu, 66 KRK hastası ve 92 sağlıklı kontrol olmak üzere toplam 158 örnekten oluşmaktadır. Değişken seçim yöntemleri olarak LASSO, Elastic-Net, Boruta ve BorutaShap yöntemleri kullanılmıştır. Sınıflandırma görevinde ise XGBoost, LightGBM, derin sinir ağları ve yığın otokodlayıcı modelleri kullanılmıştır. Bulgular: Bulgular incelendiğinde, en zayıf sınıflandırma performansı gösteren modelin yığın otokodlayıcı olduğu görülmektedir. Modelin hiçbir değişken seçimi senaryosunda istenilen sınıflandırma performansını başarısını gösteremediği görülmektedir. LightGBM modeli hem eğitim hem de test veri setlerinin sınıflandırmasında, tüm performans ölçütleri açısından, en iyi sonuçları vermiştir. Ayrıca LightGBM modeli bu sınıflandırma başarımını tüm değişken seçim yöntemleri bazında elde etmiştir. Sonuç: Bu tez çalışmasında, topluluk öğrenme yöntemlerinin, tüm değişken seçim senaryolarında, derin öğrenme yöntemlerine kıyasla çok daha iyi sınıflandırma sonuçları verdiği görülmüştür. Söz konusu topluluk öğrenme yöntemlerinin, büyük veri setlerinde daha hızlı sonuç verebilmeleri için grafik işlem birimi (GPU) destekli sürümlerinin kullanılmasının işlem ve zaman maliyetleri açısından daha verimli olacağı önerilebilir.	en_US
dc.description.abstract	Aim: In this study, it is aimed to design a pipeline system, by using open access experimental metabolomics data set on colorectal cancer disease, which can classify the related disease by various ensemble learning and deep learning models and to develop a high-throughput decision support system. Material and Method: In this thesis, the data set produced within the scope of the project numbered PR000226 carried out at Washington University, Department of Anesthesiology and Algology, Northwest Metabolomics Research Center was used. The related data set consisted of a total of 158 samples, including two groups of subjects, 66 CRC patients and 92 healthy controls. As variable selection methods, LASSO, Elastic-Net, Boruta and BorutaShap methods were used. In the classification task, XGBoost, LightGBM, deep neural networks and stacked autoencoder models were utilized. Results: When the findings were examined, it was seen that the model with the worst classification performance is the stacked autoencoder. It was seen that the model cannot achieve the desired classification performance in any variable selection scenario. The LightGBM model gave the best results for all performance measures in the classification of both training and test datasets. In addition, the LightGBM model has achieved this classification performance on the basis of all variable selection methods. Conclusion: In this thesis study, it was seen that ensemble learning methods had much better classification results compared to deep learning methods in all variable selection scenarios. It can be suggested that the use of graphics processing unit (GPU) supported versions of these ensemble learning methods will be more efficient in terms of processing and time costs so that they can provide faster results in large data sets.	en_US
dc.language.iso	tur	en_US
dc.publisher	İnönü Üniversitesi Sağlık Bilimleri Enstitüsü	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Biyoistatistik	en_US
dc.subject	Biostatistics	en_US
dc.title	Derin öğrenme ve topluluk öğrenme yöntemlerine dayalı bilgisayar destekli tanı sisteminin geliştirilmesi: Omik teknolojileri üzerine uygulaması	en_US
dc.title.alternative	Development of computer aided diagnosis system based on deep learning and ensemble learning methods: Application on omics technologies	en_US
dc.type	doctoralThesis	en_US
dc.contributor.department	İnönü Üniversitesi	en_US
dc.identifier.endpage	89	en_US