A Novel CAD Framework with Visual and Textual Interpretability: Multimodal Insights for Predicting Respiratory Diseases

Mukhlis, Raza; Saleem, Saied; Kwon, Hyunwook; Hussain, Jamil; Aydin, Ahmet Arif; Al-Antari, Mugahed A.

A Novel CAD Framework with Visual and Textual Interpretability: Multimodal Insights for Predicting Respiratory Diseases

dc.contributor.author	Mukhlis, Raza
dc.contributor.author	Saleem, Saied
dc.contributor.author	Kwon, Hyunwook
dc.contributor.author	Hussain, Jamil
dc.contributor.author	Aydin, Ahmet Arif
dc.contributor.author	Al-Antari, Mugahed A.
dc.date.accessioned	2026-04-04T13:18:59Z
dc.date.available	2026-04-04T13:18:59Z
dc.date.issued	2024
dc.department	İnönü Üniversitesi
dc.description	8th International Artificial Intelligence and Data Processing Symposium, IDAP 2024 -- 21 September 2024 through 22 September 2024 -- Malatya -- 203423
dc.description.abstract	Generating textual interpretability using recent advancements in large language models (LLMs) is crucial for enhancing the efficiency of comprehensive computer-aided diagnosis (CAD) systems. This improves transparency between medical staff, intelligent CAD systems, and end-users by creating a trustworthy and effective intermediate medical diagnosis environment. In this paper, an innovative explainable throughout CAD system is introduced, designed to predict diseases from Chest X-rays (CXR) in a comprehensive scenario. The primary goal is to undertake multiple tasks that reduce the burden on medical staff and enrich CAD outcomes, including classification, visual explanations (heatmaps), and textual report generation. The proposed CAD system is developed through eight key steps: Data Collection and Annotation, Data Preparation, Text Vectorizations (Indexing), Visual Encoder, RAG-Fusion, Structural Prompt, XAI LLmTextual Reasoning (LLM Model), and Final Output (LLM textual report, image classification, and heatmap localization). The AI-based CAD system is trained and evaluated using the public benchmark MIMIC-CXR dataset with 14 different classes. The classification performance achieved an overall accuracy of 70 %, precision of 70 %, and F1-score of 0.60 %, while for text report generation, the system obtained an average BERTScore precision of 0.83, RougeL 0.16, and a Meteor score of 0.28. These promising results suggest the potential for further improvement of the CAD system and its applicability to real-world medical tasks. © 2024 IEEE.
dc.description.sponsorship	National Research Foundation of Korea, NRF; MSIT, (RS-2022-00166402, RS- 2023-00256517); Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK, (123N325); Türkiye Bilimsel ve Teknolojik Araştırma Kurumu, TÜBİTAK
dc.identifier.doi	10.1109/IDAP64064.2024.10710824
dc.identifier.isbn	979-833153149-2
dc.identifier.scopus	2-s2.0-85207866263
dc.identifier.scopusquality	N/A
dc.identifier.uri	https://doi.org/10.1109/IDAP64064.2024.10710824
dc.identifier.uri	https://hdl.handle.net/11616/108050
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof	8th International Artificial Intelligence and Data Processing Symposium, IDAP 2024
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_Scopus_20250329
dc.subject	Comprehensive CAD system
dc.subject	Large language model (LLM)
dc.subject	Retrieval Augmented Generation (RAG)
dc.subject	Text embedding
dc.subject	visual and textual interpretability
dc.title	A Novel CAD Framework with Visual and Textual Interpretability: Multimodal Insights for Predicting Respiratory Diseases
dc.type	Conference Object

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

A Novel CAD Framework with Visual and Textual Interpretability: Multimodal Insights for Predicting Respiratory Diseases

Dosyalar

Koleksiyon