Diagnostic Accuracy and Agreement Between AI and Clinicians in Orthodontic 3D Model Analysis

Bor, Sabahattin; Oguz, Firat; Khanmohammadi, Ayla

Diagnostic Accuracy and Agreement Between AI and Clinicians in Orthodontic 3D Model Analysis

dc.contributor.author	Bor, Sabahattin
dc.contributor.author	Oguz, Firat
dc.contributor.author	Khanmohammadi, Ayla
dc.date.accessioned	2026-04-04T13:31:13Z
dc.date.available	2026-04-04T13:31:13Z
dc.date.issued	2025
dc.department	İnönü Üniversitesi
dc.description.abstract	Background: Artificial intelligence (AI) is increasingly integrated into orthodontic workflows, including digital model analysis modules embedded in orthodontic software. While these systems offer efficiency and automation, the accuracy and clinical reliability of AI-generated measurements and diagnostic assessments remain unclear. Therefore, to use AI systems safely and effectively in clinical orthodontics, it is important to check their results by comparing them with those of experienced orthodontists. Methods: Digital models of 48 patients were analyzed by the Orthodontist group and two AI platforms: Titan (full) and SoftSmile (Bolton only). Three orthodontists independently measured all variables using 3Shape OrthoAnalyzer, and group means were used for comparison. A subset of models was reanalyzed after two weeks to assess consistency. Data distribution was evaluated, and appropriate statistical tests were applied. Reliability was assessed using intraclass correlation coefficients (ICC) and Cohen's kappa. Results: Almost perfect agreement was observed between the orthodontists and Titan AI in molar classification (kappa = 0.955 right, kappa = 0.900 left; p < 0.001), with perfect agreement reported across all groups-including between the orthodontists themselves-for Angle classification (kappa = 1.00). In anterior and overall Bolton analyses, no meaningful agreement was found between the orthodontists and AI platforms. However, in a subset of patients where all three methods identified the tooth size discrepancy in the same arch (either maxilla or mandible), no significant differences were found in anterior (p = 0.226) or overall Bolton values (p = 0.795). Overjet, overbite, and space analysis values showed significant differences between the orthodontist and Titan groups (p < 0.001). ICC analysis indicated good to excellent intra- and inter-rater reliability within the orthodontist group (>= 0.77), while both AI systems demonstrated excellent internal consistency, with ICC values exceeding 0.95. Conclusions: AI-based platforms showed high agreement with orthodontists only in Angle classification. While their performance in Bolton analysis was limited, significant differences were observed in other linear measurements, indicating the need for further refinement before clinical use.
dc.identifier.doi	10.3390/app15147786
dc.identifier.issn	2076-3417
dc.identifier.issue	14
dc.identifier.orcid	0000-0001-6040-3790
dc.identifier.orcid	0000-0001-5463-0057
dc.identifier.scopus	2-s2.0-105011873216
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.3390/app15147786
dc.identifier.uri	https://hdl.handle.net/11616/108659
dc.identifier.volume	15
dc.identifier.wos	WOS:001550981200001
dc.identifier.wosquality	Q2
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Mdpi
dc.relation.ispartof	Applied Sciences-Basel
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/openAccess
dc.snmz	KA_WOS_20250329
dc.subject	digital orthodontics
dc.subject	model analysis
dc.subject	AI-based diagnosis
dc.subject	titan dental design
dc.subject	SoftSmile
dc.title	Diagnostic Accuracy and Agreement Between AI and Clinicians in Orthodontic 3D Model Analysis
dc.type	Article

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Diagnostic Accuracy and Agreement Between AI and Clinicians in Orthodontic 3D Model Analysis

Dosyalar

Koleksiyon