Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts

dc.contributor.authorAlnahas, Dima
dc.contributor.authorAteş, Abdullah
dc.contributor.authorAydın, Ahmet Arif
dc.contributor.authorAlagöz, Barış Baykant
dc.date.accessioned2024-08-04T19:54:38Z
dc.date.available2024-08-04T19:54:38Z
dc.date.issued2023
dc.departmentİnönü Üniversitesien_US
dc.description.abstractRelation graphs provide useful tools for structural and relational analyses of highly complex multi-component systems. Probabilistic relation graph models can represent relations between system components by their probabilistic links. These graph types have been widely used for the graphical representation of Markov models and bigram probabilities. This study presents an implication of relational similarities within probabilistic graph models of textual entries. The article discusses several utilization examples of two fundamental similarity measures in the probabilistic analysis of short texts. To this end, the construction of probabilistic graph models by using bigram probability matrices of textual entries is illustrated, and vector spaces of input word-vectors and output word-vectors are formed. In this vector space, the utilization of cosine similarity and mean squared error measures are demonstrated to evaluate the probabilistic relational similarity between lexeme pairs in short texts. By using probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express the reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results of example texts. The performance of the applied similarity measures is evaluated in comparison to the similarity index of the word2vec language model. Results of the comparative study in one of the illustrative examples reveal that synonyms with 0.18157 word2vec similarity value scored 1.0 cosine similarity value according to the proposed method.en_US
dc.identifier.doi10.47000/tjmcs.1240729
dc.identifier.endpage354en_US
dc.identifier.issn2148-1830
dc.identifier.issue2en_US
dc.identifier.startpage334en_US
dc.identifier.trdizinid1218710en_US
dc.identifier.urihttps://doi.org/10.47000/tjmcs.1240729
dc.identifier.urihttps://search.trdizin.gov.tr/yayin/detay/1218710
dc.identifier.urihttps://hdl.handle.net/11616/89994
dc.identifier.volume15en_US
dc.indekslendigikaynakTR-Dizinen_US
dc.language.isoenen_US
dc.relation.ispartofTurkish Journal of Mathematics and Computer Scienceen_US
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.titleRevisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Textsen_US
dc.typeArticleen_US

Dosyalar