Extractive multi-document text summarization based on graph independent sets

Uckan, Taner; Karci, Ali

Extractive multi-document text summarization based on graph independent sets

dc.authorid	uckan, Taner/0000-0001-5385-6775
dc.authorid	Karci, Ali/0000-0002-8489-8617
dc.authorwosid	uckan, Taner/IZP-9705-2023
dc.authorwosid	Karci, Ali/AAG-5337-2019
dc.contributor.author	Uckan, Taner
dc.contributor.author	Karci, Ali
dc.date.accessioned	2024-08-04T20:47:03Z
dc.date.available	2024-08-04T20:47:03Z
dc.date.issued	2020
dc.department	İnönü Üniversitesi	en_US
dc.description.abstract	We propose a novel methodology for extractive, generic summarization of text documents. The Maximum Independent Set, which has not been used previously in any summarization study, has been utilized within the context of this study. In addition, a text processing tool, which we named KUSH, is suggested in order to preserve the semantic cohesion between sentences in the representation stage of introductory texts. Our anticipation was that the set of sentences corresponding to the nodes in the independent set should be excluded from the summary. Based on this anticipation, the nodes forming the Independent Set on the graphs are identified and removed from the graph. Thus, prior to quantification of the effect of the nodes on the global graph, a limitation is applied on the documents to be summarized. This limitation prevents repetition of word groups to be included in the summary. Performance of the proposed approach on the Document Understanding Conference (DUC-2002 and DUC-2004) datasets was calculated using ROUGE evaluation metrics. The developed model achieved a 0.38072 ROUGE performance value for 100-word summaries, 0.51954 for 200-word summaries, and 0.59208 for 400-word summaries. The values reported throughout the experimental processes of the study reveal the contribution of this innovative method. (C) 2019 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Artificial Intelligence, Cairo University.	en_US
dc.identifier.doi	10.1016/j.eij.2019.12.002
dc.identifier.endpage	157	en_US
dc.identifier.issn	1110-8665
dc.identifier.issn	2090-4754
dc.identifier.issue	3	en_US
dc.identifier.scopus	2-s2.0-85077387625	en_US
dc.identifier.scopusquality	Q1	en_US
dc.identifier.startpage	145	en_US
dc.identifier.uri	https://doi.org/10.1016/j.eij.2019.12.002
dc.identifier.uri	https://hdl.handle.net/11616/99130
dc.identifier.volume	21	en_US
dc.identifier.wos	WOS:000573603100003	en_US
dc.identifier.wosquality	Q2	en_US
dc.indekslendigikaynak	Web of Science	en_US
dc.indekslendigikaynak	Scopus	en_US
dc.language.iso	en	en_US
dc.publisher	Cairo Univ, Fac Computers & Information	en_US
dc.relation.ispartof	Egyptian Informatics Journal	en_US
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Graph independent set	en_US
dc.subject	Graph-based document summarization	en_US
dc.subject	Generic document summarization	en_US
dc.subject	Extractive text summarization	en_US
dc.subject	Multi document text summarization	en_US
dc.title	Extractive multi-document text summarization based on graph independent sets	en_US
dc.type	Article	en_US

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Extractive multi-document text summarization based on graph independent sets

Dosyalar

Koleksiyon