E-Mail Classification Using Natural Language Processing
dc.authorid | sel, ilhami/0000-0003-0222-7017 | |
dc.authorid | Hanbay, Davut/0000-0003-2271-7865 | |
dc.authorwosid | sel, ilhami/ABD-7350-2020 | |
dc.authorwosid | Hanbay, Davut/AAG-8511-2019 | |
dc.contributor.author | Sel, Ilhami | |
dc.contributor.author | Hanbay, Davut | |
dc.date.accessioned | 2024-08-04T20:46:46Z | |
dc.date.available | 2024-08-04T20:46:46Z | |
dc.date.issued | 2019 | |
dc.department | İnönü Üniversitesi | en_US |
dc.description | 27th Signal Processing and Communications Applications Conference (SIU) -- APR 24-26, 2019 -- Sivas Cumhuriyet Univ, Sivas, TURKEY | en_US |
dc.description.abstract | Thanks to the rapid increase in technology and electronic communications, e-mail has become a serious communication tool. In many applications such as business correspondence, reminders, academic notices, web page memberships, e-mail is used as primary way of communication. If we ignore spam e-mails, there remain hundreds of e-mails received every day. In order to determine the importance of received e-mails, the subject or content of each e-mail must be checked. In this study we proposed an unsupervised system to classify received e-mails. Received e-mails' coordinates are determined by a method of natural language processing called as Word2Vec algorithm. According to the similarities, processed data are grouped by k-means algorithm with an unsupervised training model. In this study, 10517 e-mails were used in training. The success of the system is tested on a test group of 200 e-mails. In the test phase M3 model (window size 3, min. Word frequency 10, Gram skip) consolidated the highest success (91%). Obtained results are evaluated in section VI. | en_US |
dc.description.sponsorship | IEEE Turkey Sect,Turkcell,Turkhavacilik Uzaysanayii,Turitak Bilgem,Gebze Teknik Univ,SAP, Detaysoft,NETAS,Havelsan | en_US |
dc.identifier.doi | 10.1109/siu.2019.8806593 | |
dc.identifier.isbn | 978-1-7281-1904-5 | |
dc.identifier.issn | 2165-0608 | |
dc.identifier.scopus | 2-s2.0-85071993162 | en_US |
dc.identifier.scopusquality | N/A | en_US |
dc.identifier.uri | https://doi.org/10.1109/siu.2019.8806593 | |
dc.identifier.uri | https://hdl.handle.net/11616/98948 | |
dc.identifier.wos | WOS:000518994300229 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | tr | en_US |
dc.publisher | Ieee | en_US |
dc.relation.ispartof | 2019 27th Signal Processing and Communications Applications Conference (Siu) | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Natural Language Processing | en_US |
dc.subject | Text Classification | en_US |
dc.subject | Word2Vec | en_US |
dc.subject | Skip Gram | en_US |
dc.subject | K-means | en_US |
dc.title | E-Mail Classification Using Natural Language Processing | en_US |
dc.type | Conference Object | en_US |