A comparison of Apache Solr and Elasticsearch technologies in support of large-scale data analysis

dc.contributor.author: Deniz, Ayşenur
dc.contributor.author: Elömer, Muhammed Mehdi
dc.contributor.author: Aydın, Ahmet Arif
dc.date.accessioned: 2024-08-04T19:51:30Z
dc.date.available: 2024-08-04T19:51:30Z
dc.date.issued: 2023
dc.department: İnönü Üniversitesi [en_US]
dc.description.abstract: In the era of big data, data has never been more important because it contains hidden insights. Extracting usable information from enormous volumes of data is both necessary and challenging. Developers of data-intensive systems have consequently faced several challenges when performing data processing and analytics in a variety of domains. Full-text search is one of the most significant components of big data processing and analytics because it locates the required fragments of data within large volumes of data. Given the importance of the subject, this article begins with an examination of the characteristics, capabilities, and technical comparisons of full-text search technologies, followed by a systematic comparison of Apache Solr and Elasticsearch in terms of indexing times and queries on three separate datasets. According to our findings, under the default configuration, Apache Solr achieves better indexing times on three machines with different hardware specifications. Likewise, Apache Solr outperforms Elasticsearch in seven out of ten search queries. Based on these results, we recommend Apache Solr over Elasticsearch on computers with restricted hardware resources. In addition, this study provides researchers and developers of data-intensive systems with a complete comparison and suggestions for choosing the most effective full-text search engine for their task. [en_US]
dc.identifier.doi: 10.17714/gumusfenbil.1213317
dc.identifier.endpage: 404 [en_US]
dc.identifier.issn: 2146-538X
dc.identifier.issue: 2 [en_US]
dc.identifier.startpage: 386 [en_US]
dc.identifier.trdizinid: 1187343 [en_US]
dc.identifier.uri: https://doi.org/10.17714/gumusfenbil.1213317
dc.identifier.uri: https://search.trdizin.gov.tr/yayin/detay/1187343
dc.identifier.uri: https://hdl.handle.net/11616/89008
dc.identifier.volume: 13 [en_US]
dc.indekslendigikaynak: TR-Dizin [en_US]
dc.language.iso: en [en_US]
dc.relation.ispartof: Gümüşhane Üniversitesi Fen Bilimleri Dergisi [en_US]
dc.relation.publicationcategory: Article - National Peer-Reviewed Journal - Institutional Faculty Member [en_US]
dc.rights: info:eu-repo/semantics/openAccess [en_US]
dc.title: A comparison of Apache Solr and Elasticsearch technologies in support of large-scale data analysis [en_US]
dc.type: Article [en_US]