Data modelling for large-scale social media analytics: design challenges and lessons learned
dc.authorid | Aydin, Ahmet Arif/0000-0002-4124-7275 | |
dc.authorid | Aydin, Ahmet Arif/0000-0002-4124-7275 | |
dc.authorwosid | Aydin, Ahmet Arif/GON-5504-2022 | |
dc.authorwosid | Aydin, Ahmet Arif/K-6184-2019 | |
dc.contributor.author | Aydin, Ahmet Arif | |
dc.contributor.author | Anderson, Kenneth M. | |
dc.date.accessioned | 2024-08-04T20:49:06Z | |
dc.date.available | 2024-08-04T20:49:06Z | |
dc.date.issued | 2020 | |
dc.department | İnönü Üniversitesi | en_US |
dc.description.abstract | We live in a world of big data; organisations collect, store, and analyse large volumes of data for various purposes. The five V's of big data introduce new challenges for developers to handle when performing data processing and analysis. Indeed, data modelling is one of the most challenging and critical aspects of big data because it determines how data will be structured and stored; these decisions then impact how that data can be processed and analysed. In this paper, we report on designing a data model for storing and analysing Twitter data in support of crisis informatics. In this work, we leverage the data model provided by columnar NoSQL data stores to design column families that can efficiently index, sort, store and analyse large Twitter datasets. In particular, our column families are designed to achieve efficient batch data processing. We evaluate these claims and discuss our future work. | en_US |
dc.identifier.doi | 10.1504/IJDMMM.2020.111409 | |
dc.identifier.endpage | 414 | en_US |
dc.identifier.issn | 1759-1163 | |
dc.identifier.issn | 1759-1171 | |
dc.identifier.issue | 4 | en_US |
dc.identifier.scopus | 2-s2.0-85097132144 | en_US |
dc.identifier.scopusquality | Q4 | en_US |
dc.identifier.startpage | 386 | en_US |
dc.identifier.uri | https://doi.org/10.1504/IJDMMM.2020.111409 | |
dc.identifier.uri | https://hdl.handle.net/11616/99655 | |
dc.identifier.volume | 12 | en_US |
dc.identifier.wos | WOS:000596237800002 | en_US |
dc.identifier.wosquality | N/A | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | Inderscience Enterprises Ltd | en_US |
dc.relation.ispartof | International Journal of Data Mining Modelling and Management | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | data modelling | en_US |
dc.subject | social media analytics | en_US |
dc.subject | big data analytics | en_US |
dc.subject | NoSQL | en_US |
dc.title | Data modelling for large-scale social media analytics: design challenges and lessons learned | en_US |
dc.type | Article | en_US |