A distributed geospatial publish/subscribe system on Apache Spark

Livaja, Ivan; Pripužić, Krešimir; Sovilj, Siniša; Vuković, Marin

izvor podataka: crosbi !

A distributed geospatial publish/subscribe system on Apache Spark (CROSBI ID 306381)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Livaja, Ivan ; Pripužić, Krešimir ; Sovilj, Siniša ; Vuković, Marin A distributed geospatial publish/subscribe system on Apache Spark // Future generation computer systems, 132 (2022), 282-298. doi: 10.1016/j.future.2022.02.013

Podaci o odgovornosti

Autori

Livaja, Ivan ; Pripužić, Krešimir ; Sovilj, Siniša ; Vuković, Marin

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

A distributed geospatial publish/subscribe system on Apache Spark

Sažetak

Publish/subscribe is a messaging pattern where message producers, called publishers, publish messages which they want to be distributed to message consumers, called subscribers. Subscribers are required to subscribe to messages of interest in advance to be able to receive them upon the publishing. In this paper, we discuss a special type of publish/subscribe systems, namely geospatial publish/subscribe systems (GeoPS systems), in which both published messages (i.e., publications) and subscriptions include a geospatial object. Such an object is used to express both the location information of a publication and the location of interest of a subscription. We argue that there is great potential for using GeoPS systems for the Internet of Things and Sensor Web applications. However, existing GeoPS systems are not applicable for this purpose since they are centralized and cannot cope with multiple highly frequent incoming geospatial data streams containing publications. To overcome this limitation, we present a distributed GeoPS system in the cluster which efficiently matches incoming publications in real-time with a set of stored subscriptions. Additionally, we propose four different (distributed) replication and partitioning strategies for managing subscriptions in our distributed GeoPS system. Finally, we present results of an extensive experimental evaluation in which we compare the throughput, latency and memory consumption of these strategies. These results clearly show that they are both efficient and scalable to larger clusters. The comparison with centralized state- of-the-art approaches shows that the additional processing overhead of our distributed strategies introduced by the Apache Spark is almost negligible.

Ključne riječi

Geospatial data ; Partitioning ; Data replication ; Big data ; Data stream processing

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o izdanju

Časopis

Future generation computer systems

Volumen (broj)

132

Godina

2022.

Stranice rada

282-298

Status objave rada

objavljeno

ISSN

0167-739X

DOI

10.1016/j.future.2022.02.013

Povezanost rada

Povezane osobe

Ivan Livaja (autor/i)

Krešimir Pripužić (autor/i)

Siniša Sovilj (autor/i)

Marin Vuković (autor/i)

Povezane ustanove

Fakultet elektrotehnike i računarstva (036) (autorova ustanova)

Sveučilište Jurja Dobrile u Puli (303) (autorova ustanova)

Veleučilište u Šibeniku (294) (autorova ustanova)

Povezani projekti

Učinkovita stvarnovremenska obrada brzih geoprostornih podataka (rezultat rada na projektu)

Područje

Elektrotehnika, Računarstvo

Poveznice

doi.org

sciencedirect.com

Indeksiranost

Scopus

Current Contents Connect (CCC)

Web of Science Core Collection, Science Citation Index Expanded (WoSCC-SCI-Exp)

Web of Science Core Collection, SCI-Exp, SSCI & A&HCI (WoSCC-SCI-Exp, SSCI, A&HCI)