Napredna pretraga

## Parallel mining of uncertain data using segmentation of data set area and Voronoi diagrams

Lukić, Ivica; Hocenski, Željko; Köhler, Mirko; Galba, Tomislav
Parallel mining of uncertain data using segmentation of data set area and Voronoi diagrams // Automatika : časopis za automatiku, mjerenje, elektroniku, računarstvo i komunikacije, 59 (2018), 3-4; 349-356 doi:10.1080/00051144.2018.1541645 (međunarodna recenzija, članak, znanstveni)

Naslov
Parallel mining of uncertain data using segmentation of data set area and Voronoi diagrams

Autori
Lukić, Ivica ; Hocenski, Željko ; Köhler, Mirko ; Galba, Tomislav

Izvornik
Automatika : časopis za automatiku, mjerenje, elektroniku, računarstvo i komunikacije (0005-1144) 59 (2018), 3-4; 349-356

Ključne riječi
Clustering algorithms ; data mining ; data uncertainty ; Euclidean distance ; parallel algorithms

Sažetak
Clustering of uncertain objects in large uncertain databases and problem of mining uncertain data has been well studied. In this paper, clustering of uncertain objects with location uncertainty is studied. Moving objects, like mobile devices, report their locations periodically, thus their locations are uncertain and best described by a probability density function. The number of objects in a database can be large which makes the process of mining accurate data, a challenging and time consuming task. Authors will give an overview of existing clustering methods and present a new approach for data mining and parallel computing of clustering problems. All existing methods use pruning to avoid expected distance calculations. It is required to calculate the expected distance numerical integration, which is time- consuming. Therefore, a new method, called Segmentation of Data Set Area- Parallel, is proposed. In this method, a data set area is divided into many small segments. Only clusters and objects in that segment are observed. The number of segments is calculated using the number and location of clusters. The use of segments gives the possibility of parallel computing, because segments are mutually independent. Thus, each segment can be computed on multiple cores.

Izvorni jezik
Engleski

#### Časopis indeksira:

• Web of Science Core Collection (WoSCC)
• Science Citation Index Expanded (SCI-EXP)
• SCI-EXP, SSCI i/ili A&HCI
• Scopus