Data source: CROSBI

Redescription mining augmented with random forest of multi-target predictive clustering trees (CROSBI ID 251432)

Journal article | original scientific paper | international peer review

Mihelčić, Matej ; Džeroski, Sašo ; Lavrač, Nada ; Šmuc, Tomislav Redescription mining augmented with random forest of multi-target predictive clustering trees // Journal of intelligent information systems, 50 (2018), 1; 63-96. doi: 10.1007/s10844-017-0448-5

Responsibility details

Mihelčić, Matej ; Džeroski, Sašo ; Lavrač, Nada ; Šmuc, Tomislav

English

Redescription mining augmented with random forest of multi-target predictive clustering trees

In this work, we present a redescription mining algorithm that uses a Random Forest of Predictive Clustering Trees (RFPCTs) to generate and iteratively improve a set of redescriptions. The approach uses information about element membership in different queries, generated from a single constructed PCT, to explore the redescription space, while queries obtained from the Random Forest of PCTs increase candidate diversity. The approach is able to produce highly accurate, statistically significant redescriptions described by Boolean, nominal or numerical attributes. As opposed to current tree-based approaches that use multi-class or binary classification, we explore the benefits of using multi-label classification and multi-target regression to create redescriptions. A major benefit of the approach, compared to other state-of-the-art solutions, is that it does not require specifying a minimal threshold on redescription accuracy to obtain a highly accurate, optimized set of redescriptions. The process of Random Forest-based augmentation and the different modes of redescription set creation are evaluated on three datasets with different properties. We use the same datasets to compare the performance of our algorithm to state-of-the-art redescription mining approaches.

Knowledge discovery ; Redescription mining ; Random forest ; Predictive clustering trees ; World countries ; Computer science bibliography ; Bioclimatic niches


Publication details

50 (1)

2018.

63-96

published

0925-9902

10.1007/s10844-017-0448-5

Related fields

Computing
