Pregled bibliografske jedinice broj: 1241699
Meta-Modeling Execution Times of RapidMiner operators
Meta-Modeling Execution Times of RapidMiner operators // Proceedings of the 3rd RapidMiner Community Meeting and Conference (RCOMM 2012) / Ficher, Simon ; Mierswa, Ingo (ur.).
Budimpešta, Mađarska, 2012. str. 159-168 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 1241699 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Meta-Modeling Execution Times of RapidMiner operators
Autori
Piškorec Matija ; Bošnjak Matko ; Šmuc Tomislav
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the 3rd RapidMiner Community Meeting and Conference (RCOMM 2012)
/ Ficher, Simon ; Mierswa, Ingo - , 2012, 159-168
Skup
3rd RapidMiner Community Meeting and Conference (RCOMM 2012)
Mjesto i datum
Budimpešta, Mađarska, 28.08.2012. - 31.08.2012
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
meta-mining ; data mining ; execution time estimation ; RapidMiner extension
Sažetak
Knowing the execution time of a computational model, especially when dealing with large data, is crucial in deciding whether the solution of the problem is attainable in acceptable time. In the case of data mining processes, typically both the time needed for model learning and model application could be of importance. We developed a meta-mining framework for execution time estimation of data mining algorithm built in RapidMiner. Operator execution time estimation is treated as a machine learning problem for which prediction models are built using execution times obtained by running algorithms on a set of predetermined datasets. With appropriate refitting this experimental methodology is applicable to any data mining environment. We present overall framework with modelling results for a subset of RapidMiner operators, and compare non-parametric distance measures based predictions with polynomial function fitting. Finally, integration of these models in the form of standalone RapidMiner extension is demonstrated and issues related to reliability, scalability and applicability for the overall workflow execution time modelling are discussed.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Ustanove:
Institut "Ruđer Bošković", Zagreb