Pregled bibliografske jedinice broj: 658439
Stability of Software Defect Prediction in Relation to Levels of Data Imbalance
Stability of Software Defect Prediction in Relation to Levels of Data Imbalance // Proceedings of SQAMIA 2013 / Budimac, Zoran (ur.).
Novi Sad, 2013. str. 1-10 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 658439 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Stability of Software Defect Prediction in Relation to Levels of Data Imbalance
Autori
Galinac Grbac, Tihana ; Mauša, Goran ; Dalbelo Bašić, Bojana
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of SQAMIA 2013
/ Budimac, Zoran - Novi Sad, 2013, 1-10
ISBN
978-86-7031-269-2
Skup
Second Workshop on Software Quality Analysis, Monitoring, Improvement and Applications
Mjesto i datum
Novi Sad, Srbija, 15.09.2013. - 17.09.2013
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Software Defect Prediction; Data Imbalance; Feature Selection; Stability
Sažetak
Software defect prediction is an important decision support activity in software quality assurance. Its goal is reducing verification costs by predicting the system modules that are more likely to contain defects, thus enabling more efficient allocation of resources in verification process. The problem is that there is no widely applicable well performing prediction method. The main reason is in the very nature of software datasets, their imbalance, complexity and properties dependent on the application domain. In this paper we suggest a research strategy for the study of the performance stability using different machine learning methods over different levels of imbalance for software defect prediction datasets. We also provide a preliminary case study on a dataset from the NASA MDP open repository using multivariate binary logistic regression and forward and backward feature selection. Results indicate that the performance becomes unstable around 80% of imbalance.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
069-0362214-1575 - Optimizacija i dizajn vremensko-frekvencijskih distribucija (Sučić, Viktor, MZOS ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb,
Tehnički fakultet, Rijeka