Pregled bibliografske jedinice broj: 557797
An Iterative Semi-supervised Approach to Software Fault Prediction
An Iterative Semi-supervised Approach to Software Fault Prediction // Proceedings of the 7th International Conference on Predictive Models in Software Engineering / Menzies, Tim (ur.).
New York (NY): The Association for Computing Machinery (ACM), 2011. str. 1-15 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 557797 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
An Iterative Semi-supervised Approach to Software Fault Prediction
Autori
Lu, Huihua ; Cukic, Bojan ; Culp, Mark
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the 7th International Conference on Predictive Models in Software Engineering
/ Menzies, Tim - New York (NY) : The Association for Computing Machinery (ACM), 2011, 1-15
ISBN
978-1-4503-0709-3
Skup
7th ACM International Conference on Predictive Models in Software Engineering
Mjesto i datum
Banff, Kanada, 20.09.2011. - 21.09.2011
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Metrics-complexity measures; performance
Sažetak
Background: Many statistical and machine learning techniques have been implemented to build predictive fault models. Traditional methods are based on supervised learning. Software metrics for a module and corresponding fault information, available from previous projects, are used to train a fault prediction model. This approach calls for a large size of training data set and enables the development of effective fault prediction models. In practice, data collection costs, the lack of data from earlier projects or product versions may make large fault prediction training data set unattainable. Small size of the training set that may be available from the current project is known to deteriorate the performance of the fault predictive model. In semi-supervised learning approaches, software modules with known or unknown fault content can be used for training. Aims: To implement and evaluate a semi-supervised learning approach in software fault prediction. Methods: We investigate an iterative semi-supervised approach to software quality prediction in which a base supervised learner is used within a semi-supervised application. Results: We varied the size of labeled software modules from 2% to 50% of all the modules in the project. After tracking the performance of each iteration in the semi-supervised algorithm, we observe that semi-supervised learning improves fault prediction if the number of initially labeled software modules exceeds 5%. Conclusion: The semi-supervised approach outperforms the corresponding supervised learning approach when both use random forest as base classification algorithm.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
165-0362980-2002 - Postupci raspoređivanja u samoodrživim raspodijeljenim računalnim sustavima (Martinović, Goran, MZO ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike, računarstva i informacijskih tehnologija Osijek
Profili:
Bojan Čukić
(autor)