Pregled bibliografske jedinice broj: 1062863
REPD: Source code defect prediction as anomaly detection
REPD: Source code defect prediction as anomaly detection // Journal of systems and software, 168 (2020), 110641; 1-15 doi:10.1016/j.jss.2020.110641 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 1062863 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
REPD: Source code defect prediction as anomaly
detection
Autori
Afrić, Petar ; Šikić, Lucija ; Kurdija, Adrian Satja ; Šilić, Marin
Izvornik
Journal of systems and software (0164-1212) 168
(2020), 110641;
1-15
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
Defect prediction ; Anomaly detection ; REPD ; Program analysis
Sažetak
In this paper, we present a novel approach for within-project source code defect prediction. Since defect prediction datasets are typically imbalanced, and there are few defective examples, we treat defect prediction as anomaly detection. We present our Reconstruction Error Probability Distribution (REPD) model which can handle point and collective anomalies. We compare it on five different traditional code feature datasets against five models: Gaussian Naive Bayes, logistic regression, k-nearest-neighbors, decision tree, and Hybrid SMOTE-Ensemble. In addition, REPD is compared on 24 semantic features datasets against previously mentioned models. In order to compare the performance of competing models, we utilize F1-score measure. By using statistical means, we show that our model produces significantly better results, improving F1-score up to 7.12%. Additionally, REPD’s robustness to dataset imbalance is analyzed by creating defect undersampled and non-defect oversampled datasets.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
KK.01.1.1.01.0009 - Napredne metode i tehnologije u znanosti o podatcima i kooperativnim sustavima (EK )
KK.01.2.1.01.0111
HRZZ-IP-2018-01-6423 - Pouzdani kompozitni primjenski sustavi zasnovani na web uslugama (RELS) (Srbljić, Siniša, HRZZ ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Citiraj ovu publikaciju:
Časopis indeksira:
- Current Contents Connect (CCC)
- Web of Science Core Collection (WoSCC)
- Science Citation Index Expanded (SCI-EXP)
- SCI-EXP, SSCI i/ili A&HCI
- Scopus