Napredna pretraga

Pregled bibliografske jedinice broj: 795836

A Systematic Data Collection Procedure for Software Defect Prediction


Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana
A Systematic Data Collection Procedure for Software Defect Prediction // Computer Science and Information Systems, 13 (2016), 1; 173-197 doi:10.2298/CSIS141228061M (međunarodna recenzija, članak, znanstveni)


Naslov
A Systematic Data Collection Procedure for Software Defect Prediction

Autori
Mauša, Goran ; Galinac Grbac, Tihana ; Dalbelo Bašić, Bojana

Izvornik
Computer Science and Information Systems (1820-0214) 13 (2016), 1; 173-197

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
software defect prediction ; data collection issues ; dataset bias ; bug-code linking ; open-source projects

Sažetak
Software defect prediction research relies on data that must be collected from otherwise separate repositories. To achieve greater generalization of the results, standardized protocols for data collection and validation are necessary. This paper presents an exhaustive survey of techniques and approaches used in the data collection process. It identifies some of the issues that must be addressed to minimize dataset bias and also provides a number of measures that can help researchers to compare their data collection approaches and evaluate their data quality. Moreover, we present a data collection procedure that uses a bug-code linking technique based on regular expression. The detailed comparison and root cause analysis of inconsistencies with a number of popular data collection approaches and their publicly available datasets, reveals that our procedure achieves the most favorable results. Finally, we implement our data collection procedure in a data collection tool we name the Bug-Code (BuCo) Analyzer.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo



POVEZANOST RADA


Projekt / tema
HRZZ-UIP-2014-09-7945 - Programski sustavi u evoluciji: analiza i inovativni pristupi pametnom upravljanju (Tihana Galinac Grbac, )

Ustanove
Fakultet elektrotehnike i računarstva, Zagreb,
Tehnički fakultet, Rijeka

Citiraj ovu publikaciju

Mauša, Goran; Galinac Grbac, Tihana; Dalbelo Bašić, Bojana
A Systematic Data Collection Procedure for Software Defect Prediction // Computer Science and Information Systems, 13 (2016), 1; 173-197 doi:10.2298/CSIS141228061M (međunarodna recenzija, članak, znanstveni)
Mauša, G., Galinac Grbac, T. & Dalbelo Bašić, B. (2016) A Systematic Data Collection Procedure for Software Defect Prediction. Computer Science and Information Systems, 13 (1), 173-197 doi:10.2298/CSIS141228061M.
@article{article, year = {2016}, pages = {173-197}, DOI = {10.2298/CSIS141228061M}, keywords = {software defect prediction, data collection issues, dataset bias, bug-code linking, open-source projects}, journal = {Computer Science and Information Systems}, doi = {10.2298/CSIS141228061M}, volume = {13}, number = {1}, issn = {1820-0214}, title = {A Systematic Data Collection Procedure for Software Defect Prediction}, keyword = {software defect prediction, data collection issues, dataset bias, bug-code linking, open-source projects} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Citati