Data source: CROSBI

Cervical Cancer Diagnostics Using Machine Learning Algorithms and Class Balancing Techniques (CROSBI ID 319258)

Journal contribution | original scientific paper | international peer review

Glučina, Matko ; Lorencin, Ariana ; Anđelić, Nikola ; Lorencin, Ivan. Cervical Cancer Diagnostics Using Machine Learning Algorithms and Class Balancing Techniques // Applied sciences (Basel), 13 (2023), 2; 1061, 25

Responsibility details

Glučina, Matko ; Lorencin, Ariana ; Anđelić, Nikola ; Lorencin, Ivan

English

Cervical Cancer Diagnostics Using Machine Learning Algorithms and Class Balancing Techniques

Objectives: Cervical cancer presents in most cases as squamous cell carcinoma and less often as adenocarcinoma, and it is in most cases the result of an infection with human papillomavirus (HPV). This type of cancer is the third most common cancer of the female reproductive organs. The risk groups for cervical cancer are mostly younger women who frequently change partners, begin sexual activity early, are infected with HPV, and are addicted to nicotine. In most cases, the cancer is asymptomatic until it has progressed to the later stages. Cervical cancer screening rates are low, especially in developing countries and in some minority groups. For these reasons, introducing a tentative cervical cancer screening based on a questionnaire could enable more diagnoses in the initial stages of the disease.
Methods: This research uses publicly available cervical cancer data collected from 859 female patients. Each sample consists of 36 input attributes and four different outputs: Hinselmann, Schiller, cytology, and biopsy. Because the data set is significantly imbalanced, class balancing techniques were used: the Synthetic Minority Oversampling Technique (SMOTE), the ADAptive SYNthetic algorithm (ADASYN), SMOTEEN, random oversampling, and SMOTETOMEK. To predict these target outputs, multiple artificial intelligence (AI) and machine learning (ML) classification algorithms were used, such as logistic regression, multilayer perceptron (MLP), support vector machine (SVM), K-nearest neighbors (KNN), and several naive Bayes methods.
Results: The highest performance was achieved when MLP and KNN were used in combination with random oversampling, SMOTEEN, and SMOTETOMEK. This approach resulted in a mean area under the receiver operating characteristic curve (AUC) and a mean Matthews correlation coefficient (MCC) higher than 0.95, regardless of which diagnostic method was used to construct the output vector.
Conclusions: The presented results suggest that AI and ML techniques can be used to develop a tentative cervical cancer screening method based on a questionnaire and an AI-based algorithm. Furthermore, class balancing techniques can provide a certain performance boost.
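The abstract names random oversampling as one of the class balancing techniques and MCC as one of the evaluation scores. The following is a minimal, standard-library-only sketch of both ideas, not the authors' implementation. The function names `random_oversample` and `matthews_corrcoef_binary` and the toy data are hypothetical and chosen only for illustration. In practice, resampling would normally be applied to the training split only, so that duplicated samples do not leak into the evaluation data.

```python
import math
import random
from collections import Counter


def random_oversample(X, y, seed=0):
    """Duplicate randomly chosen samples of each smaller class until
    every class has as many samples as the largest class."""
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, count in counts.items():
        # Candidate samples of this class to draw duplicates from.
        pool = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - count):
            X_out.append(rng.choice(pool))
            y_out.append(label)
    return X_out, y_out


def matthews_corrcoef_binary(y_true, y_pred):
    """Matthews correlation coefficient for labels in {0, 1}.
    Returns 0.0 when the denominator is zero (a common convention)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Hypothetical toy data: nine negative samples and one positive sample.
X_toy = [[i] for i in range(10)]
y_toy = [0] * 9 + [1]

X_bal, y_bal = random_oversample(X_toy, y_toy)
print(Counter(y_bal))  # class counts after balancing

# MCC for a hypothetical set of predictions.
print(matthews_corrcoef_binary([1, 1, 0, 0], [1, 0, 0, 0]))
```

MCC uses all four confusion-matrix cells, which is why it is often preferred over plain accuracy on imbalanced data such as the data set described above. The synthetic techniques listed in the abstract (SMOTE, ADASYN, and the combined over- and under-sampling variants) generate new interpolated samples instead of duplicating existing ones. The imbalanced-learn library offers implementations under the class names `SMOTE`, `ADASYN`, `SMOTEENN`, `SMOTETomek`, and `RandomOverSampler`.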

artificial intelligence ; cervical cancer ; class balancing techniques ; machine learning

Publication details

13 (2)

2023

1061

25

published

2076-3417

Cost of open-access publication

APC

Work associations

Public health and health care, Clinical medical sciences, Computing, Mechanical engineering, Basic technical sciences
