Napredna pretraga

Pregled bibliografske jedinice broj: 609184

Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models


Tanabe, Kazutoshi; Kurita, Takio; Nishida, Kenji; Lučić, Bono; Amić, Dragan; Suzuki, Takahiro
Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models // SAR and QSAR in environmental research, 24 (2013), 7; 565-580 (međunarodna recenzija, članak, znanstveni)


Naslov
Improvement of carcinogenicity prediction performances based on sensitivity analysis in variable selection of SVM models

Autori
Tanabe, Kazutoshi ; Kurita, Takio ; Nishida, Kenji ; Lučić, Bono ; Amić, Dragan ; Suzuki, Takahiro

Izvornik
SAR and QSAR in environmental research (1062-936X) 24 (2013), 7; 565-580

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
Carcinogenicity prediction; QSAR; SVM; sensitivity analysis; variable selection

Sažetak
A new sensitivity analysis (SA) method for variable selection in support vector machine (SVM) was proposed to improve the performance level of the QSAR model to predict carcinogenicity based on the correlation coefficient (CC) method used in our preceding study. The performances of both methods were also compared with that of the F-score (FS) method proposed by Chang and Lin. The 911 non-congeneric chemicals were classified into 20 mutually overlapping groups according to contained substructures, and a specific SVM model created on chemicals belonging to each group was optimized by searching the best set of SVM parameters while successively omitting descriptors of lower absolute values of sensitivity, correlation coefficient or F-score till the maximum predictive performance was obtained. The SA method improves the overall accuracy from 80% of CC and FS to 84%, which is considerably higher than those of existing models for predicting the carcinogenicity of non-congeneric chemicals. It selects the optimum sets of effective descriptors fewer than the correlation coefficient and F-score methods, and is not time-consuming and can be applied to a large set of initial descriptors. It is concluded that SA is superior as a variable selection method in SVM models.

Izvorni jezik
Engleski

Znanstvena područja
Kemija



POVEZANOST RADA


Projekt / tema
079-0000000-3211 - Odnos strukture i aktivnosti flavonoida (Dragan Amić, )
098-1770495-2919 - Razvoj metoda za modeliranje svojstava bioaktivnih molekula i proteina (Bono Lučić, )

Ustanove
Fakultet agrobiotehničkih znanosti Osijek,
Institut "Ruđer Bošković", Zagreb

Autor s matičnim brojem:
Bono Lučić, (184293)
Dragan Amić, (86325)

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus
  • MEDLINE