Pregled bibliografske jedinice broj: 1193284
Coupled encoding methods for antimicrobial peptide prediction: How sensitive is a highly accurate model?
Coupled encoding methods for antimicrobial peptide prediction: How sensitive is a highly accurate model? // Artificial Intelligence in the Life Sciences, 2 (2022), 100034, 10 doi:10.1016/j.ailsci.2022.100034 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 1193284 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Coupled encoding methods for antimicrobial peptide
prediction: How sensitive is a highly accurate
model?
Autori
Erjavac, Ivan ; Kalafatovic, Daniela ; Mauša, Goran
Izvornik
Artificial Intelligence in the Life Sciences (2667-3185) 2
(2022);
100034, 10
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
Peptide encoding ; Machine learning ; Antimicrobial peptide prediction ; False positive ; Model sensitivity
Sažetak
Current application of machine learning in the process of antimicrobial peptide discovery call for the reduction of the false positive predictions that are produced by the classification models. Considering that the positive predictions of high confidence drive modern experimental design, the model’s sensitivity is crucial to reduce the number of unnecessary in vitro tests. Furthermore, taking into account the expert-based design approaches that employ random mutations on confirmed sequences, the machine learning models are required to distinguish between subtle differences among shuffled sequences. With the goal of reducing the false positive rate and improving sensitivity, we propose a hybrid approach to antimicrobial peptide prediction that utilizes combined encoding models. To this end, we implement models that employ both the physico- chemical features and sequence ordering information to stress the importance of using both representations. We also investigate the usage of binary encoding for peptide representation purposes, a method that is insufficiently represented in related research, which proved to act as a viable low dimensional alternative to the one-hot encoding. Our results, supported by Cochran and McNemar statistical tests and Spearman correlation analysis, indicate that the sequence- based encodings complement the physico-chemical features and their synergic effect yields improvement in terms of every evaluation metric. Finally, the proposed hybrid approach that combines physico-chemical features and binary encoding using logical conjunction was shown to be superior to other single models by a factor of 2.96 in terms of fall-out and up to 6.1% in terms of precision.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo, Biotehnologija u biomedicini (prirodno područje, biomedicina i zdravstvo, biotehničko područje)
POVEZANOST RADA
Projekti:
--UIP-2019-04-7999 - Dizajn katalitički aktivnih peptida i peptidnih nanostruktura (UIP-2019-04) (DeShPet) (Kalafatović, Daniela) ( CroRIS)
Ustanove:
Tehnički fakultet, Rijeka,
Sveučilište u Rijeci - Odjel za biotehnologiju
Citiraj ovu publikaciju:
Uključenost u ostale bibliografske baze podataka::
- Directory of Open Access Journals (DOAJ)