Bibliographic record number: 1206204
Sequential Properties Representation Scheme for Recurrent Neural Network-Based Prediction of Therapeutic Peptides
Sequential Properties Representation Scheme for Recurrent Neural Network-Based Prediction of Therapeutic Peptides // Journal of chemical information and modeling, 62 (2022), 12; 2961-2972 doi:10.1021/acs.jcim.2c00526 (international peer review, article, scientific)
CROSBI ID: 1206204
Title
Sequential Properties Representation Scheme for Recurrent Neural Network-Based Prediction of Therapeutic Peptides
Authors
Otović, Erik ; Njirjak, Marko ; Kalafatovic, Daniela ; Mauša, Goran
Source
Journal of chemical information and modeling (1549-9596) 62 (2022), 12; 2961-2972
Type, subtype, and category of work
Journal papers, article, scientific
Keywords
machine learning ; peptide activity prediction ; peptide representation ; sequential properties
Abstract
The discovery of therapeutic peptides is often accelerated by means of virtual screening supported by machine learning-based predictive models. The predictive performance of such models is sensitive to the choice of data and its representation scheme. While the peptide physicochemical and compositional representations fail to distinguish sequence permutations, the amino acid arrangement within the sequence lacks the important information contained in physicochemical, conformational, topological, and geometrical properties. In this paper, we propose a solution to the identified information gap by implementing a hybrid scheme that complements the best traits from both approaches with the aim of predicting antimicrobial and antiviral activities based on experimental data from DRAMP 2.0, AVPdb, and Uniprot data repositories. Using the Friedman test of statistical significance, we compared our hybrid, sequential properties approach to peptide properties, one-hot vector encoding, and word embedding schemes in the 10-fold cross-validation setting, with respect to the F1 score, Matthews correlation coefficient, geometric mean, recall, and precision evaluation metrics. Moreover, the sequence modeling neural network was employed to gain insight into the synergic effect of both properties- and amino acid order-based predictions. The results suggest that sequential properties significantly (P < 0.01) surpasses the aforementioned state-of-the-art representation schemes. This makes it a strong candidate for increasing the predictive power of screening methods based on machine learning, applicable to any category of peptides.
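The gap described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not list the properties or the network input format they used, so the two per-residue properties below (the Kyte-Doolittle hydropathy index and a coarse side-chain charge that treats histidine as neutral) and all function names are assumptions chosen for illustration only. The sketch shows that a compositional vector is identical for two permutations of the same residues, while a per-residue sequence of property vectors keeps the order.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index (illustrative property choice, an assumption).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Coarse side-chain charge near neutral pH; histidine is treated as
# neutral here, which is a simplification made for this sketch.
CHARGE = {aa: 0.0 for aa in AMINO_ACIDS}
CHARGE.update({"K": 1.0, "R": 1.0, "D": -1.0, "E": -1.0})


def composition(seq):
    """Whole-peptide amino acid composition: order information is lost."""
    counts = Counter(seq)
    return [counts[aa] / len(seq) for aa in AMINO_ACIDS]


def one_hot(seq):
    """One 20-dimensional indicator vector per residue: order only."""
    return [[1.0 if aa == res else 0.0 for aa in AMINO_ACIDS] for res in seq]


def sequential_properties(seq):
    """One property vector per residue: keeps both order and properties."""
    return [[HYDROPATHY[res], CHARGE[res]] for res in seq]


if __name__ == "__main__":
    a, b = "KLAKE", "EKALK"  # hypothetical peptides, permutations of each other
    print(composition(a) == composition(b))
    print(sequential_properties(a) == sequential_properties(b))
```

The output of `sequential_properties` is a list with one row per residue, which is the shape a recurrent network would consume one time step at a time; padding, scaling, and the model itself are left out of this sketch.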
Original language
English
Scientific fields
Computing, Interdisciplinary technical sciences, Biotechnology
RELATED ENTITIES
Projects:
UIP-2019-04-7999 - Design of catalytically active peptides and peptide nanostructures (UIP-2019-04) (DeShPet) (Kalafatović, Daniela) (CroRIS)
Institutions:
Faculty of Engineering, Rijeka,
University of Rijeka - Department of Biotechnology
The journal is indexed in:
- Current Contents Connect (CCC)
- Web of Science Core Collection (WoSCC)
- Science Citation Index Expanded (SCI-EXP)
- SCI-EXP, SSCI and/or A&HCI
- Scopus
- MEDLINE