Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Transfer Learning for Improved Peptide Activity Prediction on Small Dataset (CROSBI ID 722349)

Prilog sa skupa u zborniku | sažetak izlaganja sa skupa | međunarodna recenzija

Otović, Erik ; Kalafatović, Daniela ; Mauša, Goran Transfer Learning for Improved Peptide Activity Prediction on Small Dataset // 5th RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry. 2022. str. 35-36

Podaci o odgovornosti

Otović, Erik ; Kalafatović, Daniela ; Mauša, Goran

engleski

Transfer Learning for Improved Peptide Activity Prediction on Small Dataset

In recent years, deep neural networks have been successfully applied to peptide activity prediction as a part of in-silico virtual screening process. Such methods can guide the discovery of novel active peptides by providing prospective peptide sequences, and thus saving time and resources. However, such methods require significant amounts of collected data, which poses a huge obstacle for their application in the case of poorly researched peptide activities. In this work, we investigate how the predictive power of a deep learning model can be improved by transferring knowledge (i.e. transfer learning ; TL) from a larger category of peptides to the category of interest represented by a smaller dataset. The model is first pretrained on a large dataset and then fine-tuned by optimizing relevant hyperparameters (e.g. per layer learning rates, which weights to transfer and dropout rates) on a smaller target dataset, and consequently uses the knowledge gained during pretraining to solve the target task. In our case study, one antiviral and one antimicrobial dataset obtained from DRAMP 2.0 complemented with negative instances from Uniprot database are used for pretraining, while the target dataset is the antiviral AVPdb database. We represent the peptides by encoding each amino acid with 94 physico-chemical properties (e.g. hydrophobicity, Cruciani properties, etc.) A model based on convolutional and recurrent layers is used in which convolutional layers learn to detect short patterns while the recurrent layer learns long-term dependencies of short patterns in the input sequence. The convolutional filters estimated during pretraining are preserved, which creates a better starting point in comparison to training the entire model "from scratch". By using sequential feature selection, the dependency of ROC-AUC score on the number of selected features is investigated for baseline and TL models. Preliminary results demonstrate that TL-based models outperformed the baseline model, either by pretraining on a database of the same or different peptide activity. This suggests that some of the short patterns responsible for antimicrobial activity are also responsible for antiviral activity and they can be exploited to improve the predictive power.

Transfer learning ; Machine learning ; Peptide activity prediction

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

35-36.

2022.

objavljeno

Podaci o matičnoj publikaciji

5th RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry

Podaci o skupu

5th RSC-BMCS / RSC-CICAG Artificial Intelligence in Chemistry

poster

01.09.2022-02.09.2022

Cambridge, Ujedinjeno Kraljevstvo

Povezanost rada

Biotehnologija u biomedicini (prirodno područje, biomedicina i zdravstvo, biotehničko područje), Biotehnologija, Interdisciplinarne prirodne znanosti, Interdisciplinarne tehničke znanosti, Kemija