Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Prediction of secondary protein structure with binary coding patterns of amino acid and nucleotide physicochemical properties (CROSBI ID 98550)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Štambuk, Nikola ; Konjevoda, Paško Prediction of secondary protein structure with binary coding patterns of amino acid and nucleotide physicochemical properties // International journal of quantum chemistry, 92 (2003), 2; 123-134-x

Podaci o odgovornosti

Štambuk, Nikola ; Konjevoda, Paško

engleski

Prediction of secondary protein structure with binary coding patterns of amino acid and nucleotide physicochemical properties

We present binary coding algorithm for the alpha- and beta-protein fold prediction. The method links amino acid molecular polarity patterns and physicochemical properties of nucleotide bases coded by means of a binary addresses. Primary sequences that define secondary protein structure were analyzed with respect to the symbolic oligopeptides (SO) obtained by the reduction of the 20 amino acid letter alphabet into a binary alphabet of nonpolar group 0 (W, C, I, F, M, V, L, Y) and polar group 1 (Q, R, H, K, N, E, D, S, G, T, A, P). The groups were extracted from the Grantham polarity scale with the clustering around medoids procedure. The transformation of protein strings into binary coding patterns of the polar and nonpolar amino acid groups reduced analyzed elements within the protein motif of length n by the factor of 10^n. SMO learning algorithm for the support vector machines was applied to classify alpha-helices and beta-strands. It was shown that the relative frequencies of binary hexapeptides classify all 174 nonhomologous alpha- and beta-protein folds from the Jpred database with 100% accuracy. The results of 10-fold cross-validation and leave-one-out test were 86.78%. Classification tree confirmed the results of SMO analysis and correctly classified 100% of the folds by means of 9 binary hexapeptides. Linear block triple-check code was proposed for the description of hexapeptide patterns. The presented method enables simple, quick, and accurate prediction of alpha- and beta-protein folding types from the primary amino acid and nucleotide sequences on a personal computer. Our results imply that few amino acid polarity patterns specified by the nucleotide physicochemical properties describe basic protein folding types with >90% accuracy.

protein fold; secondary structure; prediction; error-correcting code; genetic code; nucleotides; amino acids

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

92 (2)

2003.

123-134-x

objavljeno

0020-7608

Povezanost rada

Temeljne medicinske znanosti

Indeksiranost