Bibliographic record number: 433901

Clustering of protein domains for functional and evolutionary studies


Goldstein, Pavle; Žučko, Jurica; Vujaklija, Dušica; Kriško, Anita; Hranueli, Daslav; Long, Paul F.; Etchebest, Catherine; Basrak, Bojan; Cullum, John
Clustering of protein domains for functional and evolutionary studies // BMC bioinformatics, 10 (2009), 335, 11 doi:10.1186/1471-2105-10-335 (international peer review, article, scientific)


CROSBI ID: 433901

Title
Clustering of protein domains for functional and evolutionary studies

Authors
Goldstein, Pavle; Žučko, Jurica; Vujaklija, Dušica; Kriško, Anita; Hranueli, Daslav; Long, Paul F.; Etchebest, Catherine; Basrak, Bojan; Cullum, John

Source
BMC bioinformatics (1471-2105) 10 (2009); 335, 11

Type, subtype and category of work
Journal papers, article, scientific

Keywords
protein families; DNA sequences; sequence criteria; evolutionary split statistic; clustering algorithm

Abstract
Background: The number of protein family members defined by DNA sequencing is usually much larger than the number characterised experimentally. This paper describes a method to divide protein families into subtypes purely on sequence criteria. Comparison with experimental data allows an independent test of the quality of the clustering.
Results: An evolutionary split statistic is calculated for each column in a protein multiple sequence alignment; the statistic has a larger value when a column is better described by an evolutionary model that assumes clustering around two or more amino acids rather than a single amino acid. The user selects columns (typically the top-ranked columns) to construct a motif. The motif is used to divide the family into subtypes using a stochastic optimization procedure related to the deterministic annealing EM algorithm (DAEM), which yields a specificity score showing how well each family member is assigned to a subtype. The clustering obtained is not strongly dependent on the number of amino acids chosen for the motif. The robustness of this method was demonstrated using six well-characterized protein families: nucleotidyl cyclase, protein kinase, dehydrogenase, two polyketide synthase domains and small heat shock proteins. Phylogenetic trees did not allow accurate clustering for three of the six families.
Conclusion: The method clustered the families into functional subtypes with an accuracy of 90 to 100%. False assignments usually had a low specificity score.
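The abstract names the deterministic annealing EM algorithm (DAEM) but does not spell out the procedure, so the following is only a minimal sketch of plain DAEM and not the authors' implementation. It assumes the motif has already been extracted as equal-length strings, models each subtype as independent per-column amino acid frequencies, and uses the largest posterior responsibility as a stand-in for the specificity score. The function name `daem_cluster`, the annealing schedule `betas`, the pseudocount, and the toy motifs are all hypothetical choices made for illustration.

```python
import math
import random


def daem_cluster(motifs, k=2, betas=(0.25, 0.5, 0.75, 1.0), iters=20,
                 seed=0, pseudo=0.1):
    """Cluster equal-length motif strings into k groups with DAEM.

    Each cluster is a product of per-column categorical distributions.
    Returns (labels, scores), where scores[i] is the largest posterior
    responsibility of sequence i (an assumed proxy for specificity).
    """
    rng = random.Random(seed)
    n, length = len(motifs), len(motifs[0])
    alphabet = sorted(set("".join(motifs)))

    # Random soft initial assignment of every sequence to every cluster.
    resp = []
    for _ in range(n):
        row = [rng.random() + 0.5 for _ in range(k)]
        total = sum(row)
        resp.append([r / total for r in row])

    # Anneal: beta rises towards 1, where the E-step is ordinary EM.
    for beta in betas:
        for _ in range(iters):
            # M-step: cluster weights and smoothed per-column frequencies.
            weights = [
                (sum(resp[i][c] for i in range(n)) + pseudo) / (n + k * pseudo)
                for c in range(k)
            ]
            probs = []
            for c in range(k):
                columns = []
                for j in range(length):
                    counts = {a: pseudo for a in alphabet}
                    for i, motif in enumerate(motifs):
                        counts[motif[j]] += resp[i][c]
                    total = sum(counts.values())
                    columns.append({a: v / total for a, v in counts.items()})
                probs.append(columns)

            # Tempered E-step: posterior proportional to (weight * likelihood)^beta.
            for i, motif in enumerate(motifs):
                logp = [
                    beta * (math.log(weights[c])
                            + sum(math.log(probs[c][j][motif[j]])
                                  for j in range(length)))
                    for c in range(k)
                ]
                top = max(logp)
                unnorm = [math.exp(lp - top) for lp in logp]
                total = sum(unnorm)
                resp[i] = [u / total for u in unnorm]

    labels = [max(range(k), key=lambda c: resp[i][c]) for i in range(n)]
    scores = [max(row) for row in resp]
    return labels, scores


# Toy motifs (made up for illustration, not data from the paper).
toy_motifs = ["GDS", "GDS", "GDT", "GDS", "AEK", "AEK", "AER", "AEK"]
toy_labels, toy_scores = daem_cluster(toy_motifs)
print(toy_labels, [round(s, 3) for s in toy_scores])
```

Raising the posterior to the power `beta` flattens the assignments early on, which is the usual motivation for annealing: it makes the result less sensitive to the random starting point than ordinary EM. The procedure in the paper is described as stochastic and only related to DAEM, so its details may differ from this sketch.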

Original language
English

Scientific fields
Mathematics, Biology, Biotechnology



LINKED TO THIS WORK


Projects:
MZOS-037-0982913-2762 - Deterministic and probabilistic models in biology (Marušić, Miljenko, MZOS) (CroRIS)
MZOS-058-0000000-3475 - Generation of potential drugs under in silico conditions (Hranueli/Jurica Žučko, Daslav, MZOS) (CroRIS)
MZOS-098-0982913-2877 - Basic molecular biological research on streptomycetes (Vujaklija, Dušica, MZOS) (CroRIS)

Institutions:
Prirodoslovno-matematički fakultet, Matematički odjel, Zagreb,
Prehrambeno-biotehnološki fakultet, Zagreb,
Institut "Ruđer Bošković", Zagreb,
Prirodoslovno-matematički fakultet, Zagreb

Cite this publication:

Goldstein, Pavle; Žučko, Jurica; Vujaklija, Dušica; Kriško, Anita; Hranueli, Daslav; Long, Paul F.; Etchebest, Catherine; Basrak, Bojan; Cullum, John
Clustering of protein domains for functional and evolutionary studies // BMC bioinformatics, 10 (2009), 335, 11 doi:10.1186/1471-2105-10-335 (international peer review, article, scientific)
Goldstein, P., Žučko, J., Vujaklija, D., Kriško, A., Hranueli, D., Long, P., Etchebest, C., Basrak, B. & Cullum, J. (2009) Clustering of protein domains for functional and evolutionary studies. BMC bioinformatics, 10, 335, 11 doi:10.1186/1471-2105-10-335.
@article{article,
  author = {Goldstein, Pavle and \v{Z}u\v{c}ko, Jurica and Vujaklija, Du\v{s}ica and Kri\v{s}ko, Anita and Hranueli, Daslav and Long, Paul F. and Etchebest, Catherine and Basrak, Bojan and Cullum, John},
  title = {Clustering of protein domains for functional and evolutionary studies},
  journal = {BMC bioinformatics},
  year = {2009},
  volume = {10},
  pages = {11},
  chapter = {335},
  chapternumber = {335},
  issn = {1471-2105},
  doi = {10.1186/1471-2105-10-335},
  keywords = {protein families, DNA sequences, sequence criteria, evolutionary split statistic, clustering algorithm}
}

Journal indexed in:


  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI and/or A&HCI
  • Scopus
  • MEDLINE


Included in other bibliographic databases:


  • PubMed
  • CAS

