
Bibliographic record number: 324205

Text Summarization of XML documents in Croatian


Preradović Mikelić, Nives; Lauc, Tomislava; Boras, Damir
Text Summarization of XML documents in Croatian // Electrical and Computer Engineering Series, 1 (2008), 143-148 (peer-review information not available, article, scientific)


Title
Text Summarization of XML documents in Croatian

Authors
Preradović Mikelić, Nives ; Lauc, Tomislava ; Boras, Damir

Source
Electrical and Computer Engineering Series (1790-5117) 1 (2008); 143-148

Type, subtype and category of work
Journal papers, article, scientific

Keywords
Automatic summarization; XML documents; Croatian language; Perl

Abstract
The paper describes automatic summarization of XML documents in the Croatian language. The goal of the summarizer is to generate extracts with a high degree of extract-worthiness and similarity to the author's abstract. Our research shows that the extracts generated by our algorithm are well formed, but also that the algorithm is highly domain dependent. The research led us to conclude that we should implement Porter's stemming algorithm in order to improve text summarization for the Croatian language, which is currently at an early stage of development.

Original language
English

Scientific fields
Information and communication sciences



RELATED INFORMATION


Project / topic
130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Damir Boras)

Institutions
Filozofski fakultet, Zagreb

Indexed in other bibliographic databases:


  • Scopus
  • Inspec
  • ISI
  • ACM