Napredna pretraga

Pregled bibliografske jedinice broj: 69353

Advanced Methods for Web Information Mining


Lasić-Lazić, Jadranka; Seljan, Sanja; Stančić, Hrvoje
Advanced Methods for Web Information Mining // Zbornik radova "Težakovi dani" / Lasić-Lazić, Jadranka ; Tkalec, Slavko (ur.).
Zagreb: Filozofski fakultet, Zavod za informacijske studije Odsjeka za informacijske znanosti, 2002. str. 85-96 (poster, sažetak, znanstveni)


Naslov
Advanced Methods for Web Information Mining

Autori
Lasić-Lazić, Jadranka ; Seljan, Sanja ; Stančić, Hrvoje

Vrsta, podvrsta i kategorija rada
Sažeci sa skupova, sažetak, znanstveni

Izvornik
Zbornik radova "Težakovi dani" / Lasić-Lazić, Jadranka ; Tkalec, Slavko - Zagreb : Filozofski fakultet, Zavod za informacijske studije Odsjeka za informacijske znanosti, 2002, 85-96

ISBN
953-175-182-X

Skup
Težakovi dani

Mjesto i datum
Zagreb, Hrvatska, Xx.-xx.xx., 2002.

Vrsta sudjelovanja
Poster

Vrsta recenzije
Neobjavljeni rad

Ključne riječi
Pronalaženje dokumenata; relevantnost; klasifikacija; multimedijski dokumenti
(Document retrieval; relevancy; klassification; multimedia documents)

Sažetak
There is currently huge amount of data on the Web and almost no classification information. The key problem is how to embed knowledge into information mining algorithms. The authors analyse techniques of information retrieval and give their strong and weak points. Although most Web documents are text oriented, there are plenty of them that contain multimedia elements, which are not easily accessible through common search methods. Web information is dynamic, semi-structured, and interwound with hyperlinks. Several advanced methods for Web information mining are analyzed: 1) syntax analysis, 2) metadata-based searching using RDF, 3) knowledge annotation by use of conceptual graphs (CGs), 4) KPS: Keyword, Pattern, Sample search techniques, and 5) techniques of obtaining descriptions by fuzzification and back-propagation. The problem of choosing proper keywords is also stressed out. The authors suggest the usage of already accepted standards for classification hierarchy, such as Dewey Decimal Classification (DDC).

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti

Napomena
Radovi Zavoda za informacijske studije ; knj. 11.



POVEZANOST RADA


Projekt / tema
0130440
0130462
0130740

Ustanove
Filozofski fakultet, Zagreb