A Generic Method for Multi Word Extraction from Wikipedia (CROSBI ID 537792)
Conference paper in proceedings | original scientific paper | international peer review
Statement of responsibility
Bekavac, Božo ; Tadić, Marko
English
A Generic Method for Multi Word Extraction from Wikipedia
This paper presents a generic method for multiword expression extraction from Wikipedia. The method uses the properties of this specific encyclopedic genre in its HTML format and relies on the intention of article authors to link to other articles. The relevant links were processed by applying local regular grammars within the NooJ development environment. We tested the method on the Croatian version of Wikipedia and present the results obtained.
multi word expressions; multi word extraction; Croatian; Wikipedia
Contribution details
663-667.
2008.
published
Host publication details
Proceedings of the 30th International Conference on Information Technology Interfaces
Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce)
978-953-7138-12-7
1330-1012
Conference details
30th International Conference on Information Technology Interfaces (ITI 2008)
lecture
23.06.2008-26.06.2008
Dubrovnik, Croatia; Cavtat, Croatia