Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 348724

A Generic Method for Multi Word Extraction from Wikipedia


Bekavac, Božo; Tadić, Marko
A Generic Method for Multi Word Extraction from Wikipedia // Proceedings of the 30th International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran (ur.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2008. str. 663-667 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 348724 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
A Generic Method for Multi Word Extraction from Wikipedia

Autori
Bekavac, Božo ; Tadić, Marko

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the 30th International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran - Zagreb : Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2008, 663-667

ISBN
978-953-7138-12-7

Skup
30th International Conference on Information Technology Interfaces (ITI 2008)

Mjesto i datum
Dubrovnik, Hrvatska; Cavtat, Hrvatska, 23.06.2008. - 26.06.2008

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
multi word expressions; multi word extraction; Croatian; Wikipedia

Sažetak
This paper presents the generic method for multiword expression extraction from Wikipedia. The method is using the propreties of this specific encyclopedic genre in its HTML format and it relies on the intention of the autors of articles to link to other articles. The relevant links were processed by applying local regular grammars within the NooJ development envi-ronment. We tested the method on a Croatian version of Wikipedia and we present the results obtained.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija



POVEZANOST RADA


Projekti:
036-1300646-1986 - Otkrivanje znanja u tekstnim podacima (Dalbelo-Bašić, Bojana, MZO ) ( CroRIS)
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Tadić, Marko, MZOS ) ( CroRIS)
130-1300646-1002 - Leksička semantika u izradi Hrvatskog WordNeta (Raffaelli, Ida, MZOS ) ( CroRIS)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Marko Tadić (autor)

Avatar Url Božo Bekavac (autor)


Citiraj ovu publikaciju:

Bekavac, Božo; Tadić, Marko
A Generic Method for Multi Word Extraction from Wikipedia // Proceedings of the 30th International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran (ur.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2008. str. 663-667 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Bekavac, B. & Tadić, M. (2008) A Generic Method for Multi Word Extraction from Wikipedia. U: Lužar-Stiffler, V., Hljuz Dobrić, V. & Bekić, Z. (ur.)Proceedings of the 30th International Conference on Information Technology Interfaces.
@article{article, author = {Bekavac, Bo\v{z}o and Tadi\'{c}, Marko}, year = {2008}, pages = {663-667}, keywords = {multi word expressions, multi word extraction, Croatian, Wikipedia}, isbn = {978-953-7138-12-7}, title = {A Generic Method for Multi Word Extraction from Wikipedia}, keyword = {multi word expressions, multi word extraction, Croatian, Wikipedia}, publisher = {Sveu\v{c}ili\v{s}ni ra\v{c}unski centar Sveu\v{c}ili\v{s}ta u Zagrebu (Srce)}, publisherplace = {Dubrovnik, Hrvatska; Cavtat, Hrvatska} }
@article{article, author = {Bekavac, Bo\v{z}o and Tadi\'{c}, Marko}, year = {2008}, pages = {663-667}, keywords = {multi word expressions, multi word extraction, Croatian, Wikipedia}, isbn = {978-953-7138-12-7}, title = {A Generic Method for Multi Word Extraction from Wikipedia}, keyword = {multi word expressions, multi word extraction, Croatian, Wikipedia}, publisher = {Sveu\v{c}ili\v{s}ni ra\v{c}unski centar Sveu\v{c}ili\v{s}ta u Zagrebu (Srce)}, publisherplace = {Dubrovnik, Hrvatska; Cavtat, Hrvatska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font