Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Error Analysis in Croatian Morphosyntactic Tagging (CROSBI ID 546823)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Agić, Željko ; Tadić, Marko ; Dovedan, Zdravko Error Analysis in Croatian Morphosyntactic Tagging // Proceedings of the 31st International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Jarec, Iva ; Bekić, Zoran (ur.). Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2009. str. 521-526

Podaci o odgovornosti

Agić, Željko ; Tadić, Marko ; Dovedan, Zdravko

engleski

Error Analysis in Croatian Morphosyntactic Tagging

In this paper, we provide detailed insight on properties of errors generated by a stochastic morphosyntactic tagger assigning Multext-East morphosyntactic descriptions to Croatian texts. Tagging the Croatia Weekly newspaper corpus by the CroTag tagger in stochastic mode revealed that approximately 85 percent of all tagging errors occur on nouns, adjectives, pronouns and verbs. Moreover, approximately 50 percent of these are shown to be incorrect assignments of case values. We provide various other distributional properties of errors in assigning morphosyntactic descriptions for these and other parts of speech. On the basis of these properties, we propose rule- based and stochastic strategies which could be integrated in the tagging module, creating a hybrid procedure in order to raise overall tagging accuracy for Croatian.

morphosyntactic tagging; part-of-speech tagging; error analysis; error distribution; Croatian language; hybrid tagging

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

521-526.

2009.

objavljeno

Podaci o matičnoj publikaciji

Proceedings of the 31st International Conference on Information Technology Interfaces

Lužar-Stiffler, Vesna ; Jarec, Iva ; Bekić, Zoran

Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce)

978-953-7138-16-5

Podaci o skupu

31st International Conference on Information Technology Interfaces, ITI 2009

predavanje

22.06.2009-25.06.2009

Dubrovnik, Hrvatska

Povezanost rada

Informacijske i komunikacijske znanosti, Filologija