Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 389315

Error Analysis in Croatian Morphosyntactic Tagging


Agić, Željko; Tadić, Marko; Dovedan, Zdravko
Error Analysis in Croatian Morphosyntactic Tagging // Proceedings of the 31st International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Jarec, Iva ; Bekić, Zoran (ur.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2009. str. 521-526 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 389315 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Error Analysis in Croatian Morphosyntactic Tagging

Autori
Agić, Željko ; Tadić, Marko ; Dovedan, Zdravko

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the 31st International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Jarec, Iva ; Bekić, Zoran - Zagreb : Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2009, 521-526

ISBN
978-953-7138-16-5

Skup
31st International Conference on Information Technology Interfaces (ITI 2009)

Mjesto i datum
Cavtat, Hrvatska, 22.06.2009. - 25.06.2009

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
morphosyntactic tagging; part-of-speech tagging; error analysis; error distribution; Croatian language; hybrid tagging

Sažetak
In this paper, we provide detailed insight on properties of errors generated by a stochastic morphosyntactic tagger assigning Multext-East morphosyntactic descriptions to Croatian texts. Tagging the Croatia Weekly newspaper corpus by the CroTag tagger in stochastic mode revealed that approximately 85 percent of all tagging errors occur on nouns, adjectives, pronouns and verbs. Moreover, approximately 50 percent of these are shown to be incorrect assignments of case values. We provide various other distributional properties of errors in assigning morphosyntactic descriptions for these and other parts of speech. On the basis of these properties, we propose rule- based and stochastic strategies which could be integrated in the tagging module, creating a hybrid procedure in order to raise overall tagging accuracy for Croatian.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija



POVEZANOST RADA


Projekti:
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Tadić, Marko, MZOS ) ( CroRIS)
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Dovedan Han, Zdravko, MZOS ) ( CroRIS)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Marko Tadić (autor)

Avatar Url Zdravko Dovedan Han (autor)

Avatar Url Željko Agić (autor)

Poveznice na cjeloviti tekst rada:

Pristup cjelovitom tekstu rada

Citiraj ovu publikaciju:

Agić, Željko; Tadić, Marko; Dovedan, Zdravko
Error Analysis in Croatian Morphosyntactic Tagging // Proceedings of the 31st International Conference on Information Technology Interfaces / Lužar-Stiffler, Vesna ; Jarec, Iva ; Bekić, Zoran (ur.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2009. str. 521-526 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Agić, Ž., Tadić, M. & Dovedan, Z. (2009) Error Analysis in Croatian Morphosyntactic Tagging. U: Lužar-Stiffler, V., Jarec, I. & Bekić, Z. (ur.)Proceedings of the 31st International Conference on Information Technology Interfaces.
@article{article, author = {Agi\'{c}, \v{Z}eljko and Tadi\'{c}, Marko and Dovedan, Zdravko}, year = {2009}, pages = {521-526}, keywords = {morphosyntactic tagging, part-of-speech tagging, error analysis, error distribution, Croatian language, hybrid tagging}, isbn = {978-953-7138-16-5}, title = {Error Analysis in Croatian Morphosyntactic Tagging}, keyword = {morphosyntactic tagging, part-of-speech tagging, error analysis, error distribution, Croatian language, hybrid tagging}, publisher = {Sveu\v{c}ili\v{s}ni ra\v{c}unski centar Sveu\v{c}ili\v{s}ta u Zagrebu (Srce)}, publisherplace = {Cavtat, Hrvatska} }
@article{article, author = {Agi\'{c}, \v{Z}eljko and Tadi\'{c}, Marko and Dovedan, Zdravko}, year = {2009}, pages = {521-526}, keywords = {morphosyntactic tagging, part-of-speech tagging, error analysis, error distribution, Croatian language, hybrid tagging}, isbn = {978-953-7138-16-5}, title = {Error Analysis in Croatian Morphosyntactic Tagging}, keyword = {morphosyntactic tagging, part-of-speech tagging, error analysis, error distribution, Croatian language, hybrid tagging}, publisher = {Sveu\v{c}ili\v{s}ni ra\v{c}unski centar Sveu\v{c}ili\v{s}ta u Zagrebu (Srce)}, publisherplace = {Cavtat, Hrvatska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font