Investigating Language Independence in HMM PoS/MSD-Tagging (CROSBI ID 537793)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Agić, Željko ; Tadić, Marko ; Dovedan, Zdravko
engleski
Investigating Language Independence in HMM PoS/MSD-Tagging
The paper presents an investigation of functional dependencies in morphosyntactic tagging using hidden Markov models. Starting from a well known fact that the HMM tagging paradigm relies on lexical knowledge acquired from training corpora and stored in form of transition and emission matrices, also called a language model, in the experiment, we apply the TnT trigram tagger on creating language models for seven different languages from the MULTEXT East v3 project translations of George Orwell’ s 1984. – Czech, Estonian, Hungarian, Romanian, Serbian, Slovene and original English version. We then use these language models in the tagging procedure and obtain details on various relations between training corpora statistics, training outputs and outputs of the tagging procedure.
language independence; part-of-speech tagging; morphosyntactic tagging; hidden Markov models
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
657-662.
2008.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of the 30th International Conference on Information Technology Interfaces
Lužar-Stiffler, Vesna ; Hljuz Dobrić, Vesna ; Bekić, Zoran
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce)
978-953-7138-12-7
Podaci o skupu
30th International Conference on Information Technology Interfaces (ITI 2008)
predavanje
23.06.2008-26.06.2008
Dubrovnik, Hrvatska; Cavtat, Hrvatska