Bibliographic record number: 418827

Interoperability and Rapid Bootstrapping of Morphological Parsing and Annotation Automata


Ćavar, Damir; Jazbec, Ivo-Pavao; Runjaić, Siniša
Interoperability and Rapid Bootstrapping of Morphological Parsing and Annotation Automata // Proceedings of the Sixth Language Technologies Conference, October 16th-17th, 2008 : proceedings of the 11th International Multiconference Information Society - IS 2008, volume C / Erjavec, Tomaž ; Žganec Gros, Jerneja (eds.).
Ljubljana: Institut »Jožef Stefan«, 2008. pp. 80-85 (lecture, international peer review, full paper (in extenso), scientific)


Title
Interoperability and Rapid Bootstrapping of Morphological Parsing and Annotation Automata

Authors
Ćavar, Damir ; Jazbec, Ivo-Pavao ; Runjaić, Siniša

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the Sixth Language Technologies Conference, October 16th-17th, 2008 : proceedings of the 11th International Multiconference Information Society - IS 2008, volume C / Erjavec, Tomaž ; Žganec Gros, Jerneja - Ljubljana : Institut »Jožef Stefan«, 2008, 80-85

ISBN
978-961-264-006-4

Conference
11th International Multiconference INFORMATION SOCIETY - IS 2008, Sixth Language Technologies Conference

Place and date
Ljubljana, Slovenia, October 16 to 17, 2008

Type of participation
Lecture

Type of review
International peer review

Keywords
Finite state transducer; morphology; Croatian language; GOLD ontology

Abstract
We discuss the design and development of a finite state transducer for morphological segmentation, annotation, and lemmatization that merges these three major functionalities into one high-performance monolithic automaton. It is designed to be flexible, extensible, and applicable to any language that allows for purely morphotactic modeling on the lexical level of morphological structure. The annotation schema used in an initial Croatian language model is a direct mapping from the GOLD ontology of linguistic concepts and features, which increases the potential for interoperability and also opens up advanced possibilities for DL-based post-processing.
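
As a rough illustration of the idea in the abstract, the following minimal sketch shows how one pass over a word can yield a segmentation, a lemma, and annotation tags together. It is an assumption-laden toy, not the authors' implementation: it chains two small tries instead of compiling one monolithic automaton, the lexicon entries and the names `State`, `add_path`, `walk`, and `analyze` are hypothetical, and the tag names are illustrative placeholders rather than verified GOLD concept identifiers.

```python
from dataclasses import dataclass, field


@dataclass
class State:
    """One automaton state: outgoing arcs plus outputs for morphemes ending here."""
    arcs: dict = field(default_factory=dict)    # input character -> next State
    finals: list = field(default_factory=list)  # outputs emitted at a morpheme boundary


def add_path(root, form, output):
    """Insert a morpheme into the trie rooted at `root` and attach its output."""
    state = root
    for ch in form:
        state = state.arcs.setdefault(ch, State())
    state.finals.append(output)


# Hypothetical toy lexicon: stems carry (lemma, part of speech).
STEMS = State()
add_path(STEMS, "knjig", ("knjiga", "Noun"))
add_path(STEMS, "žen", ("žena", "Noun"))

# Hypothetical toy suffixes: one reading each; real paradigms are more ambiguous.
SUFFIXES = State()
add_path(SUFFIXES, "a", ("Nominative", "Singular"))
add_path(SUFFIXES, "e", ("Genitive", "Singular"))
add_path(SUFFIXES, "u", ("Accusative", "Singular"))


def walk(root, word, start):
    """Yield (end, output) for every morpheme in the trie that matches word[start:end]."""
    state, pos = root, start
    while pos < len(word) and word[pos] in state.arcs:
        state = state.arcs[word[pos]]
        pos += 1
        for out in state.finals:
            yield pos, out


def analyze(word):
    """Return all stem+suffix analyses with segmentation, lemma, and tags."""
    results = []
    for mid, (lemma, pos_tag) in walk(STEMS, word, 0):
        for end, feats in walk(SUFFIXES, word, mid):
            if end == len(word):  # accept only if the whole word is consumed
                results.append({
                    "segments": [word[:mid], word[mid:]],
                    "lemma": lemma,
                    "tags": [pos_tag, *feats],
                })
    return results


print(analyze("knjige"))
```

Under these assumptions, the purely morphotactic model is just the rule "stem followed by suffix"; a production system would presumably compile lexicon and morphotactics into a single transducer so that all three outputs are emitted along one path.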

Original language
English

Scientific fields
Computer science, Information and communication sciences



AFFILIATIONS OF THE WORK


Project / topic
212-2120920-0930 - Semantičke mreže i računalna leksikologija [Semantic Networks and Computational Lexicology] (Damir Ćavar)

Institutions
Institut za hrvatski jezik i jezikoslovlje, Zagreb
Sveučilište u Zadru

Author with registry number:
Damir Ćavar (261230)