Interoperability and Rapid Bootstrapping of Morphological Parsing and Annotation Automata (CROSBI ID 553000)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility information
Ćavar, Damir ; Jazbec, Ivo-Pavao ; Runjaić, Siniša
English
Interoperability and Rapid Bootstrapping of Morphological Parsing and Annotation Automata
We discuss the design and development of a finite state transducer for morphological segmentation, annotation, and lemmatization that allows for merging of three major functionalities into one high-performance monolithic automaton. It is designed to be flexible, extensible, and applicable to any language that allows for purely morphotactic modeling on the lexical level of morphological structure. The annotation schema used in an initial Croatian language model is a direct mapping from the GOLD ontology of linguistic concepts and features, which increases the potential for interoperability, but also opens up advanced possibilities for a DL-based post-processing.
Finite state transducer; morphology; Croatian language; GOLD ontology
Contribution details
80-85.
2008.
published
Host publication details
Proceedings of the Sixth Language Technologies Conference, October 16th-17th, 2008 : proceedings of the 11th International Multiconference Information Society - IS 2008, volume C
Erjavec, Tomaž ; Žganec Gros, Jerneja
Ljubljana: Institut Jožef Stefan
978-961-264-006-4
1581-9973
Conference details
Unknown conference
lecture