Bibliographic record number: 125662
The MULTEXT-East Morphosyntactic Specifications for Slavic Languages
The MULTEXT-East Morphosyntactic Specifications for Slavic Languages // Proceedings of the EACL2003 Workshop on Morphological Processing of Slavic Languages / Erjavec, Tomaž ; Vitas, Duško (eds.).
Budapest: ACL, 2003, pp. 25-32 (lecture, international peer review, full paper (in extenso), scientific)
CROSBI ID: 125662
Title
The MULTEXT-East Morphosyntactic Specifications for Slavic Languages
Authors
Erjavec, Tomaž ; Krstev, Cvetana ; Petkevič, Vladimir ; Simov, Kiril ; Tadić, Marko ; Vitas, Duško
Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific
Source
Proceedings of the EACL2003 Workshop on Morphological Processing of Slavic Languages
/ Erjavec, Tomaž ; Vitas, Duško - Budapest : ACL, 2003, 25-32
Conference
EACL2003 Workshop on Morphological Processing of Slavic Languages
Place and date
Budapest, Hungary, 13 April 2003
Type of participation
Lecture
Type of review
International peer review
Keywords
Slavic Languages; morphology; morphological processing
Abstract
Word-level morphosyntactic descriptions, such as "Ncmsn", which designates a common masculine singular noun in the nominative, have been developed for all Slavic languages, yet there have been few attempts to arrive at a proposal harmonised across the languages. Standardisation increases the interchange potential of the resources, making it easier to develop multilingual applications or to evaluate language technology tools across several languages. The harmonisation of morphosyntactic categories, especially for the morphologically rich Slavic languages, is also interesting from a language-typological perspective. The EU MULTEXT-East project developed corpora, lexica and tools for seven languages, focusing on morphosyntactic data, including formal, EAGLES-based specifications for lexical morphosyntactic descriptions. The specifications were later extended and currently cover nine languages, five of them from the Slavic family: Bulgarian, Croatian, Czech, Serbian and Slovene. The paper presents these morphosyntactic specifications, giving their background and structure, including the encoding of the tables as TEI feature structures. The five Slavic language specifications are discussed in more depth.
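As a rough illustration of how a positional description such as "Ncmsn" can be read, the following Python sketch decodes a noun code one character at a time. The attribute order and the value tables are assumptions chosen to agree with the single example given in the abstract; they are partial, the function name decode_noun_msd is hypothetical, and none of this is taken from the specifications themselves.

```python
# Partial, illustrative attribute tables for nouns (assumed, not the official tables).
# Each tuple is (attribute name, {one-letter code: value}); position 0 is the category.
NOUN_ATTRIBUTES = [
    ("type", {"c": "common", "p": "proper"}),
    ("gender", {"m": "masculine", "f": "feminine", "n": "neuter"}),
    ("number", {"s": "singular", "p": "plural"}),
    ("case", {"n": "nominative", "g": "genitive", "d": "dative", "a": "accusative"}),
]


def decode_noun_msd(msd):
    """Decode a positional noun description such as 'Ncmsn' into a feature dict."""
    if not msd or msd[0] != "N":
        raise ValueError(f"not a noun description: {msd!r}")
    features = {"category": "noun"}
    # Pair each remaining character with the attribute at the same position.
    for (name, values), code in zip(NOUN_ATTRIBUTES, msd[1:]):
        if code == "-":
            # Assumed convention: a hyphen marks an attribute that does not apply.
            continue
        features[name] = values.get(code, f"unknown ({code})")
    return features


print(decode_noun_msd("Ncmsn"))
```

A feature dictionary of this kind is close in spirit to an attribute-value encoding, which is why a feature-structure representation is a natural fit for such tables.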
Original language
English
Scientific fields
Philology
LINKED RECORDS