Pregled bibliografske jedinice broj: 701193
CroDeriV: a New Resource for Processing Croatian Morphology
CroDeriV: a New Resource for Processing Croatian Morphology // Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14) / Calzolari, N., Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J., Piperidis, S. (ur.).
Reykjavík: European Language Resources Association (ELRA), 2014. str. 3366-3370 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 701193 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
CroDeriV: a New Resource for Processing Croatian Morphology
Autori
Šojat, Krešimir ; Srebačić, Matea ; Pavelić, Tin ; Tadić, Marko
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
/ Calzolari, N., Choukri, K., Declerck, T., Loftsson, H., Maegaard, B., Mariani, J., Moreno, A., Odijk, J., Piperidis, S. - Reykjavík : European Language Resources Association (ELRA), 2014, 3366-3370
ISBN
978-2-9517408-8-4
Skup
Ninth International Conference on Language Resources and Evaluation (LREC'14)
Mjesto i datum
Reykjavík, Island, 26.05.2014. - 31.05.2014
Vrsta sudjelovanja
Poster
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
computational morphology ; CroDeriV ; derivational lexicon
Sažetak
The paper deals with the processing of Croatian morphology and presents CroDeriV ― a newly developed language resource that contains data about morphological structure and derivational relatedness of verbs in Croatian. In its present shape, CroDeriV contains 14 192 Croatian verbs. Verbs in CroDeriV are analyzed for morphemes and segmented into lexical, derivational and inflectional morphemes. The structure of CroDeriV enables the detection of verbal derivational families in Croatian as well as the distribution and frequency of particular affixes and lexical morphemes. Derivational families consist of a verbal base form and all prefixed or suffixed derivatives detected in available machine readable Croatian dictionaries and corpora. Language data structured in this way was further used for the expansion of other language resources for Croatian, such as Croatian WordNet and the Croatian Morphological Lexicon. Matching the data from CroDeriV on one side and Croatian WordNet and the Croatian Morphological Lexicon on the other resulted in significant enrichment of Croatian WordNet and enlargement of the Croatian Morphological Lexicon.
Izvorni jezik
Engleski
Znanstvena područja
Filologija
POVEZANOST RADA
Projekti:
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Tadić, Marko, MZOS ) ( CroRIS)
Ustanove:
Filozofski fakultet, Zagreb,
Sveučilište u Zagrebu