Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Building Croatian medical dictionary from medical corpus (CROSBI ID 285170)

Prilog u časopisu | prethodno priopćenje | međunarodna recenzija

Kocijan, Kristina ; Kurolt, Silvia ; Mijić, Linda Building Croatian medical dictionary from medical corpus // Rasprave Instituta za hrvatski jezik i jezikoslovlje, 46 (2020), 2; 765-782. doi: 10.31724/rihjj.46.2.17

Podaci o odgovornosti

Kocijan, Kristina ; Kurolt, Silvia ; Mijić, Linda

engleski

Building Croatian medical dictionary from medical corpus

The overall objective of this project is to define linguistic models at the lexical and syntactic levels that appear in the health domain, depending on the type of corpus. In the first phase of the project, the texts forming the medical corpus A – MedCorA (2, 232 pharmaceutical instructions for medicaments available in Croatia) were prepared. The terminology found in this corpus was analyzed and the semantic subdomains (anatomy, condition, microorganism, chemistry, etc.) within the medical domain were defined and added to the dictionary entries. These dictionary resources were used as the foundation for the second phase in which NooJ morphological grammars were built allowing annotation of medical terminology in the corpus. Said grammars were built to allow for recognizing Latinisms, as well as Latin expressions written with Croatian case endings, not only Croatian words. Prepared resources are made available to a broader scientific community via Sketch Engine for further research in the field of medicine enabling additional research and development of algorithms for, among others, medical documents classification, medical texts’ information retrieval or machine translation of medical documentation, taking into account quality and reliability as well as terminology variability.

language processing ; semantic annotations ; medical domain ; NooJ ; Croatian

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

46 (2)

2020.

765-782

objavljeno

1331-6745

1849-0379

10.31724/rihjj.46.2.17

Povezanost rada

Filologija, Informacijske i komunikacijske znanosti

Poveznice
Indeksiranost