Pregled bibliografske jedinice broj: 1113984
Multiword expressions in the medical domain: who carries the domain-specific meaning
Multiword expressions in the medical domain: who carries the domain-specific meaning // Formalising natural languages: applications to natural language processing and digital humanities. NooJ 2020. : revised selected papers / Bekavac, Božo ; Kocijan, Kristina ; Silberztein, Max ; Šojat, Krešimir (ur.).
Zagreb, Hrvatska: Springer, 2021. str. 49-60 doi:10.1007/978-3-030-70629-6_5 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 1113984 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Multiword expressions in the medical domain: who
carries the domain-specific meaning
Autori
Kocijan, Kristina ; Šojat, Krešimir ; Kurolt, Silvia
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Formalising natural languages: applications to natural language processing and digital humanities. NooJ 2020. : revised selected papers
/ Bekavac, Božo ; Kocijan, Kristina ; Silberztein, Max ; Šojat, Krešimir - : Springer, 2021, 49-60
ISBN
978-3-030-70628-9
Skup
International Conference on Automatic Processing of Natural-Language Electronic Texts with NooJ (NooJ2020)
Mjesto i datum
Zagreb, Hrvatska, 05.06.2020. - 07.06.2020
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
medical domain corpus ; detecting MWE ; multiword expressions ; multiword units ; domain specific meaning ; morphology ; syntax ; Croatian language ; NooJ
Sažetak
This paper is a continuation of work in natural language processing in the medical domain for Croatian. After we have annotated single nouns from our corpus consisting of pharmaceutical instructions for medicaments, we are shifting the focus to multiword expressions (MWEs). The project still relies on the nouns from the previous step to detect MWEs where the noun is the main carrier of the medical meaning. However, in cases where the main noun is more general and not directly associated with the medical domain (e.g., bubrežna funkcija ‘kidney function’), we use the power of NooJ morphology grammar to check if the preceding adjective root is associated with the noun found in the main dictionary and annotated as a medical domain noun. Thus, we are checking if the adjective (endoskopski ‘endoscopic’) has a corresponding noun (endoskopija ‘endoscopy’) that is already marked in the NooJ dictionary as a noun belonging to the medical domain. In such cases, we assume that the adjective belongs to the same domain as the noun and that the attribute for the medical domain can be inherited, not only for the adjective, but for the entire MWE as well. The project hopes to help with the automatic extraction and annotation of single adjectives from the medical domain, but also to help identify medical MWEs. Additionally, we wanted to learn more about who carries the domain-specific meaning in Croatian MWEs.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti, Interdisciplinarne društvene znanosti, Filologija, Kognitivna znanost (prirodne, tehničke, biomedicina i zdravstvo, društvene i humanističke znanosti)
POVEZANOST RADA
Ustanove:
Filozofski fakultet, Zagreb
Citiraj ovu publikaciju:
Časopis indeksira:
- Scopus