Pregled bibliografske jedinice broj: 441037
Terminology Structuring Based on Croatian–English Parallel Corpus
Terminology Structuring Based on Croatian–English Parallel Corpus // Challenges of Translation Studies in a Globalized World - Book of Abstracts / Teržan Kopecky, Karmen ; Zupan, Simon (ur.).
Maribor: Department of Translation Studies, Faculty of Arts Maribor, 2009. str. 22-22 (predavanje, međunarodna recenzija, sažetak, znanstveni)
CROSBI ID: 441037 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Terminology Structuring Based on Croatian–English Parallel Corpus
Autori
Gašpar, Angelina ; Seljan, Sanja
Vrsta, podvrsta i kategorija rada
Sažeci sa skupova, sažetak, znanstveni
Izvornik
Challenges of Translation Studies in a Globalized World - Book of Abstracts
/ Teržan Kopecky, Karmen ; Zupan, Simon - Maribor : Department of Translation Studies, Faculty of Arts Maribor, 2009, 22-22
Skup
Challenges of Translation Studies in a Globalized World
Mjesto i datum
Maribor, Slovenija, 15.10.2009. - 17.10.2009
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
terminology structuring; term candidates; parallel corpus; statistical approach; linguistic approach; evaluation; NooJ.
Sažetak
Terminology bases have been frequently used by translators, terminologists, lexicographers and everyday users. Labour intensive terminological work encompasses the identification and the ranking of term candidates according to their domain representativity. In order to measure it, a sample of Croatian–English parallel corpora is used and a hybrid statistical and linguistic approach has been adopted as in Frantzi et al.(2000), Nakagava et.al.(1998), Daille (1955), Alegria et.al.(2004). While statistical techniques are used for ranking and filtering of n-gram term candidates, linguistic techniques are applied to select terms capturing their morpho-syntactic properties. The automatically created list is then evaluated and followed by the description of morpho-syntactic patterns. Multiword term list obtained through hybrid approach is then evaluated through measures of precision, recall and F-measure and analyzed with suggestions for further improvements. NooJ linguistic englineering tool has been applied in order to meet linguistic needs of this research.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija
POVEZANOST RADA
Projekti:
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika (Seljan, Sanja, MZOS ) ( CroRIS)
Ustanove:
Filozofski fakultet, Zagreb