
Bibliographic record no. 441037

Terminology Structuring Based on Croatian–English Parallel Corpus


Gašpar, Angelina; Seljan, Sanja
Terminology Structuring Based on Croatian–English Parallel Corpus // Challenges of Translation Studies in a Globalized World - Book of Abstracts / Teržan Kopecky, Karmen ; Zupan, Simon (eds.).
Maribor: Department of Translation Studies, Faculty of Arts Maribor, 2009. p. 22 (lecture, international peer review, abstract, scientific)


Title
Terminology Structuring Based on Croatian–English Parallel Corpus

Authors
Gašpar, Angelina ; Seljan, Sanja

Type, subtype and category of work
Conference abstracts, abstract, scientific

Source
Challenges of Translation Studies in a Globalized World - Book of Abstracts / Teržan Kopecky, Karmen ; Zupan, Simon - Maribor : Department of Translation Studies, Faculty of Arts Maribor, 2009, 22-22

Conference
Challenges of Translation Studies in a Globalized World

Place and date
Maribor, Slovenia, 15-17 October 2009

Type of participation
Lecture

Type of peer review
International peer review

Keywords
Terminology structuring; term candidates; parallel corpus; statistical approach; linguistic approach; evaluation; NooJ.

Abstract
Terminology bases are frequently used by translators, terminologists, lexicographers and everyday users. Labour-intensive terminological work encompasses the identification and ranking of term candidates according to their domain representativity. To measure it, a sample of a Croatian–English parallel corpus is used and a hybrid statistical and linguistic approach is adopted, as in Frantzi et al. (2000), Nakagava et al. (1998), Daille (1955) and Alegria et al. (2004). While statistical techniques are used for ranking and filtering n-gram term candidates, linguistic techniques are applied to select terms by capturing their morpho-syntactic properties. The automatically created list is then evaluated, followed by a description of the morpho-syntactic patterns. The multiword term list obtained through the hybrid approach is evaluated using precision, recall and F-measure, and analyzed with suggestions for further improvements. The NooJ linguistic engineering tool has been applied to meet the linguistic needs of this research.
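
The hybrid pipeline outlined in the abstract (morpho-syntactic filtering of n-gram candidates, frequency-based ranking, then evaluation against a reference list) can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not use NooJ: the POS tag set, the two patterns in PATTERNS, the function names and the toy data are all assumptions made for illustration, and raw frequency stands in for whichever statistical measure the study actually applied.

```python
from collections import Counter

# Hypothetical morpho-syntactic patterns for two-word terms (an assumption,
# not taken from the abstract): adjective + noun, noun + noun.
PATTERNS = {("ADJ", "NOUN"), ("NOUN", "NOUN")}


def extract_candidates(tagged_tokens, n=2):
    """Count n-grams whose POS sequence matches one of PATTERNS.

    tagged_tokens is a list of (word, pos_tag) pairs.
    """
    counts = Counter()
    for i in range(len(tagged_tokens) - n + 1):
        window = tagged_tokens[i:i + n]
        tags = tuple(tag for _, tag in window)
        if tags in PATTERNS:  # linguistic filter
            term = " ".join(word.lower() for word, _ in window)
            counts[term] += 1  # raw frequency as a simple statistic
    return counts


def evaluate(extracted, gold):
    """Return (precision, recall, F-measure) of extracted terms vs. a gold list."""
    extracted, gold = set(extracted), set(gold)
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Toy, hand-tagged input (illustrative only).
tagged = [
    ("parallel", "ADJ"), ("corpus", "NOUN"), ("contains", "VERB"),
    ("legal", "ADJ"), ("terminology", "NOUN"), ("and", "CCONJ"),
    ("parallel", "ADJ"), ("corpus", "NOUN"), ("data", "NOUN"),
]
gold_terms = {"parallel corpus", "legal terminology", "term candidate"}

candidates = extract_candidates(tagged)
ranked = candidates.most_common()  # highest-frequency candidates first
precision, recall, f_measure = evaluate(candidates, gold_terms)
print(ranked)
print(precision, recall, f_measure)
```

On the toy input, three candidates pass the filter and two of them appear in the reference list, so precision, recall and F-measure all equal 2/3. In practice the patterns would be language-specific and the reference list would come from manual validation.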

Original language
English

Scientific fields
Information and communication sciences, Philology



AFFILIATIONS OF THE WORK


Project / topic
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika [Information Technology in Croatian Translation and Language E-Learning] (Sanja Seljan)

Institutions
Faculty of Humanities and Social Sciences, Zagreb