Pregled bibliografske jedinice broj: 336869
Sentence Alignment as the Basis For Translation Memory Database
Sentence Alignment as the Basis For Translation Memory Database // The Future of Information Sciences: INFuture 2007 - Digital Information and Heritage / Seljan, Sanja ; Stančić, Hrvoje (ur.).
Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2007. str. 299-311
CROSBI ID: 336869 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Sentence Alignment as the Basis For Translation Memory Database
Autori
Seljan, Sanja ; Gašpar, Angelina ; Pavuna, Damir
Vrsta, podvrsta i kategorija rada
Poglavlja u knjigama, pregledni
Knjiga
The Future of Information Sciences: INFuture 2007 - Digital Information and Heritage
Urednik/ci
Seljan, Sanja ; Stančić, Hrvoje
Izdavač
Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu
Grad
Zagreb
Godina
2007
Raspon stranica
299-311
ISBN
978-953-175-305-0
Ključne riječi
sentence, alignment, translation memory, computer-assisted translation (CAT), tool, segmentation, set up
Sažetak
Sentence alignment represents the basis for computer-assisted translation (CAT), terminology management, term extraction, word alignment and cross-linguistic information retrieval. Created out of the sentence alignment process, translation memory (TM) represents the basis for further research in translation equivalencies. Automatic sentence alignment, based on parallel texts, faces two types of problems: robustness and discrepancies between source and target text in layout and omissions which have influence on the accuracy of the alignment process. The aim of the paper is to present the research of the sentence alignment process realized on the Croatian- English parallel texts (laws, regulations, acts and decisions) and implemented by the alignment tool WinAlign 7.5.0 by SDL Trados 2006 Professional. The alignment process and its impact on creation of translation memories has been presented through comparison of translation memories that distinguish regarding levels of an expert intervention in the set up of the alignment program and preparation of the source text for the segmentation. Recommendations for further development using statistical analysis, automatic learning techniques and language knowledge are suggested.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti
POVEZANOST RADA
Projekti:
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika (Seljan, Sanja, MZOS ) ( CroRIS)
Ustanove:
Filozofski fakultet, Zagreb