Napredna pretraga

Pregled bibliografske jedinice broj: 336869

Sentence Alignment as the Basis For Translation Memory Database


Seljan, Sanja; Gašpar, Angelina; Pavuna, Damir
Sentence Alignment as the Basis For Translation Memory Database // The Future of Information Sciences: INFuture 2007 - Digital Information and Heritage / Seljan, Sanja ; Stančić, Hrvoje (ur.).
Zagreb: Odsjek za Informacijske znanosti, Filozofski fakultet Zagreb, 2007. str. 299-311


Naslov
Sentence Alignment as the Basis For Translation Memory Database

Autori
Seljan, Sanja ; Gašpar, Angelina ; Pavuna, Damir

Vrsta, podvrsta i kategorija rada
Poglavlja u knjigama, pregledni

Knjiga
The Future of Information Sciences: INFuture 2007 - Digital Information and Heritage

Urednik/ci
Seljan, Sanja ; Stančić, Hrvoje

Izdavač
Odsjek za Informacijske znanosti, Filozofski fakultet Zagreb

Grad
Zagreb

Godina
2007

Raspon stranica
299-311

ISBN
978-953-175-305-0

Ključne riječi
Sentence, alignment, translation memory, computer-assisted translation (CAT), tool, segmentation, set up

Sažetak
Sentence alignment represents the basis for computer-assisted translation (CAT), terminology management, term extraction, word alignment and cross-linguistic information retrieval. Created out of the sentence alignment process, translation memory (TM) represents the basis for further research in translation equivalencies. Automatic sentence alignment, based on parallel texts, faces two types of problems: robustness and discrepancies between source and target text in layout and omissions which have influence on the accuracy of the alignment process. The aim of the paper is to present the research of the sentence alignment process realized on the Croatian- English parallel texts (laws, regulations, acts and decisions) and implemented by the alignment tool WinAlign 7.5.0 by SDL Trados 2006 Professional. The alignment process and its impact on creation of translation memories has been presented through comparison of translation memories that distinguish regarding levels of an expert intervention in the set up of the alignment program and preparation of the source text for the segmentation. Recommendations for further development using statistical analysis, automatic learning techniques and language knowledge are suggested.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekt / tema
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika (Sanja Seljan, )

Ustanove
Filozofski fakultet, Zagreb