Sentence Alignment as the Basis For Translation Memory Database

Seljan, Sanja; Gašpar, Angelina; Pavuna, Damir

izvor podataka: crosbi !

Sentence Alignment as the Basis For Translation Memory Database (CROSBI ID 35709)

Prilog u knjizi | izvorni znanstveni rad

Seljan, Sanja ; Gašpar, Angelina ; Pavuna, Damir Sentence Alignment as the Basis For Translation Memory Database // The Future of Information Sciences: INFuture 2007 - Digital Information and Heritage / Seljan, Sanja ; Stančić, Hrvoje (ur.). Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2007. str. 299-311

Podaci o odgovornosti

Autori

Seljan, Sanja ; Gašpar, Angelina ; Pavuna, Damir

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

Sentence Alignment as the Basis For Translation Memory Database

Sažetak

Sentence alignment represents the basis for computer-assisted translation (CAT), terminology management, term extraction, word alignment and cross-linguistic information retrieval. Created out of the sentence alignment process, translation memory (TM) represents the basis for further research in translation equivalencies. Automatic sentence alignment, based on parallel texts, faces two types of problems: robustness and discrepancies between source and target text in layout and omissions which have influence on the accuracy of the alignment process. The aim of the paper is to present the research of the sentence alignment process realized on the Croatian- English parallel texts (laws, regulations, acts and decisions) and implemented by the alignment tool WinAlign 7.5.0 by SDL Trados 2006 Professional. The alignment process and its impact on creation of translation memories has been presented through comparison of translation memories that distinguish regarding levels of an expert intervention in the set up of the alignment program and preparation of the source text for the segmentation. Recommendations for further development using statistical analysis, automatic learning techniques and language knowledge are suggested.

Ključne riječi

sentence, alignment, translation memory, computer-assisted translation (CAT), tool, segmentation, set up

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o prilogu

Stranice rada

299-311.

Status objave rada

objavljeno

Podaci o knjizi

Knjiga u kojoj je prilog objavljen

The Future of Information Sciences: INFuture 2007 - Digital Information and Heritage

Urednici

Seljan, Sanja ; Stančić, Hrvoje

Izdavač

Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu

Godina izdavanja

2007.

ISBN

978-953-175-305-0

Povezanost rada

Povezane osobe

Damir Pavuna (CroRIS ID: 15547; MBZ: 120084) (autor/i)

Sanja Seljan (CroRIS ID: 5564; MBZ: 219255) (autor/i)

Povezane ustanove

Filozofski fakultet u Zagrebu (130) (autorova ustanova)

Povezani projekti

Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika (rezultat rada na projektu)

Područje

Informacijske i komunikacijske znanosti

Poveznice

darhiv.ffzg.unizg.hr