Data source: CROSBI

Possibilities of Identification of Translation Equivalents in Croatian-English Parallel Corpus (CROSBI ID 480867)

Unpublished conference participation | unpublished conference contribution | international peer review

Šojat, Krešimir ; Tadić, Marko Possibilities of Identification of Translation Equivalents in Croatian-English Parallel Corpus // Unknown conference, 29.02.1904-29.02.2096

Responsibility data

Šojat, Krešimir ; Tadić, Marko

English

Possibilities of Identification of Translation Equivalents in Croatian-English Parallel Corpus

The paper discusses associations between translation equivalents in a sentence-aligned parallel corpus. The focus is on identifying multi-word units in a parallel corpus and verifying translation equivalents. The data were extracted from the Croatian-English parallel corpus, aligned at the sentence level, which was compiled at the Institute of Linguistics, Faculty of Philosophy, University of Zagreb. Using statistical measures, primarily the mutual information value, significant co-occurrences of words were first identified in the source language. To define significant multi-word units in both the source language (Croatian) and the target language (English), the same procedure was then carried out in the target language, followed by analysis and classification of the results. To establish which translations are significant, the next step consists of applying the statistical measures to pairs of translation equivalents in a test sample extracted from the parallel corpus. Applying such measures to translation equivalents enables the systematization of terminology in areas with a constant lack of new Croatian terms (e.g. market economy, computer science). On the other hand, examining translation equivalents in the target language can serve as an instrument for information extraction in the source language.
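The two statistical steps named in the abstract can be illustrated with a minimal sketch. It assumes pointwise mutual information, log2(p(x,y) / (p(x) p(y))), as the measure; the abstract does not give the exact formulation, tokenization, or thresholds used, so the function names `bigram_pmi` and `aligned_pmi` and the `min_count` cutoff are hypothetical choices made only for illustration.

```python
import math
from collections import Counter


def bigram_pmi(tokens, min_count=2):
    """Score adjacent word pairs in one language by pointwise mutual information.

    Assumption: multi-word unit candidates are adjacent bigrams; pairs seen
    fewer than min_count times are skipped because PMI is unreliable for them.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue
        p_xy = count / n_bi
        p_x = unigrams[w1] / n_uni
        p_y = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_xy / (p_x * p_y))
    return scores


def aligned_pmi(pairs, min_count=2):
    """Score (source word, target word) pairs over sentence-aligned data.

    pairs: list of (source_tokens, target_tokens), one entry per aligned
    sentence pair. Probabilities are estimated as the share of aligned
    pairs in which a word (or both words) occurs at least once.
    """
    n = len(pairs)
    src_df, tgt_df, joint = Counter(), Counter(), Counter()
    for src, tgt in pairs:
        s, t = set(src), set(tgt)
        src_df.update(s)
        tgt_df.update(t)
        joint.update((x, y) for x in s for y in t)
    return {
        (x, y): math.log2((c / n) / ((src_df[x] / n) * (tgt_df[y] / n)))
        for (x, y), c in joint.items()
        if c >= min_count
    }
```

In this sketch, `bigram_pmi` would be run separately on the Croatian and the English side to propose multi-word unit candidates, and `aligned_pmi` would then rank candidate translation equivalents across aligned sentences. High scores only indicate candidates; the analysis and classification described in the abstract would still be needed.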

corpus; parallel corpora; Croatian; English; translation equivalents

not recorded

not recorded

not recorded

not recorded

not recorded

not recorded

Contribution data

not recorded

not recorded

Conference data

Unknown conference

lecture

29.02.1904-29.02.2096

Related field

Philology