Data source: CROSBI

Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora (CROSBI ID 562114)

Conference contribution in proceedings | original scientific paper | international peer review

Seljan, Sanja ; Tadić, Marko ; Agić, Željko ; Šnajder, Jan ; Dalbelo Bašić, Bojana ; Osmann, Vjekoslav Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora // Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010) / Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente et al. (eds.). Valletta: European Language Resources Association (ELRA), 2010. pp. 3481-3484

Responsibility details

Seljan, Sanja ; Tadić, Marko ; Agić, Željko ; Šnajder, Jan ; Dalbelo Bašić, Bojana ; Osmann, Vjekoslav

English

Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora

An increasing demand for new language resources for recent EU members and acceding countries has in turn initiated the development of language tools and resources, such as alignment tools and corresponding translation memories for new language pairs. The primary goal of this paper is to describe a free sentence alignment tool, CorAl (Corpus Aligner), developed at the Faculty of Electrical Engineering and Computing, University of Zagreb. The tool first performs paragraph alignment and then sentence alignment. The description of the tool is followed by its evaluation: an experiment applying CorAl to an English-Croatian parallel corpus from the legislative domain, measured by precision, recall and F1-measure. The results are discussed, and the concluding sections outline future directions of CorAl development.
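As an illustration of the metrics named in the abstract, the following is a minimal sketch of how precision, recall and F1-measure could be computed for sentence alignment. It is not taken from the paper or from CorAl: the function name `alignment_scores`, the representation of an alignment link as a pair of sentence-index tuples, and the sample data are all assumptions made for this example, and a link counts as correct only when it matches a reference link exactly.

```python
def alignment_scores(predicted, reference):
    """Precision, recall and F1 over sets of alignment links (exact match)."""
    predicted, reference = set(predicted), set(reference)
    correct = len(predicted & reference)  # links found in both sets
    precision = correct / len(predicted) if predicted else 0.0
    recall = correct / len(reference) if reference else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical links: (English sentence indices, Croatian sentence indices).
reference = [((0,), (0,)), ((1,), (1, 2)), ((2,), (3,)), ((3,), (4,)), ((4,), (5,))]
predicted = [((0,), (0,)), ((1,), (1,)), ((2,), (3,)), ((3,), (4,))]

precision, recall, f1 = alignment_scores(predicted, reference)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")
```

In this sample the aligner misses the 1-to-2 link and the last link, so three of its four predicted links are correct and three of the five reference links are recovered.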

Corpus Aligner; CorAl; English-Croatian Parallel Corpora


Contribution details

3481-3484.

2010.

published

Host publication details

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente ; Mariani, Joseph ; Odijk, Jan ; Piperidis, Stelios ; Rosner, Mike ; Tapias, Daniel

Valletta: European Language Resources Association (ELRA)

2-9517408-6-7

Conference details

Seventh International Conference on Language Resources and Evaluation

poster

17.05.2010-23.05.2010

Valletta, Malta

Related fields

Computing, Information and Communication Sciences, Philology
