Pregled bibliografske jedinice broj: 463848
Corpus Aligner (CorAl) Evaluation on English- Croatian Parallel Corpora
Corpus Aligner (CorAl) Evaluation on English- Croatian Parallel Corpora // Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010) / Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente ; Mariani, Joseph ; Odjik, Jan ; Piperidis, Stelios ; Rosner, Mike ; Tapias, Daniel (ur.).
Valletta: European Language Resources Association (ELRA), 2010. str. 3481-3484 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 463848 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Corpus Aligner (CorAl) Evaluation on English- Croatian Parallel Corpora
Autori
Seljan, Sanja ; Tadić, Marko ; Agić, Željko ; Šnajder, Jan ; Dalbelo Bašić, Bojana ; Osmann, Vjekoslav
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)
/ Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente ; Mariani, Joseph ; Odjik, Jan ; Piperidis, Stelios ; Rosner, Mike ; Tapias, Daniel - Valletta : European Language Resources Association (ELRA), 2010, 3481-3484
ISBN
2-9517408-6-7
Skup
Seventh International Conference on Language Resources and Evaluation
Mjesto i datum
Valletta, Malta, 17.05.2010. - 23.05.2010
Vrsta sudjelovanja
Poster
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Corpus Aligner; Coral; English-Croatian Parallel Corpora
Sažetak
An increasing demand for new language resources of recent EU members and accessing countries has in turn initiated the development of different language tools and resources, such as alignment tools and corresponding translation memories for new languages pairs. The primary goal of this paper is to provide a description of a free sentence alignment tool CorAl (Corpus Aligner), developed at the Faculty of Electrical Engineering and Computing, University of Zagreb. The tool performs paragraph alignment at the first step of the alignment process, which is followed by sentence alignment. Description of the tool is followed by its evaluation. The paper describes an experiment with applying the CorAl aligner to a English-Croatian parallel corpus of legislative domain using metrics of precision, recall and F1- measure. Results are discussed and the concluding sections discuss future directions of CorAl development.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti, Filologija
POVEZANOST RADA
Projekti:
036-1300646-1986 - Otkrivanje znanja u tekstnim podacima (Dalbelo-Bašić, Bojana, MZO ) ( CroRIS)
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Tadić, Marko, MZOS ) ( CroRIS)
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika (Seljan, Sanja, MZOS ) ( CroRIS)
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Dovedan Han, Zdravko, MZOS ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb,
Filozofski fakultet, Zagreb
Profili:
Sanja Seljan
(autor)
Bojana Dalbelo Bašić
(autor)
Marko Tadić
(autor)
Željko Agić
(autor)
Jan Šnajder
(autor)