Napredna pretraga

Pregled bibliografske jedinice broj: 433160

Evaluation of the Statistical Machine Translation Service for Croatian-English


Brkić, Marija; Vičić, Tomislav; Seljan, Sanja
Evaluation of the Statistical Machine Translation Service for Croatian-English // 2nd international conference The future of information sciences (INFuture 2009) : Digital resources and knowledge sharing : proceedings / Stančić, Hrvoje ; Seljan, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida (ur.).
Zagreb: Department of Information Sciences, Faculty of Humanities and Social Sciences, University of Zagreb, 2009. str. 319-332 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


Naslov
Evaluation of the Statistical Machine Translation Service for Croatian-English

Autori
Brkić, Marija ; Vičić, Tomislav ; Seljan, Sanja

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
2nd international conference The future of information sciences (INFuture 2009) : Digital resources and knowledge sharing : proceedings / Stančić, Hrvoje ; Seljan, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida - Zagreb : Department of Information Sciences, Faculty of Humanities and Social Sciences, University of Zagreb, 2009, 319-332

ISBN
978-953-175-355-5

Skup
International Conference The Future of Information Sciences (2 ; 2009)

Mjesto i datum
Zagreb, Hrvatska, 04.-06.11.2009.

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
SMT (statistical machine translation); online; Google Translate; MT; Croatian-English; manual evaluation; fluency; adequacy; χ2-test

Sažetak
Much thought has been given in an endeavour to formalize the translation process. As a result, various approaches to MT (machine translation) were taken. With the exception of statistical translation, all approaches require cooperation between language and computer science experts. Most of the models use various hybrid approaches. Statistical translation approach is completely language independent if we disregard the fact that it requires huge parallel corpus that needs to be split into sentences and words. This paper compares and discusses state-of-the-art statistical machine translation (SMT) models and evaluation methods. Results of statistically-based Google Translate tool for Croatian-English translations are presented and multilevel analysis is given. Three different types of texts are manually evaluated and results are analysed by the χ2-test.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti, Filologija



POVEZANOST RADA


Projekt / tema
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika (Sanja Seljan, )
318-0361935-0852 - Govorne tehnologije (Ivo Ipšić, )

Ustanove
Filozofski fakultet, Zagreb,
Sveučilište u Rijeci - Odjel za informatiku