Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Statistical machine translation of Croatian weather forecast: How much data do we need? (CROSBI ID 571697)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Ljubešić, Nikola ; Bago, Petra ; Boras, Damir Statistical machine translation of Croatian weather forecast: How much data do we need? // ITI ... / Luzar-Stiffler, V. (ur.). 2010. str. 91-x

Podaci o odgovornosti

Ljubešić, Nikola ; Bago, Petra ; Boras, Damir

engleski

Statistical machine translation of Croatian weather forecast: How much data do we need?

This research is a first step towards a system for translating Croatian weather forecast into multiple languages. This steps deals with the Croatian-English language pair. The parallel corpus consists of a one-year sample of the weather forecasts for the Adriatic consisting of 7, 893 sentence pairs. Evaluation is performed by best known automatic evaluation measures BLUE, NIST and METEOR, as well as by evaluating manually a sample of 200 translations. In this research we have shown that with a small-sized training set and the state-of-the art Moses system, decoding can be done with 96% accuracy concerning adequacy and fluency. Additional improvement is to be expected by increasing the training set size.

statistical machine translation; weather forecast; automatic evaluation; human evaluation

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

91-x.

2010.

objavljeno

Podaci o matičnoj publikaciji

Proceedings of the ITI 2010 32nd International Conference on INFORMATION TECHNOLOGY INTERFACES

Luzar-Stiffler, V.

Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce)

978-1-4244-5732-8

1330-1012

Podaci o skupu

ITI 2010 (32nd International Conference on Information Technology Interfaces)

predavanje

21.06.2010-24.06.2010

Dubrovnik, Hrvatska; Cavtat, Hrvatska

Povezanost rada

Informacijske i komunikacijske znanosti