Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 507907

Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?


Ljubešić, Nikola; Bago, Petra; Boras, Damir
Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need? // CIT. Journal of computing and information technology, 18 (2010), 4; 303-308 doi:10.2498/cit.1001914 (podatak o recenziji nije dostupan, članak, znanstveni)


CROSBI ID: 507907 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?

Autori
Ljubešić, Nikola ; Bago, Petra ; Boras, Damir

Izvornik
CIT. Journal of computing and information technology (1330-1136) 18 (2010), 4; 303-308

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
statistical machine translation; weather forecast; automatic evaluation; human evaluation

Sažetak
This research is the first step towards developing a system for translating Croatian weather forecasts into multiple languages. This step deals with the Croatian-English language pair. The parallel corpus consists of a one-year sample of the weather forecasts for the Adriatic, con- sisting of 7, 893 sentence pairs. Evaluation is performed by the automatic evaluation measures BLUE, NIST and METEOR, as well as by manually evaluating a sample of 200 translations. We have shown that with a small- sized training set and the state-of-the art Moses system, decod- ing can be done with 96% accuracy concerning adequacy and fluency. Additional improvement is expected by increasing the training set size. Finally, the correlation of the recorded evaluation measures is explored.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekti:
130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Boras, Damir, MZOS ) ( CroRIS)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Petra Bago (autor)

Avatar Url Nikola Ljubešić (autor)

Avatar Url Damir Boras (autor)

Poveznice na cjeloviti tekst rada:

Pristup cjelovitom tekstu rada doi

Citiraj ovu publikaciju:

Ljubešić, Nikola; Bago, Petra; Boras, Damir
Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need? // CIT. Journal of computing and information technology, 18 (2010), 4; 303-308 doi:10.2498/cit.1001914 (podatak o recenziji nije dostupan, članak, znanstveni)
Ljubešić, N., Bago, P. & Boras, D. (2010) Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?. CIT. Journal of computing and information technology, 18 (4), 303-308 doi:10.2498/cit.1001914.
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Bago, Petra and Boras, Damir}, year = {2010}, pages = {303-308}, DOI = {10.2498/cit.1001914}, keywords = {statistical machine translation, weather forecast, automatic evaluation, human evaluation}, journal = {CIT. Journal of computing and information technology}, doi = {10.2498/cit.1001914}, volume = {18}, number = {4}, issn = {1330-1136}, title = {Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?}, keyword = {statistical machine translation, weather forecast, automatic evaluation, human evaluation} }
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Bago, Petra and Boras, Damir}, year = {2010}, pages = {303-308}, DOI = {10.2498/cit.1001914}, keywords = {statistical machine translation, weather forecast, automatic evaluation, human evaluation}, journal = {CIT. Journal of computing and information technology}, doi = {10.2498/cit.1001914}, volume = {18}, number = {4}, issn = {1330-1136}, title = {Statistical Machine Translation of Croatian Weather Forecasts: How Much Data Do We Need?}, keyword = {statistical machine translation, weather forecast, automatic evaluation, human evaluation} }

Uključenost u ostale bibliografske baze podataka::


  • INSPEC
  • LISA: Library and Information Science Abstracts
  • Zentrallblatt für Mathematik/Mathematical Abstracts
  • BSCO Computer Science Index
  • PASCAL Database
  • PILA CrossRef
  • Compuscience Database on STN International and Internet


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font