Idioms in state-of-the-art Croatian-English and English-Croatian SMT systems (CROSBI ID 648234)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility details
Manojlović, Maja ; Dajak, Luka ; Brkić Bakarić, Marija
English
Idioms are well known for posing problems to non-native speakers, let alone machines. A failure to identify an idiom often leads to unnatural, even hilarious, output. This paper investigates how idioms are handled by state-of-the-art SMT systems for English and Croatian. First, we introduce the concept of idioms. Then, for each language, we construct three short stories rich in idioms and translate them into the other language with two state-of-the-art SMT systems. Next, we manually inspect the outputs and present the results. To support the analysis, we devise an error taxonomy for idiom handling.
idioms, Croatian-English, SMT, English-Croatian, machine translation
Contribution details
1798-1802.
2017.
published
Host publication details
Proceedings of the MIPRO 2017 40th Jubilee International Convention
Petar Biljanović
Rijeka:
978-953-233-093-9
Conference details
MIPRO 2017
lecture
22.05.2017-26.05.2017
Opatija, Croatia