JW300: A wide-coverage parallel corpus for low- resource languages (CROSBI ID 679373)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Agić, Željko ; Vulić, Ivan
engleski
JW300: A wide-coverage parallel corpus for low- resource languages
Viable cross-lingual transfer critically depends on the availability of parallel texts. Shortage of such resources imposes a development and evaluation bottleneck in multilingual processing. We introduce JW300, a parallel corpus of over 300 languages with around 100 thousand parallel sentences per language pair on average. In this paper, we present the resource and showcase its utility in experiments with crosslingual word embedding induction and multisource part-of-speech projection.
low-resource languages ; parallel corpus
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
3204-3210.
2019.
objavljeno
10.18653/v1/P19-1310
Podaci o matičnoj publikaciji
ACL 2019: The 57th Conference of the Association for Computational Linguistics: Proceedings of the Conference
Cabrio, Elena ; Sprugnoli, Rachele
Firenza : München: Association for Computational Linguistics (ACL)
978-1-950737-48-2
Podaci o skupu
7th Workshop on Balto-Slavic Natural Language Processing. Association for Computational Linguistics
poster
28.07.2019-02.08.2019
Firenca, Italija