Experiments with Neural Word Embeddings for Croatian (CROSBI ID 619156)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Zuanović, Leo ; Karan, Mladen ; Šnajder, Jan
engleski
Experiments with Neural Word Embeddings for Croatian
Word representations extracted from a large corpus have been shown to be very useful in a variety of natural language processing tasks. Recently, there has been much work on using neural networks to learn good word representations from raw text. We adopt this approach and train neural word embeddings from a large Croatian web corpus. We evaluate the embeddings on three lexico-semantic tasks: synonym detection, semantic relatedness, and analogy modeling. Results on all three tasks are remarkably good and some of them markedly above the state-of-the-art results for Croatian. In particular, on the synonym detection and semantic relatedness tasks, the model achieves an accuracy of 73% and a correlation of 0.67 with human judgments, respectively.
word embeddings ; lexical semantics ; distributional semantics ; Croatian language
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
69-72.
2014.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014)
Ljubljana:
Podaci o skupu
Ninth Language Technologies Conference, Information Society (IS-JT 2014)
predavanje
09.10.2014-10.10.2014
Ljubljana, Slovenija