Pregled bibliografske jedinice broj: 737161
Experiments with Neural Word Embeddings for Croatian
Experiments with Neural Word Embeddings for Croatian // Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014)
Ljubljana, 2014. str. 69-72 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 737161 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Experiments with Neural Word Embeddings for Croatian
Autori
Zuanović, Leo ; Karan, Mladen ; Šnajder, Jan
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the Ninth Language Technologies Conference, Information Society (IS-JT 2014)
/ - Ljubljana, 2014, 69-72
Skup
Ninth Language Technologies Conference, Information Society (IS-JT 2014)
Mjesto i datum
Ljubljana, Slovenija, 09.10.2014. - 10.10.2014
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
word embeddings ; lexical semantics ; distributional semantics ; Croatian language
Sažetak
Word representations extracted from a large corpus have been shown to be very useful in a variety of natural language processing tasks. Recently, there has been much work on using neural networks to learn good word representations from raw text. We adopt this approach and train neural word embeddings from a large Croatian web corpus. We evaluate the embeddings on three lexico-semantic tasks: synonym detection, semantic relatedness, and analogy modeling. Results on all three tasks are remarkably good and some of them markedly above the state-of-the-art results for Croatian. In particular, on the synonym detection and semantic relatedness tasks, the model achieves an accuracy of 73% and a correlation of 0.67 with human judgments, respectively.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Ustanove:
Fakultet elektrotehnike, strojarstva i brodogradnje, Split,
Fakultet elektrotehnike i računarstva, Zagreb