Comparison of Short-Text Sentiment Analysis Methods for Croatian (CROSBI ID 702547)
Conference paper in proceedings | original scientific paper | international peer review
Authorship information
Šnajder, Jan ; Rotim, Leon
English
We focus on the task of supervised sentiment classification of short and informal texts in Croatian, using two simple yet effective methods: word embeddings and string kernels. We investigate whether word embeddings offer any advantage over corpus- and preprocessing-free string kernels, and how these compare to bag-of-words baselines. We conduct a comparison on three different datasets, using different preprocessing methods and kernel functions. Results show that, on two out of three datasets, word embeddings outperform string kernels, which in turn outperform word and n-gram bag-of-words baselines.
sentiment analysis ; string kernels ; word embeddings ; Croatian language
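As a rough illustration of why string kernels need neither an external corpus nor preprocessing, the sketch below computes a character n-gram (spectrum) kernel between two raw strings. This record does not say which kernel functions the paper used, so the spectrum kernel is only an assumed, commonly used example, and the function names and the default n=3 are illustrative.

```python
from collections import Counter
import math


def ngram_counts(text, n=3):
    """Count all overlapping character n-grams in the raw text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def spectrum_kernel(a, b, n=3):
    """Unnormalized spectrum kernel: dot product of n-gram count vectors."""
    ca, cb = ngram_counts(a, n), ngram_counts(b, n)
    # Counter returns 0 for n-grams that do not occur in b.
    return sum(count * cb[gram] for gram, count in ca.items())


def normalized_kernel(a, b, n=3):
    """Cosine-normalized kernel; returns 0.0 if either text has no n-grams."""
    denom = math.sqrt(spectrum_kernel(a, a, n) * spectrum_kernel(b, b, n))
    return spectrum_kernel(a, b, n) / denom if denom else 0.0


# Hypothetical short texts; inflected forms still share many character n-grams.
similarity = normalized_kernel("odličan film", "odlični filmovi")
```

A matrix of such pairwise values could then be passed to any kernel-based classifier that accepts precomputed kernels. The strings are compared as-is, with no tokenization, lemmatization, or pretrained resources.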
Contribution details
69-75.
2017.
published
Host publication details
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing
Conference details
The 6th Workshop on Balto-Slavic Natural Language Processing
oral presentation
04.04.2017-04.04.2017
Valencia, Spain