Napredna pretraga

Pregled bibliografske jedinice broj: 455003

Towards Sentiment Analysis of Financial Texts in Croatian


Agić, Željko; Ljubešić, Nikola; Tadić, Marko
Towards Sentiment Analysis of Financial Texts in Croatian // Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010) / Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente ; Mariani, Joseph ; Odjik, Jan ; Piperidis, Stelios ; Rosner, Mike ; Tapias, Daniel (ur.).
Valletta: European Language Resources Association, 2010. str. 1164-1167 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


Naslov
Towards Sentiment Analysis of Financial Texts in Croatian

Autori
Agić, Željko ; Ljubešić, Nikola ; Tadić, Marko

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010) / Calzolari, Nicoletta ; Choukri, Khalid ; Maegaard, Bente ; Mariani, Joseph ; Odjik, Jan ; Piperidis, Stelios ; Rosner, Mike ; Tapias, Daniel - Valletta : European Language Resources Association, 2010, 1164-1167

ISBN
2-9517408-6-7

Skup
Proceedings of the Seventh International Conference on Language Resources and Evaluation

Mjesto i datum
Valletta, Malta, 17-23.5.2010.

Vrsta sudjelovanja
Poster

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Sentiment analysis; financial texts; Croatian language

Sažetak
The paper presents results of an experiment dealing with sentiment analysis of Croatian text from the domain of finance. The goal of the experiment was to design a system model for automatic detection of general sentiment and polarity phrases in these texts. We have assembled a document collection from web sources writing on the financial market in Croatia and manually annotated articles from a subset of that collection for general sentiment. Additionally, we have manually annotated a number of these articles for phrases encoding positive or negative sentiment within a text. In the paper, we provide an analysis of the compiled resources. We show a statistically significant correspondence (1) between the overall market trend on the Zagreb Stock Exchange and the number of positively and negatively accented articles within periods of trend and (2) between the general sentiment of articles and the number of polarity phrases within those articles. We use this analysis as an input for designing a rule-based local grammar system for automatic detection of polarity phrases and evaluate it on held out data. The system achieves F1-scores of 0.61 (P: 0.94, R: 0.45) and 0.63 (P: 0.97, R: 0.47) on positive and negative polarity phrases.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija



POVEZANOST RADA


Projekt / tema
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Marko Tadić, )
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Zdravko Dovedan Han, )
130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Damir Boras, )

Ustanove
Filozofski fakultet, Zagreb