Experiments on Active Learning for Croatian Word Sense Disambiguation (CROSBI ID 629059)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility details
Alagić, Domagoj ; Šnajder, Jan
English
Supervised word sense disambiguation (WSD) has been shown to achieve state-of-the-art results but at high annotation costs. Active learning can ameliorate that problem by allowing the model to dynamically choose the most informative word contexts for manual labeling. In this paper, we investigate the use of active learning for Croatian WSD. We adopt a lexical sample approach and compile a corresponding sense-annotated dataset on which we evaluate our models. We carry out a detailed investigation of the different active learning setups, and show that labeling as few as 100 instances suffices to reach near-optimal performance.
word sense disambiguation; lexical sample; active learning; Croatian language
Contribution details
49-58.
2015.
published
Host publication details
Proceedings of the 5th Workshop on Balto-Slavic Natural Language Processing
Piskorski, J. ; Pivovarova, L. ; Šnajder, J. ; Tanev, H. ; Yangarber, R.
Hisarya: Incoma Ltd.
978-954-452-033-5
Conference details
5th Workshop on Balto-Slavic Natural Language Processing associated with the 10th International Conference on Recent Advances in Natural Language Processing (RANLP 2015)
oral presentation
10.09.2015-11.09.2015
Hisar, Bulgaria