Bibliographic record no. 484912

Sentence Classification and Clause Detection for Croatian


Vučković, Kristina; Agić, Željko; Tadić, Marko
Sentence Classification and Clause Detection for Croatian // Proceedings of the 7th International Conference on Formal Approaches to South Slavic and Balkan Languages / Tadić, Marko ; Dimitrova-Vulchanova, Mila ; Koeva, Svetla (eds.).
Zagreb: Croatian Language Technologies Society -- Faculty of Humanities and Social Sciences, 2010. pp. 131-138 (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 484912

Title
Sentence Classification and Clause Detection for Croatian

Authors
Vučković, Kristina ; Agić, Željko ; Tadić, Marko

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the 7th International Conference on Formal Approaches to South Slavic and Balkan Languages / Tadić, Marko ; Dimitrova-Vulchanova, Mila ; Koeva, Svetla - Zagreb : Croatian Language Technologies Society -- Faculty of Humanities and Social Sciences, 2010, 131-138

ISBN
978-953-55375-2-6

Conference
Formal Approaches to South Slavic and Balkan Languages

Place and date
Dubrovnik, Croatia, 04.10.2010 - 06.10.2010

Type of participation
Lecture

Type of review
International peer review

Keywords
sentence detection; sentence classification; clause detection; Croatian language

Abstract
We present a method for classifying Croatian sentences by structure and for detecting independent and dependent clauses within them, together with its evaluation. A prototype system applying the method was implemented in the NooJ linguistic development environment, both for this experiment and for later use in a prototype rule-based chunking and shallow parsing system for Croatian. With regard to pre-processing, we implemented and evaluated three approaches to designing the system: (1) no pre-processing of input sentences, (2) automatic morphosyntactic tagging of sentences with the CroTag stochastic tagger and (3) manual morphosyntactic annotation of input sentences. All three approaches were evaluated for sentence classification and clause detection accuracy in terms of precision and recall. The highest-scoring system was the one that took sentences with manually assigned morphosyntactic tags as input; it achieved an overall F1-measure of 0.861 (P: 0.928, R: 0.813). The paper gives a more detailed account of the system design and experiment setup, followed by a discussion of the results and future research directions.

Original language
English

Scientific fields
Information and Communication Sciences, Philology



RELATED PROJECTS, INSTITUTIONS AND PROFILES


Projects:
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje [Croatian Language Resources and Their Annotation] (Tadić, Marko, MZOS)
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika [Computational Syntax of Croatian] (Dovedan Han, Zdravko, MZOS)

Institutions:
Faculty of Humanities and Social Sciences, Zagreb

Profiles:

Marko Tadić (author)

Željko Agić (author)

Kristina Kocijan (author)

Cite this publication:

Vučković, K., Agić, Ž. & Tadić, M. (2010) Sentence Classification and Clause Detection for Croatian. In: Tadić, M., Dimitrova-Vulchanova, M. & Koeva, S. (eds.) Proceedings of the 7th International Conference on Formal Approaches to South Slavic and Balkan Languages.
@article{article,
  author = {Vu\v{c}kovi\'{c}, Kristina and Agi\'{c}, \v{Z}eljko and Tadi\'{c}, Marko},
  year = {2010},
  pages = {131-138},
  keywords = {sentence detection, sentence classification, clause detection, Croatian language},
  isbn = {978-953-55375-2-6},
  title = {Sentence Classification and Clause Detection for Croatian},
  keyword = {sentence detection, sentence classification, clause detection, Croatian language},
  publisher = {Croatian Language Technologies Society -- Faculty of Humanities and Social Sciences},
  publisherplace = {Dubrovnik, Hrvatska}
}



