Napredna pretraga

Pregled bibliografske jedinice broj: 698034

Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing


Agić, Željko; Berović, Daša; Merkler, Danijela; Tadić, Marko
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing // Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) / Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios (ur.).
Reykjavik, Iceland: European Language Resources Association (ELRA), 2014. str. 2313-2319 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


Naslov
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing

Autori
Agić, Željko ; Berović, Daša ; Merkler, Danijela ; Tadić, Marko

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) / Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios - Reykjavik, Iceland : European Language Resources Association (ELRA), 2014, 2313-2319

ISBN
978-2-9517408-8-4

Skup
Ninth International Conference on Language Resources and Evaluation (LREC 2014)

Mjesto i datum
Reykjavik, Island, 26-31.05.2014

Vrsta sudjelovanja
Poster

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
Dependency treebank; dependency parsing; Croatian language

Sažetak
We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used in explicit annotation of subordinate clauses via subordinate conjunctions. Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leeds to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija



POVEZANOST RADA


Projekt / tema
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Marko Tadić, )
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Zdravko Dovedan Han, )

Ustanove
Filozofski fakultet, Zagreb