Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing (CROSBI ID 610830)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Agić, Željko ; Berović, Daša ; Merkler, Danijela ; Tadić, Marko
engleski
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used in explicit annotation of subordinate clauses via subordinate conjunctions. Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leeds to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development.
dependency treebank; dependency parsing; Croatian language
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
2313-2319.
2014.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)
Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios
Reykjavík: European Language Resources Association (ELRA)
978-2-9517408-8-4
Podaci o skupu
Ninth International Conference on Language Resources and Evaluation (LREC 2014)
poster
26.05.2014-31.05.2014
Reykjavík, Island