Pregled bibliografske jedinice broj: 698034
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing // Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) / Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios (ur.).
Reykjavík: European Language Resources Association (ELRA), 2014. str. 2313-2319 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 698034 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Croatian Dependency Treebank 2.0: New Annotation Guidelines for Improved Parsing
Autori
Agić, Željko ; Berović, Daša ; Merkler, Danijela ; Tadić, Marko
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)
/ Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios - Reykjavík : European Language Resources Association (ELRA), 2014, 2313-2319
ISBN
978-2-9517408-8-4
Skup
Ninth International Conference on Language Resources and Evaluation (LREC 2014)
Mjesto i datum
Reykjavík, Island, 26.05.2014. - 31.05.2014
Vrsta sudjelovanja
Poster
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
dependency treebank; dependency parsing; Croatian language
Sažetak
We present a new version of the Croatian Dependency Treebank. It constitutes a slight departure from the previously closely observed Prague Dependency Treebank syntactic layer annotation guidelines as we introduce a new subset of syntactic tags on top of the existing tagset. These new tags are used in explicit annotation of subordinate clauses via subordinate conjunctions. Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates. In an experiment with data-driven dependency parsing, we show that implementing these new annotation guidelines leeds to a statistically significant improvement in parsing accuracy. We also observe a substantial improvement in inter-annotator agreement, facilitating more consistent annotation in further treebank development.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija
POVEZANOST RADA
Projekti:
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Tadić, Marko, MZOS ) ( CroRIS)
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Dovedan Han, Zdravko, MZOS ) ( CroRIS)
Ustanove:
Filozofski fakultet, Zagreb