Pregled bibliografske jedinice broj: 644078
Parsing Croatian and Serbian by Using Croatian Dependency Treebanks
Parsing Croatian and Serbian by Using Croatian Dependency Treebanks // Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2013)
Seattle (WA): Association for Computational Linguistics (ACL), 2013. str. 22-33 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 644078 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Parsing Croatian and Serbian by Using Croatian Dependency Treebanks
Autori
Agić, Željko ; Merkler, Danijela ; Berović, Daša
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2013)
/ - Seattle (WA) : Association for Computational Linguistics (ACL), 2013, 22-33
ISBN
978-1-937284-97-8
Skup
Fourth Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2013)
Mjesto i datum
Seattle (WA), Sjedinjene Američke Države, 18.10.2013. - 21.10.2013
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
dependency treebank; dependency parsing; Croatian; Serbian
Sažetak
We investigate statistical dependency parsing of two closely related languages, Croatian and Serbian. As these two morphologically complex languages of relaxed word order are generally under- resourced -- with the topic of dependency parsing still largely unaddressed, especially for Serbian -- we make use of the two available dependency treebanks of Croatian to produce state-of- the-art parsing models for both languages. We observe parsing accuracy on four test sets from two domains. We give insight into overall parser performance for Croatian and Serbian, impact of preprocessing for lemmas and morphosyntactic tags and influence of selected morphosyntactic features on parsing accuracy.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti
POVEZANOST RADA
Projekti:
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Dovedan Han, Zdravko, MZOS ) ( CroRIS)
Ustanove:
Filozofski fakultet, Zagreb