Data source: CROSBI

The SETimes.HR Linguistically Annotated Corpus of Croatian (CROSBI ID 610829)

Conference paper in proceedings | original scientific paper | international peer review

Agić, Željko ; Ljubešić, Nikola The SETimes.HR Linguistically Annotated Corpus of Croatian // Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014) / Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry et al. (eds.). Reykjavík: European Language Resources Association (ELRA), 2014. pp. 1724-1727

Statement of responsibility

Agić, Željko ; Ljubešić, Nikola

English

The SETimes.HR Linguistically Annotated Corpus of Croatian

We present SETIMES.HR, the first linguistically annotated corpus of Croatian that is freely available for all purposes. The corpus is built on top of the SETIMES parallel corpus of nine Southeast European languages and English. It is manually annotated for lemmas, morphosyntactic tags, named entities and dependency syntax. We couple the corpus with domain-sensitive test sets for Croatian and Serbian to support direct model transfer evaluation between these closely related languages. We build and evaluate statistical models for lemmatization, morphosyntactic tagging, named entity recognition and dependency parsing on top of SETIMES.HR and the test sets, providing the state of the art in all the tasks. We make all resources presented in the paper freely available under a very permissive licensing scheme.

dependency treebank; Croatian language; free availability

Contribution details

1724-1727.

2014.

published

Host publication details

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

Calzolari, Nicoletta ; Choukri, Khalid ; Declerck, Thierry ; Loftsson, Hrafn ; Maegaard, Bente ; Mariani, Joseph ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios

Reykjavík: European Language Resources Association (ELRA)

978-2-9517408-8-4

Conference details

Ninth International Conference on Language Resources and Evaluation (LREC 2014)

poster

26.05.2014-31.05.2014

Reykjavík, Iceland

Research field

Information and communication sciences
