Croatian error-annotated corpus of non- professional written language (CROSBI ID 637494)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Štefanec, Vanja ; Ljubešić, Nikola ; Kuvač Kraljević, Jelena
engleski
Croatian error-annotated corpus of non- professional written language
In the paper authors will present the Croatian corpus of non-professional written language. Consisting of two subcorpora, i.e. the clinical subcorpus, consisting of written texts produced by speakers with various types of language disorders, and the healthy speakers subcorpus, as well as by the levels of its annotation, it offers an opportunity for different lines of research. Authors will present the corpus structure, describe the sampling methodology, explain the levels of annotation, and give some very basic statistic. On the basis of data from the corpus, existing language technologies for Croatian will be adapted in order to be implemented in a platform facilitating text production to speakers with language disorders. In this respect, several analyses of the corpus data will be presented.
error corpus ; language disorders ; Croatian
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
3220-3226.
2016.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of the Tenth International conference on language resources and evaluation (LREC 2016)
Calzolari, Nicoletta ; Khalid Choukr ; Declerck, Thierry ; Goggi, Sara ; Grobelnik, Marko ; Maegaard, Bente ; Mariani, Joseph ; Mazo, Hélène ; Moreno, Asunción ; Odijk, Jan ; Piperidis, Stelios
Portorož: The European Language Resources Association
978-2-9517408-9-1
Podaci o skupu
Tenth International Conference on Language Resources and Evaluation (LREC 2016)
poster
23.05.2016-28.05.2016
Portorož, Slovenija
Povezanost rada
Pedagogija