Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1062865

Evaluating Language Tools for Fifteen EU- official Under-resourced Languages


Alves, Diego; Thakkar, Gaurish; Tadić, Marko
Evaluating Language Tools for Fifteen EU- official Under-resourced Languages // Proceedings of The 12th Language Resources and Evaluation Conference / Calzolari, Nicoletta ; Béchet, Frédéric ; Blache, Philippe ; Choukri, Khalid ; Cieri, Christopher ; Declerck, Thierry ; Goggi, Sara ; Isahara, Hitoshi ; Maegaard, Bente ; Mariani, Joseph ; Mazo, Hélène ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios (ur.).
Marseille: European Language Resources Association (ELRA), 2020. str. 1866-1873 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 1062865 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Evaluating Language Tools for Fifteen EU- official Under-resourced Languages

Autori
Alves, Diego ; Thakkar, Gaurish ; Tadić, Marko

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of The 12th Language Resources and Evaluation Conference / Calzolari, Nicoletta ; Béchet, Frédéric ; Blache, Philippe ; Choukri, Khalid ; Cieri, Christopher ; Declerck, Thierry ; Goggi, Sara ; Isahara, Hitoshi ; Maegaard, Bente ; Mariani, Joseph ; Mazo, Hélène ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios - Marseille : European Language Resources Association (ELRA), 2020, 1866-1873

Skup
The 12th Language Resources and Evaluation Conference (LREC2020)

Mjesto i datum
Marseille, Francuska, 11.05.2020. - 16.05.2020

Vrsta sudjelovanja
Poster

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
language processing chains ; under-resourced languages ; evaluation

Sažetak
This article presents the results of the evaluation campaign of language tools available for fifteen EU-official under-resourced languages. The evaluation was conducted within the MSC ITN CLEOPATRA action that aims at building the cross-lingual event- centric knowledge processing on top of the application of linguistic processing chains (LPCs) for at least 24 EU-official languages. In this campaign, we concentrated on three existing NLP platforms (Stanford CoreNLP, NLP Cube, UDPipe) that all provide models for under-resourced languages and in this first run we covered 15 under- resourced languages for which the models were available. We present the design of the evaluation campaign and present the results as well as discuss them. We considered the difference between reported and our tested results within a single percentage point as being within the limits of acceptable tolerance and thus consider this result as reproducible. However, for a number of languages, the results are below what was reported in the literature, and in some cases, our testing results are even better than the ones reported previously. Particularly problematic was the evaluation of NERC systems. One of the reasons is the absence of universally or cross-lingually applicable named entities classification scheme that would serve the NERC task in different languages analogous to the Universal Dependency scheme in parsing task. To build such a scheme has become one of our the future research directions.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija

Napomena
Zbog pandemije krunastoga virusa, kongres nije
održan, ali je zbornik radova objavljen 2020-05-15.



POVEZANOST RADA


Projekti:
EK-H2020-812997 - Cross-lingual Event-centric Open Analytics Research Academy (Cleopatra) (Tadić, Marko, EK - H2020-MSCA-ITN-2018) ( CroRIS)

Ustanove:
Filozofski fakultet, Zagreb

Poveznice na cjeloviti tekst rada:

www.lrec-conf.org

Citiraj ovu publikaciju:

Alves, Diego; Thakkar, Gaurish; Tadić, Marko
Evaluating Language Tools for Fifteen EU- official Under-resourced Languages // Proceedings of The 12th Language Resources and Evaluation Conference / Calzolari, Nicoletta ; Béchet, Frédéric ; Blache, Philippe ; Choukri, Khalid ; Cieri, Christopher ; Declerck, Thierry ; Goggi, Sara ; Isahara, Hitoshi ; Maegaard, Bente ; Mariani, Joseph ; Mazo, Hélène ; Moreno, Asuncion ; Odijk, Jan ; Piperidis, Stelios (ur.).
Marseille: European Language Resources Association (ELRA), 2020. str. 1866-1873 (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Alves, D., Thakkar, G. & Tadić, M. (2020) Evaluating Language Tools for Fifteen EU- official Under-resourced Languages. U: Calzolari, N., Béchet, F., Blache, P., Choukri, K., Cieri, C., Declerck, T., Goggi, S., Isahara, H., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J. & Piperidis, S. (ur.)Proceedings of The 12th Language Resources and Evaluation Conference.
@article{article, author = {Alves, Diego and Thakkar, Gaurish and Tadi\'{c}, Marko}, year = {2020}, pages = {1866-1873}, keywords = {language processing chains, under-resourced languages, evaluation}, title = {Evaluating Language Tools for Fifteen EU- official Under-resourced Languages}, keyword = {language processing chains, under-resourced languages, evaluation}, publisher = {European Language Resources Association (ELRA)}, publisherplace = {Marseille, Francuska} }
@article{article, author = {Alves, Diego and Thakkar, Gaurish and Tadi\'{c}, Marko}, year = {2020}, pages = {1866-1873}, keywords = {language processing chains, under-resourced languages, evaluation}, title = {Evaluating Language Tools for Fifteen EU- official Under-resourced Languages}, keyword = {language processing chains, under-resourced languages, evaluation}, publisher = {European Language Resources Association (ELRA)}, publisherplace = {Marseille, Francuska} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Conference Proceedings Citation Index - Science (CPCI-S)
    • Conference Proceedings Citation Index - Social Sciences & Humanities (CPCI-SSH)





Contrast
Increase Font
Decrease Font
Dyslexic Font