Bibliographic record number: 1121014
Building and Evaluating Universal Named-Entity Recognition English Corpus
Building and Evaluating Universal Named-Entity Recognition English Corpus // Proceedings of the 2nd International Workshop on Cross-lingual Event-centric Open Analytics / Demidova, Elena ; Hakimov, Sherzod ; Winters, Jane ; Tadić, Marko (eds.).
Ljubljana, 2021, pp. 1-15 (lecture, international peer review, full paper (in extenso), scientific)
CROSBI ID: 1121014
Title
Building and Evaluating Universal Named-Entity Recognition English Corpus
Authors
Alves, Diego ; Thakkar, Gaurish ; Tadić, Marko
Type, subtype, and category of work
Papers in conference proceedings, full paper (in extenso), scientific
Source
Proceedings of the 2nd International Workshop on Cross-lingual Event-centric Open Analytics
/ Demidova, Elena ; Hakimov, Sherzod ; Winters, Jane ; Tadić, Marko - Ljubljana, 2021, 1-15
Conference
2nd International Workshop on Cross-lingual Event-centric Open Analytics (CLEOPATRA 2021)
Place and date
Ljubljana, Slovenia, 12 April 2021
Type of participation
Lecture
Type of review
International peer review
Keywords
universal named entities recognition ; cross-lingual information ; event-centric analytics
Abstract
This article presents the application of the Universal Named Entity framework to generate automatically annotated corpora. By using a workflow that extracts Wikipedia data and meta-data and DBpedia information, we generated an English dataset which is described and evaluated. Furthermore, we conducted a set of experiments to improve the annotations in terms of precision, recall, and F1-measure. The final dataset is available and the established workflow can be applied to any language with existing Wikipedia and DBpedia. As part of future research, we intend to continue improving the annotation process and extend it to other languages.
Original language
English
Scientific fields
Information and communication sciences, Philology
AFFILIATIONS OF THE WORK
Projects:
EK-H2020-812997 - Cross-lingual Event-centric Open Analytics Research Academy (Cleopatra) (Tadić, Marko, EK - H2020-MSCA-ITN-2018) (CroRIS)
Institutions:
Filozofski fakultet, Zagreb
Profiles:
Diego Fernando Valio Antunes Alves
(author)
Gaurish Pandurang Thakkar
(author)
Marko Tadić
(author)