Pregled bibliografske jedinice broj: 374805
Interacting Croatian NERC System and Intex/NooJ Environment
Interacting Croatian NERC System and Intex/NooJ Environment // Applications of Finite-State Language Processing: Selected Papers from the 2008 International NooJ Conference / Váradi, Tamás ; Kuti, Judit ; Silberztein, Max (ur.).
Newcastle upon Tyne: Cambridge Scholars Publishing, 2010. str. 21-29
CROSBI ID: 374805 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Interacting Croatian NERC System and Intex/NooJ Environment
Autori
Bekavac, Božo ; Agić, Željko ; Tadić, Marko
Vrsta, podvrsta i kategorija rada
Poglavlja u knjigama, znanstveni
Knjiga
Applications of Finite-State Language Processing: Selected Papers from the 2008 International NooJ Conference
Urednik/ci
Váradi, Tamás ; Kuti, Judit ; Silberztein, Max
Izdavač
Cambridge Scholars Publishing
Grad
Newcastle upon Tyne
Godina
2010
Raspon stranica
21-29
ISBN
978-1-4438-2573-3
Ključne riječi
named entity recognition, finite state transducers, Croatian language, Intex, NooJ
Sažetak
In this contribution, we present design and implementation details of an early version of Croatian finite state transducer engine called NercFst. The engine currently implements a small subset of Intex/NooJ finite state transducer functionality developed for the purpose of deriving a standalone module for named entity recognition and classification (NERC) system applicable to Croatian texts, previously created as a module in Intex. We also provide some general notes on the Intex module for Croatian NERC and notes on porting the module from Intex to NooJ. Current NercFst engine functionality overview is given in more detail along with some upcoming export features for NooJ which are currently under development with a purpose of supporting portability to various other open source finite state transducer libraries by exporting systems designed and implemented within Intex or NooJ linguistic development environment.
Izvorni jezik
Engleski
Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija
POVEZANOST RADA
Projekti:
130-1300646-0645 - Hrvatski jezični resursi i njihovo obilježavanje (Tadić, Marko, MZOS ) ( CroRIS)
130-1300646-1002 - Leksička semantika u izradi Hrvatskog WordNeta (Raffaelli, Ida, MZOS ) ( CroRIS)
130-1300646-1776 - Računalna sintaksa hrvatskoga jezika (Dovedan Han, Zdravko, MZOS ) ( CroRIS)
Ustanove:
Filozofski fakultet, Zagreb