Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 418517

Generating a Morphological Lexicon of Organization Entity Names


Ljubešić, Nikola; Lauc, Tomislava; Boras, Damir
Generating a Morphological Lexicon of Organization Entity Names // Proceedings of the Sixth International Language Resources and Evaluation (LREC'08) / Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Daniel Tapias (ur.).
Marakeš: European Language Resources Association (ELRA), 2008. (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 418517 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Generating a Morphological Lexicon of Organization Entity Names

Autori
Ljubešić, Nikola ; Lauc, Tomislava ; Boras, Damir

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of the Sixth International Language Resources and Evaluation (LREC'08) / Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Daniel Tapias - Marakeš : European Language Resources Association (ELRA), 2008

ISBN
2-9517408-4-0

Skup
Sixth International Language Resources and Evaluation Conference

Mjesto i datum
Marrakesh, Maroko, 28.05.2008. - 30.05.2008

Vrsta sudjelovanja
Poster

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
morphological lexicon; lexicon generation; organization entity names; linear successive abstraction

Sažetak
This paper describes methods used for generating a morphological lexicon of organization entity names in Croatian. This resource is intended for two primary tasks: template-based natural language generation and named entity identification. The main problems concerning the lexicon generation are high level of inflection in Croatian and low linguistic quality of the primary resource containing named entities in normal form. The problem is divided into two subproblems concerning single- word and multi-word expressions. The single-word problem is solved by training a supervised learning algorithm called linear successive abstraction. With existing common language morphological resources and two simple hand-crafted rules backing up the algorithm, accuracy of 98.70% on the test set is achieved. The multi-word problem is solved through a semi- automated process for multi-word entities occurring in the first 10, 000 named entities. The generated multi-word lexicon will be used for natural language generation only while named entity identification will be solved algorithmically in forthcoming research. The single-word lexicon is capable of handling both tasks.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekti:
130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Boras, Damir, MZOS ) ( CroRIS)
130-1301799-1999 - Oblikovanje i upravljanje javnim znanjem u informacijskom prostoru (Tuđman, Miroslav, MZOS ) ( CroRIS)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Nikola Ljubešić (autor)

Avatar Url Tomislava Lauc (autor)

Avatar Url Damir Boras (autor)

Citiraj ovu publikaciju:

Ljubešić, Nikola; Lauc, Tomislava; Boras, Damir
Generating a Morphological Lexicon of Organization Entity Names // Proceedings of the Sixth International Language Resources and Evaluation (LREC'08) / Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Daniel Tapias (ur.).
Marakeš: European Language Resources Association (ELRA), 2008. (poster, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Ljubešić, N., Lauc, T. & Boras, D. (2008) Generating a Morphological Lexicon of Organization Entity Names. U: Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odjik, Stelios Piperidis, Daniel Tapias (ur.)Proceedings of the Sixth International Language Resources and Evaluation (LREC'08).
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Lauc, Tomislava and Boras, Damir}, year = {2008}, keywords = {morphological lexicon, lexicon generation, organization entity names, linear successive abstraction}, isbn = {2-9517408-4-0}, title = {Generating a Morphological Lexicon of Organization Entity Names}, keyword = {morphological lexicon, lexicon generation, organization entity names, linear successive abstraction}, publisher = {European Language Resources Association (ELRA)}, publisherplace = {Marrakesh, Maroko} }
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Lauc, Tomislava and Boras, Damir}, year = {2008}, keywords = {morphological lexicon, lexicon generation, organization entity names, linear successive abstraction}, isbn = {2-9517408-4-0}, title = {Generating a Morphological Lexicon of Organization Entity Names}, keyword = {morphological lexicon, lexicon generation, organization entity names, linear successive abstraction}, publisher = {European Language Resources Association (ELRA)}, publisherplace = {Marrakesh, Maroko} }




Contrast
Increase Font
Decrease Font
Dyslexic Font