Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction (CROSBI ID 552945)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Ljubešić, Nikola ; Bakarić, Nikola ; Lauc, Tomislava Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction / Bogunović, Nikola ; Ribarić, Slobodan (ur.). Rijeka: Hrvatsko informacijsko i dokumentacijsko društvo, 2008. str. 190-193

Podaci o odgovornosti

Ljubešić, Nikola ; Bakarić, Nikola ; Lauc, Tomislava

engleski

Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction

This paper describes how a supervised learning method is used for assigning inflectional paradigms to organization entity names as the main prerequisite for generating a morphological lexicon of these named entities. An inflectional paradigm consists of a set of rules for generating all forms of a lexicon entry. A morphological lexicon consists of lexicon entries and their corresponding forms. This type of language resource is crucial in tasks such as natural language generation (generating natural language business news from database data and news templates) and named entity identification (necessary step in data mining and business intelligence). The basic resource used in this research is a list of 106, 530 named entities of organizations given in basic form (nominative case) and ranked by relevance. On the first 5, 000 manually tagged named entities 59 inflectional paradigm classes are defined. Using linear successive abstraction, a suffix model is trained, validated and tested on this tagged dataset. Morphological lexica of general language, personal names and settlements are used as additional resources in the decision process. The achieved accuracy on the test set is 98.70%.

inflectional morphology ; supervised learning ; linear successive abstraction ; morphological paradigm assignment ; named entity

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

190-193.

2008.

nije evidentirano

objavljeno

978-953-233-038-0

Podaci o matičnoj publikaciji

Bogunović, Nikola ; Ribarić, Slobodan

Rijeka: Hrvatsko informacijsko i dokumentacijsko društvo

Podaci o skupu

MIPRO 2008

predavanje

25.05.2008-29.05.2008

Opatija, Hrvatska

Povezanost rada

Informacijske i komunikacijske znanosti