Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 418550

Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction


Ljubešić, Nikola; Bakarić, Nikola; Lauc, Tomislava
Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction // MIPRO 2008 / Bogunović, Nikola ; Ribarić, Slobodan (ur.).
Rijeka: Croatian Society for Information and Communication Technology, 2008. str. 190-193 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 418550 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction

Autori
Ljubešić, Nikola ; Bakarić, Nikola ; Lauc, Tomislava

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

ISBN
978-953-233-038-0

Skup
MIPRO 2008

Mjesto i datum
Opatija, Hrvatska, 26-30.05.2008

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
inflectional morphology ; supervised learning ; linear successive abstraction ; morphological paradigm assignment ; named entity

Sažetak
This paper describes how a supervised learning method is used for assigning inflectional paradigms to organization entity names as the main prerequisite for generating a morphological lexicon of these named entities. An inflectional paradigm consists of a set of rules for generating all forms of a lexicon entry. A morphological lexicon consists of lexicon entries and their corresponding forms. This type of language resource is crucial in tasks such as natural language generation (generating natural language business news from database data and news templates) and named entity identification (necessary step in data mining and business intelligence). The basic resource used in this research is a list of 106, 530 named entities of organizations given in basic form (nominative case) and ranked by relevance. On the first 5, 000 manually tagged named entities 59 inflectional paradigm classes are defined. Using linear successive abstraction, a suffix model is trained, validated and tested on this tagged dataset. Morphological lexica of general language, personal names and settlements are used as additional resources in the decision process. The achieved accuracy on the test set is 98.70%.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekti:
MZOS-130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Boras, Damir, MZOS ) ( POIROT)
MZOS-130-1301799-1999 - Oblikovanje i upravljanje javnim znanjem u informacijskom prostoru (Tuđman, Miroslav, MZOS ) ( POIROT)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Tomislava Lauc (autor)

Avatar Url Nikola Ljubešić (autor)

Avatar Url Nikola Bakaric (autor)


Citiraj ovu publikaciju:

Ljubešić, Nikola; Bakarić, Nikola; Lauc, Tomislava
Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction // MIPRO 2008 / Bogunović, Nikola ; Ribarić, Slobodan (ur.).
Rijeka: Croatian Society for Information and Communication Technology, 2008. str. 190-193 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Ljubešić, N., Bakarić, N. & Lauc, T. (2008) Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction. U: Bogunović, N. & Ribarić, S. (ur.)MIPRO 2008.
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Bakari\'{c}, Nikola and Lauc, Tomislava}, year = {2008}, pages = {190-193}, keywords = {inflectional morphology, supervised learning, linear successive abstraction, morphological paradigm assignment, named entity}, isbn = {978-953-233-038-0}, title = {Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction}, keyword = {inflectional morphology, supervised learning, linear successive abstraction, morphological paradigm assignment, named entity}, publisher = {Croatian Society for Information and Communication Technology}, publisherplace = {Opatija, Hrvatska} }
@article{article, author = {Ljube\v{s}i\'{c}, Nikola and Bakari\'{c}, Nikola and Lauc, Tomislava}, year = {2008}, pages = {190-193}, keywords = {inflectional morphology, supervised learning, linear successive abstraction, morphological paradigm assignment, named entity}, isbn = {978-953-233-038-0}, title = {Assigning Inflectional Paradigms to Named Entities by Linear Successive Abstraction}, keyword = {inflectional morphology, supervised learning, linear successive abstraction, morphological paradigm assignment, named entity}, publisher = {Croatian Society for Information and Communication Technology}, publisherplace = {Opatija, Hrvatska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font