Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Informativeness of Inflective Noun Bigrams in Croatian (CROSBI ID 185100)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Jurić, Damir ; Banek, Marko ; Dembitz, Šandor Informativeness of Inflective Noun Bigrams in Croatian // Lecture notes in computer science, 7327 (2012), 114-123

Podaci o odgovornosti

Jurić, Damir ; Banek, Marko ; Dembitz, Šandor

engleski

Informativeness of Inflective Noun Bigrams in Croatian

A feature of Croatian and other Slavic languages is a rich inflection system, which does not exist in English and other languages that traditionally dominate the scientific focus of computational linguistics. In this paper we present the results of the experiments conducted on the corpus of the Croatian online spellchecker Hascheck, which point to using non-nominative cases for discovering collocations between two nouns, specifically the first name and the family name of a person. We analyzed the frequencies and conditional probabilities of the morphemes corresponding to Croatian cases and quantified the level of attraction between two words using the normalized pointwise mutual information measure. Two components of a personal name are more likely to co-occur in any of the non-nominative cases than in nominative. Furthermore, given a component of a personal name, the conditional probability that it is accompanied with the other component of the name are higher for the genitive/accusative and instrumental case than for nominative.

collocations; declension; named entity recognition; semantics; language technologies

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

7327

2012.

114-123

objavljeno

0302-9743

Povezanost rada

Elektrotehnika, Računarstvo

Indeksiranost