Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 920541

Disambiguation of Homograms in a Pitch Accent Language


Nacinovic Prskalo, Lucia; Brkic Bakaric, Marija
Disambiguation of Homograms in a Pitch Accent Language // Proceedings of 2017 International Conference on Computer Science and Artificial Intelligence CSAI 2017
New York: ACM, 2017. str. 32-37 doi:10.1145/3168390.3168409 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 920541 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Disambiguation of Homograms in a Pitch Accent Language

Autori
Nacinovic Prskalo, Lucia ; Brkic Bakaric, Marija

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
Proceedings of 2017 International Conference on Computer Science and Artificial Intelligence CSAI 2017 / - New York : ACM, 2017, 32-37

ISBN
978-1-4503-5392-2

Skup
International Conference on Computer Science and Artificial Intelligence (CSAI 2017)

Mjesto i datum
Jakarta, Indonesia, 5-7.12.2017

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
disambiguation of homograms ; lexicon of homograms ; word sense disambiguation ; pitch accent language ; homogram, homograph ; homophone ; Croatian language

Sažetak
The Croatian language is a pitch-accent language in which the tone contour realized in the stressed syllable carries the lexical information. Therefore, in some cases, different lexical accent gives the word a different meaning. In such cases, the ambiguity of the word in written texts where accents are not usually marked, can be solved by determining the appropriate accent. There are also cases when various basic and derived forms of words have different meanings, different morphosyntactic tags and possibly different accents. When words have same written forms, but different meanings, they are called homograms. In order to resolve the ambiguity of homograms, we created a lexicon of homograms which is comprised of all Croatian nouns of different gender which have the same written forms (if accents are not marked), but different meanings, morphosyntactic tags and possibly different accents. The lexicon consists of 19, 366 entries and 3, 460 unique homograms. Each entry in the lexicon comprises of the homogram (unaccented word), the corresponding MSD, the accented word and the accented lemma. The obtained lexicon enables us to identificate and disambiguate homograms within the corpus efficiently and accurately.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Sveučilište u Rijeci - Odjel za informatiku

Poveznice na cjeloviti tekst rada:

doi dl.acm.org

Citiraj ovu publikaciju:

Nacinovic Prskalo, Lucia; Brkic Bakaric, Marija
Disambiguation of Homograms in a Pitch Accent Language // Proceedings of 2017 International Conference on Computer Science and Artificial Intelligence CSAI 2017
New York: ACM, 2017. str. 32-37 doi:10.1145/3168390.3168409 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Nacinovic Prskalo, L. & Brkic Bakaric, M. (2017) Disambiguation of Homograms in a Pitch Accent Language. U: Proceedings of 2017 International Conference on Computer Science and Artificial Intelligence CSAI 2017 doi:10.1145/3168390.3168409.
@article{article, year = {2017}, pages = {32-37}, DOI = {10.1145/3168390.3168409}, keywords = {disambiguation of homograms, lexicon of homograms, word sense disambiguation, pitch accent language, homogram, homograph, homophone, Croatian language}, doi = {10.1145/3168390.3168409}, isbn = {978-1-4503-5392-2}, title = {Disambiguation of Homograms in a Pitch Accent Language}, keyword = {disambiguation of homograms, lexicon of homograms, word sense disambiguation, pitch accent language, homogram, homograph, homophone, Croatian language}, publisher = {ACM}, publisherplace = {Jakarta, Indonesia} }
@article{article, year = {2017}, pages = {32-37}, DOI = {10.1145/3168390.3168409}, keywords = {disambiguation of homograms, lexicon of homograms, word sense disambiguation, pitch accent language, homogram, homograph, homophone, Croatian language}, doi = {10.1145/3168390.3168409}, isbn = {978-1-4503-5392-2}, title = {Disambiguation of Homograms in a Pitch Accent Language}, keyword = {disambiguation of homograms, lexicon of homograms, word sense disambiguation, pitch accent language, homogram, homograph, homophone, Croatian language}, publisher = {ACM}, publisherplace = {Jakarta, Indonesia} }

Časopis indeksira:


  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font