Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 792825

Vector Disambiguation for Translation Extraction from Comparable Corpora


Apidianaki, Marianna; Ljubešić, Nikola; Fišer, Darja
Vector Disambiguation for Translation Extraction from Comparable Corpora // Informatica (Ljubljana), 37 (2013), 2; 193-201 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 792825 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Vector Disambiguation for Translation Extraction from Comparable Corpora

Autori
Apidianaki, Marianna ; Ljubešić, Nikola ; Fišer, Darja

Izvornik
Informatica (Ljubljana) (0350-5596) 37 (2013), 2; 193-201

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
word sense disambiguation; sense clustering; comparable corpora

Sažetak
We present a new data-driven approach for enhancing the extraction of translation equivalents from comparable corpora which exploits bilingual lexico-semantic knowledge harvested from a parallel corpus. First, the bilingual lexicon obtained from word-aligning the parallel corpus replaces an external seed dictionary, making the approach knowledge-light and portable. Next, instead of using simple one-to-one mappings between the source and the target language, translation equivalents are clustered into sets of synonyms by a cross-lingual Word Sense Induction method. The obtained sense clusters enable us to expand the translation of vector features with several translation variants using a cross-lingual Word Sense Disambiguation method. Consequently, the vector features are disambiguated and translated with the translation variants included in the semantically most appropriate cluster, thus producing less noisy and richer vectors that allow for a more successful cross-lingual vector comparison than in previous methods.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Nikola Ljubešić (autor)

Citiraj ovu publikaciju:

Apidianaki, Marianna; Ljubešić, Nikola; Fišer, Darja
Vector Disambiguation for Translation Extraction from Comparable Corpora // Informatica (Ljubljana), 37 (2013), 2; 193-201 (međunarodna recenzija, članak, znanstveni)
Apidianaki, M., Ljubešić, N. & Fišer, D. (2013) Vector Disambiguation for Translation Extraction from Comparable Corpora. Informatica (Ljubljana), 37 (2), 193-201.
@article{article, author = {Apidianaki, Marianna and Ljube\v{s}i\'{c}, Nikola and Fi\v{s}er, Darja}, year = {2013}, pages = {193-201}, keywords = {word sense disambiguation, sense clustering, comparable corpora}, journal = {Informatica (Ljubljana)}, volume = {37}, number = {2}, issn = {0350-5596}, title = {Vector Disambiguation for Translation Extraction from Comparable Corpora}, keyword = {word sense disambiguation, sense clustering, comparable corpora} }
@article{article, author = {Apidianaki, Marianna and Ljube\v{s}i\'{c}, Nikola and Fi\v{s}er, Darja}, year = {2013}, pages = {193-201}, keywords = {word sense disambiguation, sense clustering, comparable corpora}, journal = {Informatica (Ljubljana)}, volume = {37}, number = {2}, issn = {0350-5596}, title = {Vector Disambiguation for Translation Extraction from Comparable Corpora}, keyword = {word sense disambiguation, sense clustering, comparable corpora} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Emerging Sources Citation Index (ESCI)
  • Scopus





Contrast
Increase Font
Decrease Font
Dyslexic Font