Bibliographic record no.: 1106494

Mining Semantic Relations from Comparable Corpora through Intersections of Word Embeddings


Vintar, Špela; Grčić Simeunović, Larisa; Martinc, Matej; Pollak, Senja; Stepišnik, Uroš
Mining Semantic Relations from Comparable Corpora through Intersections of Word Embeddings // Proceedings of the 13th Workshop on Building and Using Comparable Corpora, Language Resources and Evaluation Conference (LREC 2020) / Rapp, Reinhard ; Zweigenbaum, Pierre ; Sharoff, Serge (eds.).
Marseille: European Language Resources Association (ELRA), 2020. pp. 29-34


CROSBI ID: 1106494

Title
Mining Semantic Relations from Comparable Corpora through Intersections of Word Embeddings

Authors
Vintar, Špela ; Grčić Simeunović, Larisa ; Martinc, Matej ; Pollak, Senja ; Stepišnik, Uroš

Type, subtype and category of work
Book chapters, scientific

Book
Proceedings of the 13th Workshop on Building and Using Comparable Corpora, Language Resources and Evaluation Conference (LREC 2020)

Editor(s)
Rapp, Reinhard ; Zweigenbaum, Pierre ; Sharoff, Serge

Publisher
European Language Resources Association (ELRA)

City
Marseille

Year
2020

Page range
29-34

ISBN
979-10-95546-42-9

Keywords
semantic relations, word embeddings, comparable corpus, karstology, frame-based terminology

Abstract
We report an experiment aimed at extracting words expressing a specific semantic relation using intersections of word embeddings. In a multilingual frame-based domain model, specific features of a concept are typically described through a set of non-arbitrary semantic relations. In karstology, our domain of choice, which we are exploring through a comparable corpus in English and Croatian, karst phenomena such as landforms are usually described through their FORM, LOCATION, CAUSE, FUNCTION and COMPOSITION. We propose an approach to mine words pertaining to each of these relations by using a small number of seed adjectives, for which we retrieve the closest words using word embeddings and then use intersections of these neighbourhoods to refine our search. Such cross-language expansion of semantically rich vocabulary is a valuable aid in improving the coverage of a multilingual knowledge base, but also in exploring differences between languages in their respective conceptualisations of the domain.
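
The seed-and-intersect idea described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the toy three-dimensional vectors, the words themselves, the helper names `cosine`, `neighbours` and `intersect_neighbourhoods`, and the neighbourhood size `k` are all hypothetical, and plain cosine similarity stands in for whatever embedding model and similarity measure the paper actually uses.

```python
import math

# Toy 3-dimensional vectors, invented purely for illustration.
# A real experiment would load embeddings trained on the corpus.
EMBEDDINGS = {
    "circular":  [0.90, 0.10, 0.00],
    "oval":      [0.80, 0.20, 0.10],
    "round":     [0.85, 0.15, 0.05],
    "elongated": [0.70, 0.30, 0.10],
    "coastal":   [0.10, 0.90, 0.10],
    "alpine":    [0.00, 0.80, 0.30],
    "limestone": [0.10, 0.20, 0.90],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def neighbours(word, k):
    """Return the set of the k words closest to `word`."""
    scored = [
        (cosine(EMBEDDINGS[word], vec), other)
        for other, vec in EMBEDDINGS.items()
        if other != word
    ]
    scored.sort(reverse=True)
    return {other for _, other in scored[:k]}


def intersect_neighbourhoods(seeds, k):
    """Keep only words that are among the k nearest neighbours of every seed."""
    common = set.intersection(*(neighbours(seed, k) for seed in seeds))
    return common - set(seeds)  # the seeds themselves are not new candidates


# Two hypothetical seed adjectives for the FORM relation.
candidates = intersect_neighbourhoods(["circular", "oval"], k=3)
print(sorted(candidates))  # → ['elongated', 'round']
```

Requiring a candidate to be close to every seed, rather than to any single seed, is what narrows the result towards words that share the relation the seeds have in common.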

Original language
English

Scientific fields
Philology



AFFILIATIONS


Institutions:
University of Zadar

Profiles:

Larisa Grčić Simeunović (author)

Links to the full text:

Full text access: www.aclweb.org

Cite this publication:

Vintar, Špela; Grčić Simeunović, Larisa; Martinc, Matej; Pollak, Senja; Stepišnik, Uroš
Mining Semantic Relations from Comparable Corpora through Intersections of Word Embeddings // Proceedings of the 13th Workshop on Building and Using Comparable Corpora, Language Resources and Evaluation Conference (LREC 2020) / Rapp, Reinhard ; Zweigenbaum, Pierre ; Sharoff, Serge (eds.).
Marseille: European Language Resources Association (ELRA), 2020. pp. 29-34
Vintar, Š., Grčić Simeunović, L., Martinc, M., Pollak, S. & Stepišnik, U. (2020) Mining Semantic Relations from Comparable Corpora through Intersections of Word Embeddings. In: Rapp, R., Zweigenbaum, P. & Sharoff, S. (eds.) Proceedings of the 13th Workshop on Building and Using Comparable Corpora, Language Resources and Evaluation Conference (LREC 2020). Marseille, European Language Resources Association (ELRA), pp. 29-34.
@inproceedings{vintar2020mining,
  author    = {Vintar, \v{S}pela and Gr\v{c}i\'{c} Simeunovi\'{c}, Larisa and Martinc, Matej and Pollak, Senja and Stepi\v{s}nik, Uro\v{s}},
  editor    = {Rapp, Reinhard and Zweigenbaum, Pierre and Sharoff, Serge},
  title     = {Mining Semantic Relations from Comparable Corpora through Intersections of Word Embeddings},
  booktitle = {Proceedings of the 13th Workshop on Building and Using Comparable Corpora, Language Resources and Evaluation Conference (LREC 2020)},
  publisher = {European Language Resources Association (ELRA)},
  address   = {Marseille},
  year      = {2020},
  pages     = {29--34},
  isbn      = {979-10-95546-42-9},
  keywords  = {semantic relations, word embeddings, comparable corpus, karstology, frame-based terminology}
}