Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 967278

Cross-language information retrieval by reduced k- means


Dobša, Jasminka; Mladenić, Dunja; Rupnik, Jan; Radošević, Danijel; Magdalenić, Ivan
Cross-language information retrieval by reduced k- means // International Journal of Computer Information Systems and Industrial Managerment Applications, 10 (2018), 1; 314-322 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 967278 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Cross-language information retrieval by reduced k- means

Autori
Dobša, Jasminka ; Mladenić, Dunja ; Rupnik, Jan ; Radošević, Danijel ; Magdalenić, Ivan

Izvornik
International Journal of Computer Information Systems and Industrial Managerment Applications (2150-7988) 10 (2018), 1; 314-322

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
cross-language information retrieval, dimensionality reduction, latent semantic indexing, canonical correlation analysis, Reduced k-means

Sažetak
Cross-language information retrieval aims at retrieving relevant documents in one language for a query set in another language. Here we propose a new approach to the problem of cross-language information retrieval based on factorization of a term-document matrix by an iterative method of Reduced k-means clustering. Method of Reduced k- means intended at simultaneous reduction of objects (documents) and variables (index terms). Proposed method is compared to standard machine learning techniques of cross-language information retrieval by usage of latent semantic indexing and canonical correlation analysis. Motivation for usage of Reduced k-means method for a task of cross-language information retrieval comes from an observation that documents in a semantic space obtained by method of latent semantic indexing are clustered by their language and not by their topics in the first place. As Reduced k-means aims at preserving clustering structure of data, the idea is that the proposed method could address the mentioned problem.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Fakultet organizacije i informatike, Varaždin


Citiraj ovu publikaciju:

Dobša, Jasminka; Mladenić, Dunja; Rupnik, Jan; Radošević, Danijel; Magdalenić, Ivan
Cross-language information retrieval by reduced k- means // International Journal of Computer Information Systems and Industrial Managerment Applications, 10 (2018), 1; 314-322 (međunarodna recenzija, članak, znanstveni)
Dobša, J., Mladenić, D., Rupnik, J., Radošević, D. & Magdalenić, I. (2018) Cross-language information retrieval by reduced k- means. International Journal of Computer Information Systems and Industrial Managerment Applications, 10 (1), 314-322.
@article{article, author = {Dob\v{s}a, Jasminka and Mladeni\'{c}, Dunja and Rupnik, Jan and Rado\v{s}evi\'{c}, Danijel and Magdaleni\'{c}, Ivan}, year = {2018}, pages = {314-322}, keywords = {cross-language information retrieval, dimensionality reduction, latent semantic indexing, canonical correlation analysis, Reduced k-means}, journal = {International Journal of Computer Information Systems and Industrial Managerment Applications}, volume = {10}, number = {1}, issn = {2150-7988}, title = {Cross-language information retrieval by reduced k- means}, keyword = {cross-language information retrieval, dimensionality reduction, latent semantic indexing, canonical correlation analysis, Reduced k-means} }
@article{article, author = {Dob\v{s}a, Jasminka and Mladeni\'{c}, Dunja and Rupnik, Jan and Rado\v{s}evi\'{c}, Danijel and Magdaleni\'{c}, Ivan}, year = {2018}, pages = {314-322}, keywords = {cross-language information retrieval, dimensionality reduction, latent semantic indexing, canonical correlation analysis, Reduced k-means}, journal = {International Journal of Computer Information Systems and Industrial Managerment Applications}, volume = {10}, number = {1}, issn = {2150-7988}, title = {Cross-language information retrieval by reduced k- means}, keyword = {cross-language information retrieval, dimensionality reduction, latent semantic indexing, canonical correlation analysis, Reduced k-means} }

Uključenost u ostale bibliografske baze podataka::


  • INSPEC





Contrast
Increase Font
Decrease Font
Dyslexic Font