Capturing clustering structure of the data in cross-language representation of textual documents (CROSBI ID 678412)
Prilog sa skupa u zborniku | sažetak izlaganja sa skupa | međunarodna recenzija
Podaci o odgovornosti
Dobša, Jasminka
engleski
Capturing clustering structure of the data in cross-language representation of textual documents
In the paper is proposed a new approach for representation of textual documents in lowerdimensional space. Approach aims at capturing clustering structure of the data in the goal of further application for cross-language classification of textual documents. The proposed approach is based on factorization of a term- document matrix by an iterative method of Reduced k-means clustering. Method of Reduced k- means aims at simultaneous reduction of objects (documents) and variables (index terms). Proposed method is compared to standard machine learning techniques of cross- language representation of textual documents by usage of latent semantic indexing and canonical correlation analysis.
clustering, textual documents, cross-language representation
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
31-31.
2019.
objavljeno
Podaci o matičnoj publikaciji
Book of Abstracts BIOSTAT 2019
Jazbec, A. ; Pecina, M. ; Sonicki, Z ; Šimić, D. ; Vedriš, M. ; Sović, S.
Zagreb: Hrvatsko biometrijsko društvo
1849-434X
Podaci o skupu
24th International Scientific Symposium on Biometrics (BIOSTAT 2019)
predavanje
06.06.2019-08.06.2019
Zagreb, Hrvatska