Algorithm for classification of textual documents represented by Tandem analysis (CROSBI ID 615362)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Dobša, Jasminka
engleski
Algorithm for classification of textual documents represented by Tandem analysis
In this research is presented algorithm for classification of textual documents which are represented in the space of reduced dimension in respect to original bag of words representation. Algorithm is carried out in two steps: in the first step classification is conducted for documents represented in original bag of words representation, while in the second step classification is conducted for documents represented in the space of reduced dimension. Reduction of dimensionality is obtained also in two steps: in the first step documents are represented by usage of latent semantic indexing, while in the second step this representation is projected on the space of membership matrix defining a membership of documents in classes. Evaluation of algorithm is conducted on Reuters21578 collection of documents.
classification of textual documents ; latent semantic indexing ; Tandem analysis ; support vector machines
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
1-4.
2014.
objavljeno
Podaci o matičnoj publikaciji
Proceeding of Conference on Data Mining and Data Warehouses 2014
Grobelnik ; Marko ; Mladenić, Dunja
Podaci o skupu
Conference on Data Mining and Data Warehouses
predavanje
06.10.2014-06.10.2014
Ljubljana, Slovenija