Content-based Recommender System for Textual Documents Written in Croatian (CROSBI ID 607588)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Ćavar, Ivana ; Kavran, Zvonko ; Jolić, Natalija ; Anđelović, Neven ; Cvitić, Ivan ; Gović, Marko
engleski
Content-based Recommender System for Textual Documents Written in Croatian
The paper describes a content-based recommender system that classifies textual documents written in Croatian. We describe how documents are pre- processed, including procedures of dimensionality reduction, selection of stop- words and creation of document-term matrix. For the text classification, a combination of v- fold cross validation and k - nearest neighbours (kNN) methods is used. This way, the ‘optimal’ value of k is firstly analyzed, and the results of v-fold cross validation are applied for the selection of value k. Results are given in the form of classification error analysis.
text mining; recommender system; k-nearest neighbour; content-based classification; document-term matrix.
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
25-29.
2013.
objavljeno
Podaci o matičnoj publikaciji
DATA ANALYTICS 2013, The Second International Conference on Data Analytics
Friedrich Laux, Reutlingen University, Germany
Porto: International Academy, Research, and Industry Association (IARIA)
978-1-61208-295-0
Podaci o skupu
DATA ANALYTICS 2013, The Second International Conference on Data Analytics
predavanje
01.01.2013-01.01.2013
Porto, Portugal