Finding the most representative Latent Dirichlet Allocation run for topic modelling (CROSBI ID 724181)
Prilog sa skupa u zborniku | sažetak izlaganja sa skupa | međunarodna recenzija
Podaci o odgovornosti
Gusić Munđar, Jelena ; Rako, Sabina ; Šlibar, Barbara
engleski
Finding the most representative Latent Dirichlet Allocation run for topic modelling
The number of research publications is growing exponentially making the extraction of meaningful information increasingly challenging. Natural language processing may provide a solution. Latent Dirichlet Allocation (LDA) is frequently used to detect topics in a corpus of documents. It relies on Monte Carlo methods for estimation, which introduces a replicability risk. Recently, an approach to stabilization of topic- term allocation was proposed and implemented in R LDAPrototype package. Stabilization is achieved by analysing the topic-term frequency matrices from a set of LDA replications and choosing the LDA replication that is the most representative for the set. Another approach might be to base the choice of the most representative LDA replication on the document-topic frequency matrices. The objective of this research is to compare the two approaches to stabilization of LDA results on a corpus of papers on learning analytics and educational data mining.
learning analytics ; topic modelling ; Latent Dirichlet Allocation ; stability of allocation
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
117-117.
2022.
objavljeno
Podaci o matičnoj publikaciji
Book of abstracts, 19th International conference on operational research KOI 2022
Mijač, Tea ; Šestanović, Tea
Zagreb:
1849-5141
1849-5141
Podaci o skupu
19th International Conference on Operational Research KOI 2022
predavanje
28.09.2022-30.09.2022
Šibenik, Hrvatska