Automated Phonetic Transcription of Croatian Folklore Genres Using Supervised Machine Learning (CROSBI ID 683967)
Conference paper in proceedings | original scientific paper | international peer review
Responsibility details
Bakarić, Nikola ; Nikolić, Davor
English
Automated Phonetic Transcription of Croatian Folklore Genres Using Supervised Machine Learning
This paper explores the possibilities of automatic text transcription for preparing a corpus for further natural language processing analysis. The corpus contains various Croatian folklore genres. The goal of the transcription is to have one character represent one phoneme and to remove spaces between accentuated and non-accentuated words. The knowledge-independent system is trained using supervised learning methods and then applied to the rest of the corpus, using classifiers such as naïve Bayes, k-nearest neighbour, support vector machine, and others. The results are compared to a human-annotated sample to determine accuracy.
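To make the idea of transcription as supervised classification concrete, here is a minimal sketch of one possible framing: each character is labelled with its output symbol (or with an empty string when it is absorbed into a digraph), and a nearest-neighbour classifier predicts the label from a three-character context window. Everything specific here is an assumption for illustration and is not taken from the paper: the `KNNTranscriber` class, the window size, the position weights, the target symbols `ʎ` and `ɲ`, and the toy training words. Space removal between accentuated and non-accentuated words is not covered.

```python
from collections import Counter

PAD = "#"            # hypothetical padding symbol for word boundaries
WEIGHTS = (2, 4, 1)  # assumed weights for (previous, centre, next) mismatches

def windows(text):
    """Return one (previous, centre, next) context window per character."""
    padded = PAD + text + PAD
    return [(padded[i], padded[i + 1], padded[i + 2]) for i in range(len(text))]

def distance(a, b):
    """Weighted count of positions where two windows differ."""
    return sum(w for w, x, y in zip(WEIGHTS, a, b) if x != y)

class KNNTranscriber:
    """Illustrative k-nearest-neighbour character labeller (not the paper's system)."""

    def __init__(self, k=1):
        self.k = k
        self.examples = []  # list of (window, label) pairs

    def fit(self, pairs):
        # pairs: (source text, one label per source character)
        for text, labels in pairs:
            assert len(text) == len(labels), "one label per character"
            self.examples.extend(zip(windows(text), labels))
        return self

    def predict_char(self, window):
        nearest = sorted(self.examples, key=lambda ex: distance(ex[0], window))
        votes = Counter(label for _, label in nearest[: self.k])
        return votes.most_common(1)[0][0]

    def transcribe(self, text):
        return "".join(self.predict_char(w) for w in windows(text))

# Toy hand-annotated sample: digraphs "lj" and "nj" map to a single symbol,
# and the second letter of the digraph gets the empty label.
TRAINING = [
    ("ljubav", ["ʎ", "", "u", "b", "a", "v"]),
    ("polje",  ["p", "o", "ʎ", "", "e"]),
    ("konj",   ["k", "o", "ɲ", ""]),
    ("njiva",  ["ɲ", "", "i", "v", "a"]),
    ("jabuka", ["j", "a", "b", "u", "k", "a"]),
    ("nada",   ["n", "a", "d", "a"]),
    ("lav",    ["l", "a", "v"]),
]

model = KNNTranscriber(k=1).fit(TRAINING)
print(model.transcribe("konja"))  # koɲa
```

Accuracy against a human-annotated sample could then be estimated by comparing the predicted labels with the annotated ones character by character. Any of the other classifiers named in the abstract could be swapped in for the nearest-neighbour step while keeping the same per-character framing.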
text transcription ; automation ; natural language processing ; supervised learning ; Croatian folklore genres
Contribution details
129-133.
2019.
published
10.17234/INFUTURE.2019.16
Host publication details
Bago, Petra ; Hebrang Grgić, Ivana ; Ivanjko, Tomislav ; Juričić, Vedran ; Miklošević, Željka ; Stublić, Helena
Zagreb: Filozofski fakultet Sveučilišta u Zagrebu
2706-3518
Conference details
7th International Conference The Future of Information Sciences (INFuture 2019)
lecture
21.11.2019-22.11.2019
Zagreb, Croatia