Contextual Spellchecking Based on N-grams (CROSBI ID 655164)
Conference contribution in a journal | original scientific paper | international peer review
Responsibility data
Srdić, Ivan ; Gledec, Gordan
English
The Croatian Academic Spellchecker is a web service that has been used for almost 20 years by thousands of users every day. In recent years, the service has offered rudimentary contextual spellchecking based on pattern matching. In this paper we describe how n-gram based contextual spellchecking can be performed on texts written in Croatian, regardless of the orthographic complexity of the Croatian language. The existing implementation was upgraded in a simple way by separating the system into several components. Using a well-known classifier, tweaking the frequency estimator, and separating errors into confusion sets resulted in a contextual spellchecking system with a high score of F1 = 0.95 on the examined example.
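As an illustration of the general idea named in the abstract (confusion sets scored by n-gram frequencies), the following is a minimal sketch and not the authors' implementation. It assumes a bigram model with add-one smoothing as the frequency estimator and a simple highest-score rule in place of the classifier, neither of which is specified in this record. The toy English corpus, the confusion set, and the names `train`, `bigram_prob`, and `suggest` are all hypothetical.

```python
from collections import Counter

# Hypothetical toy corpus; a real system would be trained on a large Croatian corpus.
CORPUS = [
    "the weather is nice today",
    "i do not know whether he will come",
    "the weather was bad",
    "ask whether the weather is nice",
]

# Hypothetical confusion set: valid words that are easily mistaken for one another.
CONFUSION_SETS = [{"weather", "whether"}]


def train(sentences):
    """Count unigrams and bigrams, padding each sentence with boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def bigram_prob(prev, word, unigrams, bigrams):
    """Add-one (Laplace) smoothed estimate of P(word | prev); an assumed estimator."""
    vocab_size = len(unigrams)
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)


def suggest(tokens, i, unigrams, bigrams):
    """Return the confusion-set member that best fits the context around tokens[i]."""
    word = tokens[i]
    candidates = next((cs for cs in CONFUSION_SETS if word in cs), None)
    if candidates is None:
        return word  # the word belongs to no confusion set, so it is left alone
    padded = ["<s>"] + tokens + ["</s>"]
    prev_word, next_word = padded[i], padded[i + 2]

    def score(candidate):
        # Score a candidate by how well it fits both its left and right neighbour.
        return (bigram_prob(prev_word, candidate, unigrams, bigrams)
                * bigram_prob(candidate, next_word, unigrams, bigrams))

    # sorted() makes tie-breaking deterministic.
    return max(sorted(candidates), key=score)


unigrams, bigrams = train(CORPUS)
print(suggest("the whether is nice".split(), 1, unigrams, bigrams))
```

In this sketch only words that belong to a confusion set are ever reconsidered, so a correctly spelled word outside every set is returned unchanged; the context decides solely among members of the same set.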
Contextual spellchecking ; statistical approach ; n-grams
Contribution data
29-33.
2017.
published
Parent publication data
Central European conference on information and intelligent systems
Strahonja, Vjeran ; Kirinić, Valentina
Varaždin: Fakultet organizacije i informatike Sveučilišta u Zagrebu
1847-2001
1848-2295
Conference data
Central European Conference on Information and Intelligent Systems
lecture
27.10.2017-29.10.2017
Varaždin, Croatia