Pregled bibliografske jedinice broj: 906825
Contextual Spellchecking Based on N-grams
Contextual Spellchecking Based on N-grams // Proceedings of the Central European Conference on Information and Intelligent Systems / Strahonja, Vjeran ; Kirinić, Valentina (ur.).
Varaždin: Fakultet organizacije i informatike Sveučilišta u Zagrebu, 2017. str. 29-33 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 906825 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Contextual Spellchecking Based on N-grams
Autori
Srdić, Ivan ; Gledec, Gordan
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of the Central European Conference on Information and Intelligent Systems
/ Strahonja, Vjeran ; Kirinić, Valentina - Varaždin : Fakultet organizacije i informatike Sveučilišta u Zagrebu, 2017, 29-33
Skup
Central European Conference on Information and Intelligent Systems
Mjesto i datum
Varaždin, Hrvatska, 27.09.2017. - 29.09.2017
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Contextual spellchecking ; statistical approach ; n-grams
Sažetak
Croatian Academic Spellchecker is an online web-service used for almost 20 years by thousands of users every day. In recent years, the service enabled rudimentary contextual spellchecking, based on pattern matching. In this paper we describe how it is possible to perform n-gram based contextual spellchecking of texts written in Croatian, regardless of the orthographic complexity of the Croatian language. Simple upgrade of the existing implementation was achieved by separating the system into several components. Using a well- known classifier, tweaking the frequency estimator and separating errors into confusion sets resulted in a contextual spellchecking system with a high score of F1 = 0.95 on the examined example.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb