Bibliographic record no. 906825

Contextual Spellchecking Based on N-grams


Srdić, Ivan; Gledec, Gordan
Contextual Spellchecking Based on N-grams // Proceedings of the Central European Conference on Information and Intelligent Systems / Strahonja, Vjeran ; Kirinić, Valentina (eds.).
Varaždin: Fakultet organizacije i informatike Sveučilišta u Zagrebu, 2017, pp. 29-33 (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 906825

Title
Contextual Spellchecking Based on N-grams

Authors
Srdić, Ivan ; Gledec, Gordan

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the Central European Conference on Information and Intelligent Systems / Strahonja, Vjeran ; Kirinić, Valentina - Varaždin : Fakultet organizacije i informatike Sveučilišta u Zagrebu, 2017, 29-33

Conference
Central European Conference on Information and Intelligent Systems

Place and date
Varaždin, Croatia, 27.09.2017 - 29.09.2017

Type of participation
Lecture

Type of review
International peer review

Keywords
Contextual spellchecking ; statistical approach ; n-grams

Abstract
The Croatian Academic Spellchecker is an online web service that has been used for almost 20 years by thousands of users every day. In recent years, the service enabled rudimentary contextual spellchecking, based on pattern matching. In this paper we describe how it is possible to perform n-gram based contextual spellchecking of texts written in Croatian, regardless of the orthographic complexity of the Croatian language. A simple upgrade of the existing implementation was achieved by separating the system into several components. Using a well-known classifier, tweaking the frequency estimator and separating errors into confusion sets resulted in a contextual spellchecking system with a high score of F1 = 0.95 on the examined example.
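The abstract names three ingredients (confusion sets, an n-gram frequency estimate, and an F1 evaluation) without giving details, so the following is only a minimal sketch of the general idea, not the authors' system. Everything in it is an assumption made for illustration: the toy English corpus, the single confusion set, the add-one smoothed bigram score used in place of the unnamed classifier and frequency estimator, and the function names.

```python
from collections import Counter

# Hypothetical confusion set and toy corpus; neither comes from the paper.
CONFUSION_SETS = [{"their", "there"}]
CORPUS = [
    "they lost their keys",
    "their house is over there",
    "there is a book on the table",
    "is there a problem",
]

def train(sentences):
    """Count unigrams and bigrams, padding each sentence with boundary marks."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def bigram_prob(prev, word, unigrams, bigrams):
    """Add-one smoothed estimate of P(word | prev)."""
    vocab_size = len(unigrams)
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)

def score(candidate, left, right, unigrams, bigrams):
    """Score a candidate by how well it fits its left and right neighbours."""
    return (bigram_prob(left, candidate, unigrams, bigrams)
            * bigram_prob(candidate, right, unigrams, bigrams))

def correct(sentence, unigrams, bigrams):
    """Replace each confusable word with the best-scoring member of its set."""
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    for i in range(1, len(tokens) - 1):
        for confusion_set in CONFUSION_SETS:
            if tokens[i] in confusion_set:
                # sorted() makes tie-breaking deterministic
                tokens[i] = max(
                    sorted(confusion_set),
                    key=lambda w: score(w, tokens[i - 1], tokens[i + 1],
                                        unigrams, bigrams),
                )
    return " ".join(tokens[1:-1])

def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall; 0.0 when undefined."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

if __name__ == "__main__":
    unigrams, bigrams = train(CORPUS)
    print(correct("they lost there keys", unigrams, bigrams))
    print(correct("is their a problem", unigrams, bigrams))
```

Restricting candidates to a confusion set keeps the decision small: the model only has to rank a handful of known alternatives in context rather than search the whole vocabulary. A realistic system would need a much larger corpus and a better smoothing scheme than add-one.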

Original language
English

Scientific fields
Computing



WORK AFFILIATIONS


Institutions:
Fakultet elektrotehnike i računarstva, Zagreb

Profiles:

Gordan Gledec (author)

Ivan Srdić (author)

Cite this publication:

Srdić, I. & Gledec, G. (2017) Contextual Spellchecking Based on N-grams. In: Strahonja, V. & Kirinić, V. (eds.) Proceedings of the Central European Conference on Information and Intelligent Systems.
@inproceedings{article,
  author = {Srdi\'{c}, Ivan and Gledec, Gordan},
  title = {Contextual Spellchecking Based on N-grams},
  booktitle = {Proceedings of the Central European Conference on Information and Intelligent Systems},
  editor = {Strahonja, Vjeran and Kirini\'{c}, Valentina},
  year = {2017},
  pages = {29--33},
  publisher = {Fakultet organizacije i informatike Sveu\v{c}ili\v{s}ta u Zagrebu},
  address = {Vara\v{z}din, Hrvatska},
  keywords = {Contextual spellchecking, statistical approach, n-grams}
}