Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

AI driven OCR: Resolving handwritten fonts recognizability problems (CROSBI ID 700367)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Bratić, Diana ; Stanić Loknar, Nikolina AI driven OCR: Resolving handwritten fonts recognizability problems // Proceedings of 10th International Symposium on Graphic Engineering and Design GRID 2020 / Dedijer, Sandra (ur.). Novi Sad: University of Novi Sad, Faculty of Technical Sciences, Department of Graphic Engineering and Design, 2020. str. 725-733

Podaci o odgovornosti

Bratić, Diana ; Stanić Loknar, Nikolina

engleski

AI driven OCR: Resolving handwritten fonts recognizability problems

Optical Character Recognition (OCR) is the electronic or mechanical conversion of images of typed, handwritten, or printed text into machine-encoded text. Advanced systems are capable to produce a high degree of recognition accuracy for most technic fonts, but when it comes to handwritten forms there is a problem occur in recognizing certain characters and limitations with conventional OCR processes persist. It is most pronounced in ascenders (k, b, l, d, h, t) and descenders (g, j, p, q, y). If the characters are linked by ligatures, the ascending and descending strokes are even less recognizable to the scanners. In order to reduce the likelihood of a recognition error, it is a necessary to create a large database of stored characters and their glyphs. Feature extraction decomposes glyphs into features like lines, closed loops, line direction, and line intersections. A Multilayer Perceptron (MLP) neural network based on Back Propagation Neural Network (BPNN) algorithm as a method of Artificial Intelligence (AI) has been used in text identification, classification and recognition using various methods: image pattern based, text-based, mark-based etc. Also, the application of AI generates of a large database of different letter cuts, and modifications, and variation of the same letter character structure. For this purpose, the recognizability test of handwritten fonts was performed. Within main group, subgroups of independent letter characters and letter characters linked by ligatures are created, and reading errors were observed. In each subgroup, four different font families (bold stroke, alternating stroke, monoline stroke, and brush stroke) were tested. In subgroup of independent letter characters, errors were observed in similar rounded lines such as the characters a, and e. In the subgroup of letter characters linked by ligatures, errors were also observed in similar rounded lines such as the letter characters a and e, m and n, but also in ascenders b and l, and descenders g and q. Furthermore, seven letter cuts were made from each basic test letters, and up to are thin, ultra-light, light, regular, semi-bold, bold, and ultra-bold, and stored in the existing EMNIST database. The scanning test was repeated, and recently obtained results showed a decrease in the deviation rate, i.e. higher accuracy. Reducing the number of deviations shows that the neural network gives acceptable answers but requires creation of a larger database within about 56, 000 different characters

OCR, handwritten font, letter character, artificial intelligence, MLP BPNN arhitecture

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

725-733.

2020.

objavljeno

Podaci o matičnoj publikaciji

Proceedings of 10th International Symposium on Graphic Engineering and Design GRID 2020

Dedijer, Sandra

Novi Sad: University of Novi Sad, Faculty of Technical Sciences, Department of Graphic Engineering and Design

978-86-6022-303-8

2620-1437

Podaci o skupu

10th International Symposium On Graphic Engineering And Design (GRID 2020)

poster

12.11.2020-14.11.2020

Novi Sad, Srbija

Povezanost rada

Grafička tehnologija, Informacijske i komunikacijske znanosti, Interdisciplinarne tehničke znanosti