Improving Optical Character Recognition Performance for Low Quality Images (CROSBI ID 655607)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Brisinello, Matteo ; Grbić, Ratko ; Pul, Matija ; Anđelić, Tihomir
engleski
Improving Optical Character Recognition Performance for Low Quality Images
Efficient Optical Character Recognition (OCR) in images grabbed from Set-Top Boxes (STBs) plays an important role in STB testing. However, running OCR software on such images usually ends with low OCR performance since images can have low resolution, low image quality or colorful background. In order to improve OCR performance, four different image preprocessing methods are proposed. In this paper OCR is performed with Tesseract 3.5 and the relatively new Tesseract 4.0 on the images grabbed from different STBs. On the original images Tesseract 3.5 provides a 35.7% accuracy while Tesseract 4.0 attains a 70.2% accuracy. The proposed preprocessing methods improve OCR performance by 33.3% for Tesseract 3.5 and 22.6% for Tesseract 4.0 on the available images.
OCR ; Tesseract ; low quality images ; image preprocessing
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
1-5.
2017.
objavljeno
10.23919/ELMAR.2017.8124460
Podaci o matičnoj publikaciji
Proceedings of ELMAR-2017
Muštra, Mario ; Vitas, Dijana ; Zovko-Cihlar, Branka (ur.)
Zagreb:
1334-2630
Podaci o skupu
59th International Symposium ELMAR-2017
predavanje
18.09.2017-20.09.2017
Zagreb, Hrvatska