Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi

Optimisation of archival processes involving digitisation of typewritten documents (CROSBI ID 285224)

Prilog u časopisu | izvorni znanstveni rad | međunarodna recenzija

Stančić, Hrvoje ; Trbušić, Željko Optimisation of archival processes involving digitisation of typewritten documents // Aslib journal of information management, 72 (2020), 4; 545-559. doi: 10.1108/ajim-11-2019-0326

Podaci o odgovornosti

Stančić, Hrvoje ; Trbušić, Željko

engleski

Optimisation of archival processes involving digitisation of typewritten documents

Purpose – The authors investigate optical character recognition (OCR) technology and discuss its implementation in the context of digitisation of archival materials. Design/methodology/approach – The typewritten transcripts of the Croatian Writers’ Society from the mid-60s of the 20th century are used as the test data. The optimal digitisation setup is investigated in order to obtain the best OCR results. This was done by using the sample of 123 pages digitised at different resolution settings and binarisation levels. Findings – A series of tests showed that different settings produce significantly different results. The best OCR accuracy achieved at the test sample of the typewritten documents was 95.02%. The results show that the resolution is significantly more important than binarisation pre-processing procedure for achieving better OCR results. Originality/value – Based on the research results, the authors give recommendations for achieving optimal digitisation process setup with the aim of increasing the quality of OCR results. Finally, the authors put the research results in the context of digitisation of cultural heritage in general and discuss further investigation possibilities.

Digitisation, Optical character recognition, Resolution, Binarisation, Typewritten documents, Archival materials, Cultural heritage

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o izdanju

72 (4)

2020.

545-559

objavljeno

2050-3806

10.1108/ajim-11-2019-0326

Povezanost rada

Informacijske i komunikacijske znanosti

Poveznice
Indeksiranost