Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1089336

Optimisation of archival processes involving digitisation of typewritten documents


Stančić, Hrvoje; Trbušić, Željko
Optimisation of archival processes involving digitisation of typewritten documents // Aslib Journal of Information Management, 72 (2020), 4; 545-559 doi:10.1108/ajim-11-2019-0326 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1089336 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Optimisation of archival processes involving digitisation of typewritten documents

Autori
Stančić, Hrvoje ; Trbušić, Željko

Izvornik
Aslib Journal of Information Management (2050-3806) 72 (2020), 4; 545-559

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
Digitisation, Optical character recognition, Resolution, Binarisation, Typewritten documents, Archival materials, Cultural heritage

Sažetak
Purpose – The authors investigate optical character recognition (OCR) technology and discuss its implementation in the context of digitisation of archival materials. Design/methodology/approach – The typewritten transcripts of the Croatian Writers’ Society from the mid-60s of the 20th century are used as the test data. The optimal digitisation setup is investigated in order to obtain the best OCR results. This was done by using the sample of 123 pages digitised at different resolution settings and binarisation levels. Findings – A series of tests showed that different settings produce significantly different results. The best OCR accuracy achieved at the test sample of the typewritten documents was 95.02%. The results show that the resolution is significantly more important than binarisation pre-processing procedure for achieving better OCR results. Originality/value – Based on the research results, the authors give recommendations for achieving optimal digitisation process setup with the aim of increasing the quality of OCR results. Finally, the authors put the research results in the context of digitisation of cultural heritage in general and discuss further investigation possibilities.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Ustanove:
Hrvatska akademija znanosti i umjetnosti,
Filozofski fakultet, Zagreb

Profili:

Avatar Url Željko Trbušić (autor)

Avatar Url Hrvoje Stančić (autor)

Poveznice na cjeloviti tekst rada:

doi www.emerald.com

Citiraj ovu publikaciju:

Stančić, Hrvoje; Trbušić, Željko
Optimisation of archival processes involving digitisation of typewritten documents // Aslib Journal of Information Management, 72 (2020), 4; 545-559 doi:10.1108/ajim-11-2019-0326 (međunarodna recenzija, članak, znanstveni)
Stančić, H. & Trbušić, Ž. (2020) Optimisation of archival processes involving digitisation of typewritten documents. Aslib Journal of Information Management, 72 (4), 545-559 doi:10.1108/ajim-11-2019-0326.
@article{article, author = {Stan\v{c}i\'{c}, Hrvoje and Trbu\v{s}i\'{c}, \v{Z}eljko}, year = {2020}, pages = {545-559}, DOI = {10.1108/ajim-11-2019-0326}, keywords = {Digitisation, Optical character recognition, Resolution, Binarisation, Typewritten documents, Archival materials, Cultural heritage}, journal = {Aslib Journal of Information Management}, doi = {10.1108/ajim-11-2019-0326}, volume = {72}, number = {4}, issn = {2050-3806}, title = {Optimisation of archival processes involving digitisation of typewritten documents}, keyword = {Digitisation, Optical character recognition, Resolution, Binarisation, Typewritten documents, Archival materials, Cultural heritage} }
@article{article, author = {Stan\v{c}i\'{c}, Hrvoje and Trbu\v{s}i\'{c}, \v{Z}eljko}, year = {2020}, pages = {545-559}, DOI = {10.1108/ajim-11-2019-0326}, keywords = {Digitisation, Optical character recognition, Resolution, Binarisation, Typewritten documents, Archival materials, Cultural heritage}, journal = {Aslib Journal of Information Management}, doi = {10.1108/ajim-11-2019-0326}, volume = {72}, number = {4}, issn = {2050-3806}, title = {Optimisation of archival processes involving digitisation of typewritten documents}, keyword = {Digitisation, Optical character recognition, Resolution, Binarisation, Typewritten documents, Archival materials, Cultural heritage} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • Social Science Citation Index (SSCI)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font