Gaining Awareness of Different Causes of OCR Errors: Examining Messy Data

ORCID Identifier(s)

0000-0003-0833-6871

Document Type

Article

Source Publication Title

Journal of Interactive Technology and Pedagogy

Abstract

Hands-on experience with how OCR programs choose between candidate letters gives students insight into how text recognition distinguishes shapes within their documents. The interactive mode helps them assess how font size, letter shape, and copy quality can affect the success of optical character recognition when texts are digitized, especially from low-quality copies.

Disciplines

Computational Linguistics | Digital Humanities | Linguistics

Publication Date

2021

Language

English

License

Creative Commons Attribution 4.0 International License
