Gaining Awareness of Different Causes of OCR Errors: Examining Messy Data
Document Type
Article
Source Publication Title
Journal of Interactive Technology and Pedagogy
Abstract
Individual experience with how OCR programs choose letters gives students insight into how text recognition distinguishes shapes within their documents. Working in an interactive mode helps them assess how font size, letter shape, and copy quality can affect the success of optical character recognition when texts are digitized, especially from low-quality copies.
Disciplines
Computational Linguistics | Digital Humanities | Linguistics
Publication Date
2021
Language
English
License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Recommended Citation
Stvan, Laurel Smith, "Gaining Awareness of Different Causes of OCR Errors: Examining Messy Data" (2021). Linguistics & TESOL Faculty Publications & Presentations. 46.
https://mavmatrix.uta.edu/linguistics_tesol_facpubs/46