Toggle Main Menu Toggle Search

Open Access padlockePrints

On the Use of Neural Text Generation for the Task of Optical Character Recognition

Lookup NU author(s): Dr Stephen McGough, Professor Boguslaw Obara

Downloads


Licence

This is the authors' accepted manuscript of a conference proceedings (inc. abstract) that has been published in its final definitive form by IEEE, 2019.

For re-use rights please refer to the publisher's terms and conditions.


Abstract

Optical Character Recognition (OCR), is extraction of textual data from scanned text documents to facilitate their indexing, searching, editing and to reduce storage space. Although OCR systems have improved significantly in recent years, they still suffer in situations where the OCR output does not match the text in the original document. Deep learning models have contributed positively to many problems but their full potential to many other problems are yet to be explored. In this paper we propose a post-processing approach based on the application deep learning to improve the accuracy of OCR system (minimizing the error rate). We report on the use of neural network language models to accomplish the task of correcting incorrectly predicted characters/words by OCR systems. We applied our approach to the IAM handwriting database. Our proposed approach delivers significant accuracy improvement of 20.41% in F-score, 10.86% in character level comparison using Levenshtein distance and 20.69% in document level comparison over previously reported context based OCR empirical results of IAM handwriting database.


Publication metadata

Author(s): Mohammadi M, Jaf S, McGough AS, Breckon TP, Matthews P, Theodoropoulos G, Obara B

Publication type: Conference Proceedings (inc. Abstract)

Publication status: Published

Conference Name: 16th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2019)

Year of Conference: 2019

Online publication date: 16/03/2020

Acceptance date: 06/10/2019

Date deposited: 06/10/2019

ISSN: 2161-5330

Publisher: IEEE

URL: https://doi.org/10.1109/AICCSA47632.2019.9035333

DOI: 10.1109/AICCSA47632.2019.9035333

Library holdings: Search Newcastle University Library for this item

ISBN: 9781728150529


Actions

Link to this publication


Share