How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs

Details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs. Gives a state of the art overview of how OCR software works on newspapers, factors that effect...

Full description

Saved in:
Bibliographic Details
Published inD-Lib magazine Vol. 15; no. 3/4
Main Author Holley, Rose
Format Journal Article
LanguageEnglish
Published 01.03.2009
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs. Gives a state of the art overview of how OCR software works on newspapers, factors that effect OCR accuracy, methods of measuring accuracy, methods of improving accuracy, and testing methods and results for specific solutions that were considered viable for large scale text digitisation projects. Source: National Library of New Zealand Te Puna Matauranga o Aotearoa, licensed by the Department of Internal Affairs for re-use under the Creative Commons Attribution 3.0 New Zealand Licence.
Bibliography:Refs
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1082-9873
1082-9873
DOI:10.1045/march2009-holley