How Good Can It Get? Analysing and Improving OCR Accuracy in Large Scale Historic Newspaper Digitisation Programs
Details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs. Gives a state of the art overview of how OCR software works on newspapers, factors that effect...
Saved in:
Published in | D-Lib magazine Vol. 15; no. 3/4 |
---|---|
Main Author | |
Format | Journal Article |
Language | English |
Published |
01.03.2009
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Details the work undertaken by the National Library of Australia Newspaper Digitisation Program on identifying and testing solutions to improve OCR accuracy in large scale newspaper digitisation programs. Gives a state of the art overview of how OCR software works on newspapers, factors that effect OCR accuracy, methods of measuring accuracy, methods of improving accuracy, and testing methods and results for specific solutions that were considered viable for large scale text digitisation projects. Source: National Library of New Zealand Te Puna Matauranga o Aotearoa, licensed by the Department of Internal Affairs for re-use under the Creative Commons Attribution 3.0 New Zealand Licence. |
---|---|
Bibliography: | Refs ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1082-9873 1082-9873 |
DOI: | 10.1045/march2009-holley |