Data Synthesis for Document Layout Analysis

Layout analysis plays an important role in various document image processing tasks such as OCR and document understanding, and the methods based on deep learning have achieved significant achievements. In recent years, pre-training and transfer learning techniques have become a common practice in a...

Full description

Saved in:

Bibliographic Details
Published in	Learning Technologies and Systems Vol. 12511; pp. 244 - 252
Main Authors	Wan, Lin, Zhou, Ju, Zhang, Bailing
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2021 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Automatic annotations Data synthesis Fine-tuning Heuristic rules Layout analysis
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Layout analysis plays an important role in various document image processing tasks such as OCR and document understanding, and the methods based on deep learning have achieved significant achievements. In recent years, pre-training and transfer learning techniques have become a common practice in a variety of computer vision and natural language processing tasks. In this paper, we present an efficient approach of data synthesis for pretraining deep learning models in document layout analysis. The synthesized data is automatically annotated based on heuristic rules, and then applied to the PubLayNet pre-trained models. The models are fine-tuned with real document layout data. Three types of document elements are taken into account: text lines, tables, and figures/images. The experiments demonstrate that the pre-training model with synthesized data is very effective for transfer learning on different document domains.
ISBN:	303066905X 9783030669058
ISSN:	0302-9743 1611-3349
DOI:	10.1007/978-3-030-66906-5_23