Data Synthesis for Document Layout Analysis

Layout analysis plays an important role in various document image processing tasks such as OCR and document understanding, and the methods based on deep learning have achieved significant achievements. In recent years, pre-training and transfer learning techniques have become a common practice in a...

Full description

Saved in:
Bibliographic Details
Published inLearning Technologies and Systems Vol. 12511; pp. 244 - 252
Main Authors Wan, Lin, Zhou, Ju, Zhang, Bailing
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2021
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Layout analysis plays an important role in various document image processing tasks such as OCR and document understanding, and the methods based on deep learning have achieved significant achievements. In recent years, pre-training and transfer learning techniques have become a common practice in a variety of computer vision and natural language processing tasks. In this paper, we present an efficient approach of data synthesis for pretraining deep learning models in document layout analysis. The synthesized data is automatically annotated based on heuristic rules, and then applied to the PubLayNet pre-trained models. The models are fine-tuned with real document layout data. Three types of document elements are taken into account: text lines, tables, and figures/images. The experiments demonstrate that the pre-training model with synthesized data is very effective for transfer learning on different document domains.
ISBN:303066905X
9783030669058
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-030-66906-5_23