DOCUMENT IMAGE UNDERSTANDING

A neural network training method and a document image understanding method is provided. The neural network training method includes: acquiring text comprehensive features of a plurality of first texts in an original image; replacing at least one original region in the original image to obtain a samp...

Full description

Saved in:

Bibliographic Details
Main Authors	CAO, Yuhui, LUO, Bin, PENG, Qiming, FENG, Shikun, CHEN, Yongfeng
Format	Patent
Language	English
Published	08.06.2023
Subjects	CALCULATING COMPUTING COUNTING PHYSICS
Online Access	Get full text

Cover

Loading…

More Information
Summary:	A neural network training method and a document image understanding method is provided. The neural network training method includes: acquiring text comprehensive features of a plurality of first texts in an original image; replacing at least one original region in the original image to obtain a sample image including a plurality of first regions and a ground truth label for indicating whether each first region is a replaced region; acquiring image comprehensive features of the plurality of first regions; inputting the text comprehensive features of the plurality of first texts and the image comprehensive features of the plurality of first regions into a neural network model together to obtain text representation features of the plurality of first texts; determining a predicted label based on the text representation features of the plurality of first texts; and training the neural network model based on the ground truth label and the predicted label.
Bibliography:	Application Number: US202218063564