DOCUMENT IMAGE UNDERSTANDING

A neural network training method and a document image understanding method is provided. The neural network training method includes: acquiring text comprehensive features of a plurality of first texts in an original image; replacing at least one original region in the original image to obtain a samp...

Full description

Saved in:
Bibliographic Details
Main Authors CAO, Yuhui, LUO, Bin, PENG, Qiming, FENG, Shikun, CHEN, Yongfeng
Format Patent
LanguageEnglish
Published 08.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A neural network training method and a document image understanding method is provided. The neural network training method includes: acquiring text comprehensive features of a plurality of first texts in an original image; replacing at least one original region in the original image to obtain a sample image including a plurality of first regions and a ground truth label for indicating whether each first region is a replaced region; acquiring image comprehensive features of the plurality of first regions; inputting the text comprehensive features of the plurality of first texts and the image comprehensive features of the plurality of first regions into a neural network model together to obtain text representation features of the plurality of first texts; determining a predicted label based on the text representation features of the plurality of first texts; and training the neural network model based on the ground truth label and the predicted label.
Bibliography:Application Number: US202218063564