DOCUMENT IMAGE UNDERSTANDING
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published | 08.06.2023 |
Subjects | |
Summary: | A neural network training method and a document image understanding method are provided. The neural network training method includes: acquiring text comprehensive features of a plurality of first texts in an original image; replacing at least one original region in the original image to obtain a sample image including a plurality of first regions and a ground truth label for indicating whether each first region is a replaced region; acquiring image comprehensive features of the plurality of first regions; inputting the text comprehensive features of the plurality of first texts and the image comprehensive features of the plurality of first regions into a neural network model together to obtain text representation features of the plurality of first texts; determining a predicted label based on the text representation features of the plurality of first texts; and training the neural network model based on the ground truth label and the predicted label. |
---|---|
Bibliography: | Application Number: US202218063564 |
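
The region-replacement and labeling steps in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `make_sample` and `binary_cross_entropy`, the `replace_prob` parameter, the replacement pool, and the choice of binary cross-entropy as the training signal are all assumptions made for illustration. Feature extraction and the neural network model itself are omitted; predicted labels are represented as plain probabilities.

```python
import math
import random


def make_sample(regions, replacement_pool, replace_prob=0.3, rng=None):
    """Replace some regions and return (sample_regions, labels).

    labels[i] is 1 if region i was replaced, else 0. The names and the
    per-region replacement probability are hypothetical, not from the source.
    """
    rng = rng or random.Random()
    sample, labels = [], []
    for region in regions:
        if rng.random() < replace_prob:
            sample.append(rng.choice(replacement_pool))
            labels.append(1)
        else:
            sample.append(region)
            labels.append(0)
    # The summary requires at least one replaced region, so force one
    # replacement if the random pass replaced nothing.
    if not any(labels):
        i = rng.randrange(len(regions))
        sample[i] = rng.choice(replacement_pool)
        labels[i] = 1
    return sample, labels


def binary_cross_entropy(predicted, labels, eps=1e-7):
    """Mean binary cross-entropy between predicted probabilities and 0/1 labels.

    Assumed loss; the source only says training uses both labels.
    """
    total = 0.0
    for p, y in zip(predicted, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(labels)


if __name__ == "__main__":
    rng = random.Random(0)
    regions = ["region_0", "region_1", "region_2", "region_3"]
    sample, labels = make_sample(regions, ["foreign_patch"], rng=rng)
    # Stand-in for model output: one replaced-region probability per region.
    predicted = [0.5] * len(labels)
    print(sample, labels, binary_cross_entropy(predicted, labels))
```

In a real system the strings would be image crops, and `predicted` would come from the model's text representation features rather than a constant.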