INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM

Bibliographic Details
Main Author	TODOROKI YUSUKE
Format	Patent
Language	English, Japanese
Published	16.05.2023
Subjects	CALCULATING; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; PHYSICS
Summary:	PROBLEM TO BE SOLVED: To more accurately divide read image data, obtained by collectively reading multiple documents, into image data for each individual document. SOLUTION: Pairs of page images are sequentially acquired from read image data obtained by collectively reading multiple documents page by page, and a document separation position is determined using a neural network model based on the text data of the two page images constituting each pair. For the determination, vectors corresponding to the tokens obtained by decomposing the text of each of the two page images are generated and input to the neural network model. The document separation position is then determined based on a score output from the neural network model, the score numerically representing the likelihood that the two page images constituting the pair belong to different documents. SELECTED DRAWING: Figure 2
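Illustration (not part of the patent record): the pairwise procedure in the summary can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the claimed implementation. The hash-based token vectors, the cosine-based `separation_score` (a stand-in for the trained neural network model), the 0.5 threshold, and all function names are hypothetical choices made only to show the control flow: tokenize each page's text, map tokens to vectors, score each adjacent page pair, and cut where the score is high.

```python
import hashlib
import math
import re
from typing import Callable, List, Sequence

DIM = 16  # hypothetical vector size; must not exceed 32 (SHA-256 digest length)


def tokenize(text: str) -> List[str]:
    """Decompose page text into lowercase word tokens."""
    return re.findall(r"\w+", text.lower())


def token_vector(token: str, dim: int = DIM) -> List[float]:
    """Deterministic pseudo-embedding derived from a hash.

    Assumption: stands in for learned token embeddings.
    """
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    return [(b / 255.0) * 2.0 - 1.0 for b in digest[:dim]]


def page_vector(text: str, dim: int = DIM) -> List[float]:
    """Average the token vectors of one page (zero vector if no tokens)."""
    tokens = tokenize(text)
    if not tokens:
        return [0.0] * dim
    vectors = [token_vector(t, dim) for t in tokens]
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def separation_score(page_a: str, page_b: str) -> float:
    """Score in [0, 1]; higher means the pages more likely differ in document.

    Assumption: a placeholder for the neural network model's output,
    computed here as (1 - cosine similarity) / 2.
    """
    va, vb = page_vector(page_a), page_vector(page_b)
    na = math.sqrt(sum(x * x for x in va))
    nb = math.sqrt(sum(x * x for x in vb))
    if na == 0.0 or nb == 0.0:
        return 0.5  # no text on a page: undecided
    cos = sum(x * y for x, y in zip(va, vb)) / (na * nb)
    return min(1.0, max(0.0, (1.0 - cos) / 2.0))


def split_documents(
    pages: Sequence[str],
    score_fn: Callable[[str, str], float] = separation_score,
    threshold: float = 0.5,  # hypothetical cut-off
) -> List[List[int]]:
    """Group page indices into documents by scoring each adjacent pair."""
    if not pages:
        return []
    documents = [[0]]
    for i in range(1, len(pages)):
        if score_fn(pages[i - 1], pages[i]) > threshold:
            documents.append([i])  # separation position before page i
        else:
            documents[-1].append(i)
    return documents
```

Passing the scorer in as `score_fn` keeps the splitting loop independent of the model, so a trained network could be substituted for the placeholder without changing `split_documents`.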
Bibliography:	Application Number: JP20210178618