SMART IDENTIFICATION OF INDICATOR TEXT WITH FULL-TEXT SEARCH OR OPTIMIZED DOCUMENT ANALYSIS

Bibliographic Details
Main Authors Hampp-Bahnmueller, Thomas; Saillet, Yannick; Baessler, Michael
Format Patent
Language English
Published 20.06.2024
Summary: Aspects of optimizing unstructured document analysis include operating a document system that comprises a plurality of documents with unstructured content and a full-text index; receiving a request to identify documents containing a type of data element; selecting a sample from the plurality of documents; determining data elements of the type in the sample; determining, from those data elements, an indicator context expression for the type; determining a query, based on the indicator context expression, for searching the full-text index with a search engine; and determining the documents in the document system that comply with the query.
Bibliography: Application Number: US202218068022
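
The steps in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that the type of data element is an email address found by a regular expression, that the full-text index is a plain in-memory inverted index, and that the indicator context expression is simply the most frequent word found just before a detected element. All names here (`EMAIL`, `build_index`, `indicator_terms`, `search`) and the sample documents are hypothetical.

```python
import re
from collections import Counter, defaultdict

# Assumption: the requested "type of data element" is an email address.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
TOKEN = re.compile(r"[a-z]+")


def build_index(docs):
    """Toy full-text index: maps each term to the ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in TOKEN.findall(text.lower()):
            index[term].add(doc_id)
    return index


def indicator_terms(docs, sample_ids, window=2, top_k=1):
    """Count the words that occur just before detected elements in the sample."""
    counts = Counter()
    for doc_id in sample_ids:
        text = docs[doc_id]
        for match in EMAIL.finditer(text):
            before = TOKEN.findall(text[:match.start()].lower())
            counts.update(before[-window:])
    return [term for term, _ in counts.most_common(top_k)]


def search(index, terms):
    """Evaluate an OR query: documents containing any of the terms."""
    hits = set()
    for term in terms:
        hits |= index.get(term, set())
    return hits


docs = {
    1: "Contact email: alice@example.com for details.",
    2: "Quarterly revenue grew by ten percent.",
    3: "Send the report to the email: bob@example.org today.",
    4: "Please reply by email: address withheld in this copy.",
}

index = build_index(docs)
sample_ids = [1, 3]                      # fixed sample, for reproducibility
terms = indicator_terms(docs, sample_ids)
matching = sorted(search(index, terms))  # documents compliant with the query
print(terms, matching)
```

In this toy run, the expensive detector only touches the two sampled documents, and the derived query then selects candidate documents from the index without scanning every text. A real system would likely draw the sample at random and use a search engine's own query language rather than a set union.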