SMART IDENTIFICATION OF INDICATOR TEXT WITH FULL-TEXT SEARCH OR OPTIMIZED DOCUMENT ANALYSIS
Several aspects for optimizing unstructured document analysis comprise operating a document system, where the document system comprises a plurality of documents comprising unstructured content and a full-text index; receiving a request to identify documents comprising a type of data elements; select...
Saved in:
Main Authors | , , |
---|---|
Format | Patent |
Language | English |
Published |
20.06.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Several aspects for optimizing unstructured document analysis comprise operating a document system, where the document system comprises a plurality of documents comprising unstructured content and a full-text index; receiving a request to identify documents comprising a type of data elements; selecting a sample out of the plurality of documents; determining data elements of the type in the sample of documents; determining an indicator context expression for the type of data elements out of the determined data elements of the type; determining a query for searching, using a search engine, the full-text index using the indicator context expression; and determining the documents in the document system being compliant to the determined query. |
---|---|
Bibliography: | Application Number: US202218068022 |