METHOD AND SYSTEM FOR DOCUMENT INDEXING AND RETRIEVAL

Existing systems for document processing are either based on a supervised approach using annotated tags, and these systems identify section-based data from the unstructured documents without considering the statistical variations in content, which results in highly inaccurate content extraction. The...

Full description

Saved in:
Bibliographic Details
Main Authors TRIPATHY, Saswati Soumya, RANA, Rahul, THAKARE, Shreya Sanjay, SHAH, Pranav Champaklal, ANSARI, Saad, POOJARY, Sudhakara Deva, PATEL, Hemil
Format Patent
LanguageEnglish
Published 27.10.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Existing systems for document processing are either based on a supervised approach using annotated tags, and these systems identify section-based data from the unstructured documents without considering the statistical variations in content, which results in highly inaccurate content extraction. The disclosure herein generally relates to document processing, and, more particularly, to method and system for document indexing and retrieval. The system provides a mechanism to correlate unique words in a document with different topics identified in the document, based on a word pattern identified from the document. The correlations are captured in a knowledge graph, and can be further used in applications such as but not limited to document retrieval.
Bibliography:Application Number: US202217682246