FUZZY MATCHING OF OBSCURE TEXTS WITH MEANINGFUL TERMS INCLUDED IN A GLOSSARY

A method comprising: obtaining multiple glossary terms each comprising one or more words; generating multiple fuzzy tokens from each word of each of the glossary terms; calculating a similarity score for each of the fuzzy tokens, the similarity score denoting a similarity between the respective fuzz...

Full description

Saved in:
Bibliographic Details
Main Authors Moffie, Micha Gideon, Boehm, Omer Yehuda, Shachor, Shlomit Ifergan, Razinkov, Natalia
Format Patent
LanguageEnglish
Published 22.02.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method comprising: obtaining multiple glossary terms each comprising one or more words; generating multiple fuzzy tokens from each word of each of the glossary terms; calculating a similarity score for each of the fuzzy tokens, the similarity score denoting a similarity between the respective fuzzy token and its respective word; obtaining multiple input terms to be matched with the multiple glossary terms; separating each of the input terms into multiple input tokens; generating multiple n-grams from each of the input tokens; comparing the n-grams with the fuzzy tokens, to output a list of matching n-grams and fuzzy tokens; based on the list of matching n-grams and fuzzy tokens, identifying, from the glossary terms, candidate glossary term matches for each of the input terms; and calculating one or more scores that quantify the match between each of the candidate glossary term matches and its respective input term.
Bibliography:Application Number: US202217892169