FUZZY MATCHING OF OBSCURE TEXTS WITH MEANINGFUL TERMS INCLUDED IN A GLOSSARY
A method comprising: obtaining multiple glossary terms each comprising one or more words; generating multiple fuzzy tokens from each word of each of the glossary terms; calculating a similarity score for each of the fuzzy tokens, the similarity score denoting a similarity between the respective fuzz...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | English |
Published |
22.02.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A method comprising: obtaining multiple glossary terms each comprising one or more words; generating multiple fuzzy tokens from each word of each of the glossary terms; calculating a similarity score for each of the fuzzy tokens, the similarity score denoting a similarity between the respective fuzzy token and its respective word; obtaining multiple input terms to be matched with the multiple glossary terms; separating each of the input terms into multiple input tokens; generating multiple n-grams from each of the input tokens; comparing the n-grams with the fuzzy tokens, to output a list of matching n-grams and fuzzy tokens; based on the list of matching n-grams and fuzzy tokens, identifying, from the glossary terms, candidate glossary term matches for each of the input terms; and calculating one or more scores that quantify the match between each of the candidate glossary term matches and its respective input term. |
---|---|
Bibliography: | Application Number: US202217892169 |