CUSTODIAN DISAMBIGUATION AND DATA MATCHING
Provided is a technique for matching different user representations of a person in a plurality of computer systems may be provided. The technique includes collecting information sets about user representations from a plurality of computer systems; normalizing the information sets to a unified format...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published |
27.10.2016
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Provided is a technique for matching different user representations of a person in a plurality of computer systems may be provided. The technique includes collecting information sets about user representations from a plurality of computer systems; normalizing the information sets to a unified format; grouping the information sets in the unified format into indexing buckets based on a user name using a non-phonetic algorithm; determining a similarity score for each pair of information sets in each of the indexing buckets; classifying each information set pair into a set of classes based on the similarity scores, wherein the set of classes comprise at least matches and non-matches; and using a data structure for merging information of information set pairs classified as matches. |
---|---|
Bibliography: | Application Number: US201514692543 |