SYSTEM AND METHOD FOR DIGITALLY FINDERPRINTING PHISHING ACTORS
ABSTRACT Websites, having associated features, are clustered by filtering entries that may be legitimate, determining feature similarity scores between the website features, and generating an aggregated similarity matrix containing website sim ilarity scores between the websites. Websites are cluste...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | English |
Published |
22.05.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | ABSTRACT Websites, having associated features, are clustered by filtering entries that may be legitimate, determining feature similarity scores between the website features, and generating an aggregated similarity matrix containing website sim ilarity scores between the websites. Websites are clustered into clusters or groups, based in part on the aggregated sim ilarity matrix. Each cluster is identified by a cluster identifier and represents a centroid website and other websites at a normalized similarity score from the centroid. It is determined for each website whether the normalized sim ilarity score is less than a threshold, and if so is identified as weakly-similar. Above the threshold, the website is labelled with the cluster identifier. Further clustering and thresholding is performed on the weakly-similar websites into additional clusters. Date Recue/Date Received 2020-11-20 |
---|---|
Bibliography: | Application Number: CA20203100237 |