EXTRACTION DEVICE, EXTRACTION METHOD, AND EXTRACTION PROGRAM
An extraction apparatus (10) includes an input unit (11) configured to receive an input of information about a plurality of web pages including a hypertext markup language (HTML) element that is known to reach a malicious web page through browser operation and an HTML element that is known to reach...
Saved in:
Main Authors | , |
---|---|
Format | Patent |
Language | English French German |
Published |
09.03.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | An extraction apparatus (10) includes an input unit (11) configured to receive an input of information about a plurality of web pages including a hypertext markup language (HTML) element that is known to reach a malicious web page through browser operation and an HTML element that is known to reach a benign web page through browser operation, a cluster determination unit (12) configured to classify the plurality of web pages whose input is received into clusters, an element character string extraction unit (13) configured to extract an HTML element that reaches the malicious web page and an HTML element that reaches the benign web page from a web page of each cluster that is classified to extract a first character string included in HTML elements that are extracted, and a keyword extraction unit (14) configured to extract, as a keyword, a second character string that characterizes the HTML element that reaches the malicious web page from the first character string. |
---|---|
Bibliography: | Application Number: EP20190930433 |