EXTRACTION DEVICE, EXTRACTION METHOD, AND EXTRACTION PROGRAM

An extraction apparatus (10) includes an input unit (11) configured to receive an input of information about a plurality of web pages including a hypertext markup language (HTML) element that is known to reach a malicious web page through browser operation and an HTML element that is known to reach...

Full description

Saved in:
Bibliographic Details
Main Authors CHIBA, Daiki, KOIDE, Takashi
Format Patent
LanguageEnglish
French
German
Published 09.03.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:An extraction apparatus (10) includes an input unit (11) configured to receive an input of information about a plurality of web pages including a hypertext markup language (HTML) element that is known to reach a malicious web page through browser operation and an HTML element that is known to reach a benign web page through browser operation, a cluster determination unit (12) configured to classify the plurality of web pages whose input is received into clusters, an element character string extraction unit (13) configured to extract an HTML element that reaches the malicious web page and an HTML element that reaches the benign web page from a web page of each cluster that is classified to extract a first character string included in HTML elements that are extracted, and a keyword extraction unit (14) configured to extract, as a keyword, a second character string that characterizes the HTML element that reaches the malicious web page from the first character string.
Bibliography:Application Number: EP20190930433