Optimal Penetration Path Generation Based on Maximum Entropy Reinforcement Learning

Analyzing intrusion intentions and penetration behaviors from the attackers' perspective is of great significance for guiding network security defense.However, most existing penetration paths are constructed based on the instantaneous network environment, resulting in reduced reference value.Ai...

Full description

Saved in:
Bibliographic Details
Published inJi suan ji ke xue Vol. 51; no. 3; p. 360
Main Authors Wang, Yan, Wang, Tianjing, Shen, Hang, Bai, Guangwei
Format Journal Article
LanguageChinese
Published Chongqing Guojia Kexue Jishu Bu 01.01.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Analyzing intrusion intentions and penetration behaviors from the attackers' perspective is of great significance for guiding network security defense.However, most existing penetration paths are constructed based on the instantaneous network environment, resulting in reduced reference value.Aiming at this problem, this paper proposes an optimal penetration path generation method based on maximum entropy reinforcement learning, which can capture the approximate optimal behavior of multiple modes in the form of exploration under dynamic network environments.Firstly, the penetration process is modeled according to the attack graph and the vulnerability score, and the threat degree of the penetration behavior is described by quantifying the attack benefits.Then, considering the complexity of the intrusion behavior, a soft Q-learning method based on the maximum entropy model is developed.The stability of the penetration path is ensured by controlling the entropy value and the importance of the reward.Finally, the
ISSN:1002-137X