Defense method, system and equipment for reinforcement learning backdoor attack
The invention discloses a defense method, system and device for reinforcement learning backdoor attack, and relates to the technical field of artificial intelligence security, the method comprises the following steps: training an agent by using an offline reinforcement learning algorithm according t...
Saved in:
Main Authors | , , , , , , , , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
20.02.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The invention discloses a defense method, system and device for reinforcement learning backdoor attack, and relates to the technical field of artificial intelligence security, the method comprises the following steps: training an agent by using an offline reinforcement learning algorithm according to a reinforcement learning data set; in a safe environment, obtaining state transition information of interaction between the defense object and the environment; training a state-action error correction environment dynamic model by using the state transition information; detecting whether an attacker triggers a back door or not by utilizing the trained state-action error correction environment dynamic model; when the trigger backdoor exists, the defense object executes an action according to the predicted environment feedback state; and if the trigger backdoor does not exist, the defense object executes an action according to the environment feedback state at the current moment. According to the invention, backdoor |
---|---|
Bibliography: | Application Number: CN202311519555 |