Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
Everitt, Tom; Hutter, Marcus; Kumar, Ramana; Krakovna, Victoria
Published in Synthese (Dordrecht) (01.11.2021)
Journal Article