Information asymmetry in KL-regularized RL
Galashov, Alexandre, Jayakumar, Siddhant M, Hasenclever, Leonard, Tirumala, Dhruva, Schwarz, Jonathan, Desjardins, Guillaume, Czarnecki, Wojciech M, Teh, Yee Whye, Pascanu, Razvan, Heess, Nicolas
Year of Publication 03.05.2019
Year of Publication 03.05.2019
Get full text
Journal Article
Mix&Match - Agent Curricula for Reinforcement Learning
Czarnecki, Wojciech Marian, Jayakumar, Siddhant M, Jaderberg, Max, Hasenclever, Leonard, Teh, Yee Whye, Osindero, Simon, Heess, Nicolas, Pascanu, Razvan
Year of Publication 05.06.2018
Year of Publication 05.06.2018
Get full text
Journal Article
Progress & Compress: A scalable framework for continual learning
Schwarz, Jonathan, Luketina, Jelena, Czarnecki, Wojciech M, Grabska-Barwinska, Agnieszka, Teh, Yee Whye, Pascanu, Razvan, Hadsell, Raia
Year of Publication 16.05.2018
Year of Publication 16.05.2018
Get full text
Journal Article
Sobolev Training for Neural Networks
Czarnecki, Wojciech Marian, Osindero, Simon, Jaderberg, Max, Świrszcz, Grzegorz, Pascanu, Razvan
Year of Publication 15.06.2017
Year of Publication 15.06.2017
Get full text
Journal Article
REINFORCEMENT LEARNING WITH AUXILIARY TASKS
Mnih, Volodymyr, Silver, David, Schaul, Tom, Czarnecki, Wojciech, Jaderberg, Maxwell Elliot, Kavukcuoglu, Koray
Year of Publication 17.06.2021
Get full text
Year of Publication 17.06.2021
Patent
alpha$-Rank: Multi-Agent Evaluation by Evolution
Omidshafiei, Shayegan, Papadimitriou, Christos, Piliouras, Georgios, Tuyls, Karl, Rowland, Mark, Lespiau, Jean-Baptiste, Czarnecki, Wojciech M, Lanctot, Marc, Perolat, Julien, Munos, Remi
Year of Publication 04.03.2019
Year of Publication 04.03.2019
Get full text
Journal Article
Reinforcement learning with auxiliary tasks
Mnih, Volodymyr, Silver, David, Schaul, Tom, Czarnecki, Wojciech, Jaderberg, Maxwell Elliot, Kavukcuoglu, Koray
Year of Publication 23.03.2021
Get full text
Year of Publication 23.03.2021
Patent
Understanding Synthetic Gradients and Decoupled Neural Interfaces
Czarnecki, Wojciech Marian, Świrszcz, Grzegorz, Jaderberg, Max, Osindero, Simon, Vinyals, Oriol, Kavukcuoglu, Koray
Year of Publication 01.03.2017
Year of Publication 01.03.2017
Get full text
Journal Article
POPULATION BASED TRAINING OF NEURAL NETWORKS
Dalibard, Valentin Clement, Czarnecki, Wojciech, Jaderberg, Maxwell Elliot, Green, Timothy Frederick Goldie
Year of Publication 07.01.2021
Get full text
Year of Publication 07.01.2021
Patent
Discovering Reinforcement Learning Algorithms
Oh, Junhyuk, Hessel, Matteo, Czarnecki, Wojciech M, Xu, Zhongwen, Hado van Hasselt, Singh, Satinder, Silver, David
Published in arXiv.org (05.01.2021)
Get full text
Published in arXiv.org (05.01.2021)
Paper
Reinforcement Learning with Unsupervised Auxiliary Tasks
Jaderberg, Max, Mnih, Volodymyr, Czarnecki, Wojciech Marian, Schaul, Tom, Leibo, Joel Z, Silver, David, Kavukcuoglu, Koray
Year of Publication 16.11.2016
Year of Publication 16.11.2016
Get full text
Journal Article