Differential Recurrent Neural Networks for Action Recognition
Veeriah, Vivek, Naifan Zhuang, Guo-Jun Qi
Published in 2015 IEEE International Conference on Computer Vision (ICCV) (01.12.2015)
Published in 2015 IEEE International Conference on Computer Vision (ICCV) (01.12.2015)
Get full text
Conference Proceeding
Journal Article
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
Moskovitz, Ted, O'Donoghue, Brendan, Veeriah, Vivek, Flennerhag, Sebastian, Singh, Satinder, Zahavy, Tom
Year of Publication 02.02.2023
Year of Publication 02.02.2023
Get full text
Journal Article
Learning State Representations from Random Deep Action-conditional Predictions
Zheng, Zeyu, Veeriah, Vivek, Vuorio, Risto, Lewis, Richard, Singh, Satinder
Year of Publication 09.02.2021
Year of Publication 09.02.2021
Get full text
Journal Article
Diversifying AI: Towards Creative Chess with AlphaZero
Zahavy, Tom, Veeriah, Vivek, Hou, Shaobo, Waugh, Kevin, Lai, Matthew, Leurent, Edouard, Tomasev, Nenad, Schut, Lisa, Hassabis, Demis, Singh, Satinder
Year of Publication 17.08.2023
Year of Publication 17.08.2023
Get full text
Journal Article
How Should an Agent Practice?
Rajendran, Janarthanan, Lewis, Richard, Veeriah, Vivek, Lee, Honglak, Singh, Satinder
Year of Publication 15.12.2019
Year of Publication 15.12.2019
Get full text
Journal Article
Discovery of Options via Meta-Learned Subgoals
Veeriah, Vivek, Zahavy, Tom, Hessel, Matteo, Xu, Zhongwen, Oh, Junhyuk, Kemaev, Iurii, van Hasselt, Hado, Silver, David, Singh, Satinder
Year of Publication 12.02.2021
Year of Publication 12.02.2021
Get full text
Journal Article
Diversifying AI: Towards Creative Chess with AlphaZero
Zahavy, Tom, Veeriah, Vivek, Hou, Shaobo, Waugh, Kevin, Lai, Matthew, Leurent, Edouard, Tomasev, Nenad, Schut, Lisa, Hassabis, Demis, Singh, Satinder
Published in arXiv.org (31.07.2024)
Get full text
Published in arXiv.org (31.07.2024)
Paper
Learning State Representations from Random Deep Action-conditional Predictions
Zheng, Zeyu, Veeriah, Vivek, Vuorio, Risto, Lewis, Richard, Singh, Satinder
Published in arXiv.org (05.11.2021)
Get full text
Published in arXiv.org (05.11.2021)
Paper
A Self-Tuning Actor-Critic Algorithm
Zahavy, Tom, Xu, Zhongwen, Veeriah, Vivek, Hessel, Matteo, Oh, Junhyuk, van Hasselt, Hado, Silver, David, Singh, Satinder
Year of Publication 28.02.2020
Year of Publication 28.02.2020
Get full text
Journal Article
Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning
Kearney, Alex, Veeriah, Vivek, Travnik, Jaden, Pilarski, Patrick M, Sutton, Richard S
Year of Publication 07.03.2019
Year of Publication 07.03.2019
Get full text
Journal Article