Information-Directed Policy Search in Sparse-Reward Settings via the Occupancy Information Ratio
Suttle, Wesley A., Koppel, Alec, Liu, Ji
Published in 2023 57th Annual Conference on Information Sciences and Systems (CISS) (22.03.2023)
Published in 2023 57th Annual Conference on Information Sciences and Systems (CISS) (22.03.2023)
Get full text
Conference Proceeding
Policy Gradient for Ratio Optimization: A Case Study
Suttle, Wesley A., Koppel, Alec, Liu, Ji
Published in 2022 56th Annual Conference on Information Sciences and Systems (CISS) (09.03.2022)
Published in 2022 56th Annual Conference on Information Sciences and Systems (CISS) (09.03.2022)
Get full text
Conference Proceeding
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
Suttle, Wesley, Yang, Zhuoran, Zhang, Kaiqing, Wang, Zhaoran, Başar, Tamer, Liu, Ji
Published in IFAC-PapersOnLine (2020)
Published in IFAC-PapersOnLine (2020)
Get full text
Journal Article
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Singh, Utsav, Chakraborty, Souradip, Suttle, Wesley A, Sadler, Brian M, Namboodiri, Vinay P, Bedi, Amrit Singh
Year of Publication 16.06.2024
Year of Publication 16.06.2024
Get full text
Journal Article
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Singh, Utsav, Suttle, Wesley A, Sadler, Brian M, Namboodiri, Vinay P, Bedi, Amrit Singh
Year of Publication 20.04.2024
Year of Publication 20.04.2024
Get full text
Journal Article
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
Patel, Bhrij, Suttle, Wesley A, Koppel, Alec, Aggarwal, Vaneet, Sadler, Brian M, Bedi, Amrit Singh, Manocha, Dinesh
Year of Publication 18.03.2024
Year of Publication 18.03.2024
Get full text
Journal Article
Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems
Suttle, Wesley A, Sharma, Vipul K, Kosaraju, Krishna C, Sivaranjani, S, Liu, Ji, Gupta, Vijay, Sadler, Brian M
Year of Publication 06.03.2024
Year of Publication 06.03.2024
Get full text
Journal Article
Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic
Suttle, Wesley A, Bedi, Amrit Singh, Patel, Bhrij, Sadler, Brian M, Koppel, Alec, Manocha, Dinesh
Year of Publication 27.01.2023
Year of Publication 27.01.2023
Get full text
Journal Article
LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Shek, Chak Lam, Wu, Xiyang, Suttle, Wesley A, Busart, Carl, Zaroukian, Erin, Manocha, Dinesh, Tokekar, Pratap, Bedi, Amrit Singh
Year of Publication 30.09.2023
Year of Publication 30.09.2023
Get full text
Journal Article
CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration
Patel, Bhrij, Weerakoon, Kasun, Suttle, Wesley A, Koppel, Alec, Sadler, Brian M, Zhou, Tianyi, Bedi, Amrit Singh, Manocha, Dinesh
Year of Publication 09.06.2023
Year of Publication 09.06.2023
Get full text
Journal Article