Search Results - "Suttle, Wesley" :: K.UTB vyhledávací portál

Information-Directed Policy Search in Sparse-Reward Settings via the Occupancy Information Ratio

by Suttle, Wesley A., Koppel, Alec, Liu, Ji
Published in 2023 57th Annual Conference on Information Sciences and Systems (CISS) (22.03.2023)

Get full text

Conference Proceeding

Loading…

Policy Gradient for Ratio Optimization: A Case Study

by Suttle, Wesley A., Koppel, Alec, Liu, Ji
Published in 2022 56th Annual Conference on Information Sciences and Systems (CISS) (09.03.2022)

Get full text

Conference Proceeding

Loading…

Reinforcement Learning Based Distributed Control of Dissipative Networked Systems

by Kosaraju, Krishna Chaitanya, Sivaranjani, S., Suttle, Wesley, Gupta, Vijay, Liu, Ji
Published in IEEE transactions on control of network systems (01.06.2022)

Get full text

Journal Article

Loading…

A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning

by Suttle, Wesley, Yang, Zhuoran, Zhang, Kaiqing, Wang, Zhaoran, Başar, Tamer, Liu, Ji
Published in IFAC-PapersOnLine (2020)

Get full text

Journal Article

Loading…

Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks

by Fatemi, Michael Y, Suttle, Wesley A, Sadler, Brian M
Year of Publication 09.02.2024

Get full text

Journal Article

Loading…

Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search

by Suttle, Wesley A, Koppel, Alec, Liu, Ji
Year of Publication 21.01.2022

Get full text

Journal Article

Loading…

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

by Singh, Utsav, Chakraborty, Souradip, Suttle, Wesley A, Sadler, Brian M, Namboodiri, Vinay P, Bedi, Amrit Singh
Year of Publication 16.06.2024

Get full text

Journal Article

Loading…

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

by Singh, Utsav, Suttle, Wesley A, Sadler, Brian M, Namboodiri, Vinay P, Bedi, Amrit Singh
Year of Publication 20.04.2024

Get full text

Journal Article

Loading…

Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search

by Suttle, Wesley A, Koppel, Alec, Liu, Ji
Published in arXiv.org (28.12.2023)

Get full text

Paper

Loading…

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

by Patel, Bhrij, Suttle, Wesley A, Koppel, Alec, Aggarwal, Vaneet, Sadler, Brian M, Bedi, Amrit Singh, Manocha, Dinesh
Year of Publication 18.03.2024

Get full text

Journal Article

Loading…

Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

by Suttle, Wesley A, Sharma, Vipul K, Kosaraju, Krishna C, Sivaranjani, S, Liu, Ji, Gupta, Vijay, Sadler, Brian M
Year of Publication 06.03.2024

Get full text

Journal Article

Loading…

Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

by Suttle, Wesley A, Bedi, Amrit Singh, Patel, Bhrij, Sadler, Brian M, Koppel, Alec, Manocha, Dinesh
Year of Publication 27.01.2023

Get full text

Journal Article

Loading…

LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments

by Shek, Chak Lam, Wu, Xiyang, Suttle, Wesley A, Busart, Carl, Zaroukian, Erin, Manocha, Dinesh, Tokekar, Pratap, Bedi, Amrit Singh
Year of Publication 30.09.2023

Get full text

Journal Article

Loading…

CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration

by Patel, Bhrij, Weerakoon, Kasun, Suttle, Wesley A, Koppel, Alec, Sadler, Brian M, Zhou, Tianyi, Bedi, Amrit Singh, Manocha, Dinesh
Year of Publication 09.06.2023

Get full text

Journal Article

Loading…

Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks

by Fatemi, Michael Y, Suttle, Wesley A, Sadler, Brian M
Published in arXiv.org (09.02.2024)

Get full text

Paper

Loading…

A Convergence Result for Regularized Actor-Critic Methods

by Suttle, Wesley, Yang, Zhuoran, Zhang, Kaiqing, Liu, Ji
Year of Publication 13.07.2019

Get full text

Journal Article

Loading…

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

by Singh, Utsav, Suttle, Wesley A, Sadler, Brian M, Namboodiri, Vinay P, Amrit Singh Bedi
Published in arXiv.org (16.06.2024)

Get full text

Paper

Loading…

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

by Singh, Utsav, Chakraborty, Souradip, Suttle, Wesley A, Sadler, Brian M, Namboodiri, Vinay P, Amrit Singh Bedi
Published in arXiv.org (16.06.2024)

Get full text

Paper

Loading…

CCE: Sample Efficient Sparse Reward Policy Learning for Robotic Navigation via Confidence-Controlled Exploration

by Patel, Bhrij, Weerakoon, Kasun, Suttle, Wesley A, Koppel, Alec, Sadler, Brian M, Zhou, Tianyi, Amrit Singh Bedi, Manocha, Dinesh
Published in arXiv.org (24.09.2024)

Get full text

Paper

Loading…

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

by Patel, Bhrij, Suttle, Wesley A, Koppel, Alec, Aggarwal, Vaneet, Sadler, Brian M, Amrit Singh Bedi, Manocha, Dinesh
Published in arXiv.org (20.06.2024)

Get full text

Paper

Refine Results

Format

Subject Area

Topic

Language

Year of Publication

Database