Towards Safer Rehabilitation: Improving Gait Trajectory Tracking for Lower Limb Exoskeletons Using Offline Reinforcement Learning
The application of online reinforcement learning (RL) in lower limb exoskeleton control has the potential to improve gait rehabilitation for individuals with impaired mobility. However, online RL approaches require real-time exploration and pose safety risks during training as suboptimal policies ca...
Saved in:
Published in | IEEE International Conference on Rehabilitation Robotics Vol. 2025; pp. 577 - 582 |
---|---|
Main Authors | , , , |
Format | Conference Proceeding Journal Article |
Language | English |
Published |
United States
IEEE
01.05.2025
|
Subjects | |
Online Access | Get full text |
ISSN | 1945-7901 1945-7901 |
DOI | 10.1109/ICORR66766.2025.11063146 |
Cover
Abstract | The application of online reinforcement learning (RL) in lower limb exoskeleton control has the potential to improve gait rehabilitation for individuals with impaired mobility. However, online RL approaches require real-time exploration and pose safety risks during training as suboptimal policies can lead to unstable or unsafe actions being executed by the exoskeleton. This study explores the application of offline RL methods, including Implicit Q-Learning (IQL), Twin Delayed Deep Deterministic Policy Gradient with Behavior Cloning (TD3+BC), and Revisited Behavior Regularized Actor-Critic (ReBRAC), for trajectory control of lower limb exoskeletons using a pre-collected dataset. The transition involved generating a diverse and representative dataset using online RL methods like Proximal Policy Optimization (PPO), which was then utilized to optimize offline RL models with advanced hyperparameter tuning via Optuna. Our results demonstrate improved gait trajectory tracking over a PPO baseline, with our TD3+BC model achieving the best performance. These findings highlight the potential of offline RL to enhance exoskeleton trajectory control while minimizing safety risks inherent in online approaches. |
---|---|
AbstractList | The application of online reinforcement learning (RL) in lower limb exoskeleton control has the potential to improve gait rehabilitation for individuals with impaired mobility. However, online RL approaches require real-time exploration and pose safety risks during training as suboptimal policies can lead to unstable or unsafe actions being executed by the exoskeleton. This study explores the application of offline RL methods, including Implicit Q-Learning (IQL), Twin Delayed Deep Deterministic Policy Gradient with Behavior Cloning (TD3+BC), and Revisited Behavior Regularized Actor-Critic (ReBRAC), for trajectory control of lower limb exoskeletons using a pre-collected dataset. The transition involved generating a diverse and representative dataset using online RL methods like Proximal Policy Optimization (PPO), which was then utilized to optimize offline RL models with advanced hyperparameter tuning via Optuna. Our results demonstrate improved gait trajectory tracking over a PPO baseline, with our TD3+BC model achieving the best performance. These findings highlight the potential of offline RL to enhance exoskeleton trajectory control while minimizing safety risks inherent in online approaches.The application of online reinforcement learning (RL) in lower limb exoskeleton control has the potential to improve gait rehabilitation for individuals with impaired mobility. However, online RL approaches require real-time exploration and pose safety risks during training as suboptimal policies can lead to unstable or unsafe actions being executed by the exoskeleton. This study explores the application of offline RL methods, including Implicit Q-Learning (IQL), Twin Delayed Deep Deterministic Policy Gradient with Behavior Cloning (TD3+BC), and Revisited Behavior Regularized Actor-Critic (ReBRAC), for trajectory control of lower limb exoskeletons using a pre-collected dataset. The transition involved generating a diverse and representative dataset using online RL methods like Proximal Policy Optimization (PPO), which was then utilized to optimize offline RL models with advanced hyperparameter tuning via Optuna. Our results demonstrate improved gait trajectory tracking over a PPO baseline, with our TD3+BC model achieving the best performance. These findings highlight the potential of offline RL to enhance exoskeleton trajectory control while minimizing safety risks inherent in online approaches. The application of online reinforcement learning (RL) in lower limb exoskeleton control has the potential to improve gait rehabilitation for individuals with impaired mobility. However, online RL approaches require real-time exploration and pose safety risks during training as suboptimal policies can lead to unstable or unsafe actions being executed by the exoskeleton. This study explores the application of offline RL methods, including Implicit Q-Learning (IQL), Twin Delayed Deep Deterministic Policy Gradient with Behavior Cloning (TD3+BC), and Revisited Behavior Regularized Actor-Critic (ReBRAC), for trajectory control of lower limb exoskeletons using a pre-collected dataset. The transition involved generating a diverse and representative dataset using online RL methods like Proximal Policy Optimization (PPO), which was then utilized to optimize offline RL models with advanced hyperparameter tuning via Optuna. Our results demonstrate improved gait trajectory tracking over a PPO baseline, with our TD3+BC model achieving the best performance. These findings highlight the potential of offline RL to enhance exoskeleton trajectory control while minimizing safety risks inherent in online approaches. |
Author | Faisal, A. Aldo Narayan, Jyotindra Omarali, Bukeikhan Sang, Matthew Wong |
Author_xml | – sequence: 1 givenname: Matthew Wong surname: Sang fullname: Sang, Matthew Wong organization: Imperial College London,Brain and Behaviour Lab,Department of Bioengineering,UK – sequence: 2 givenname: Jyotindra surname: Narayan fullname: Narayan, Jyotindra organization: Universität Bayreuth,Chair in Digital Health,Germany – sequence: 3 givenname: Bukeikhan surname: Omarali fullname: Omarali, Bukeikhan organization: Imperial College London,Brain and Behaviour Lab,Department of Computing,UK – sequence: 4 givenname: A. Aldo surname: Faisal fullname: Faisal, A. Aldo email: aldo.faisal@imperial.ac.uk organization: Imperial College London,Brain and Behaviour Lab,Department of Bioengineering,UK |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/40643995$$D View this record in MEDLINE/PubMed |
BookMark | eNpNkU1rGzEQhpWSUqep_0EoOubiVB8rrdRbMUlqWDC4ztlI61EiZ1dypXU-jv3n1eKk9DQv8z4zMO98RqchBkAIU3JFKdHfFvPlaiVlLeUVI0yMTclpJU_QVNdacUG4IlKxD-iM6krMak3o6X96gqY57wghlCnJavkJTSoiK661OEN_1vHZpG3Gv4yDhFfwYKzv_GAGH8N3vOj3KT75cI9vjR_wOpkdtENMr6NsH0fDxYSb-FyGG99bfP0S8yN0MMSQ8V0eiaVznQ9QlvtQ6BZ6CANuwKRQ7C_oozNdhulbPUd3N9fr-c9Zs7xdzH80M08rPcysboWGigJnRAslHJWuVOmEMdKWeLRgypLK1M4AOG63WjEr1ZZZqDWT_BxdHveWi34fIA-b3ucWus4EiIe84YxpMaakCvr1DT3YHrabffK9Sa-b99gKcHEEPAD8s98fw_8CckWBKw |
ContentType | Conference Proceeding Journal Article |
DBID | 6IE 6IL CBEJK RIE RIL CGR CUY CVF ECM EIF NPM 7X8 |
DOI | 10.1109/ICORR66766.2025.11063146 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
DatabaseTitle | MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Occupational Therapy & Rehabilitation |
EISBN | 9798350380682 |
EISSN | 1945-7901 |
EndPage | 582 |
ExternalDocumentID | 40643995 11063146 |
Genre | orig-research Journal Article |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AAWTH ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IPLJI OCL RIE RIL RNS CGR CUY CVF ECM EIF NPM 7X8 |
ID | FETCH-LOGICAL-i149t-b9c59e41e3209585f16f9586f5aa6b6679528b04a7faeef3bd982b68d2be79263 |
IEDL.DBID | RIE |
ISSN | 1945-7901 |
IngestDate | Sat Jul 12 17:30:12 EDT 2025 Sun Jul 13 01:31:32 EDT 2025 Wed Jul 16 07:53:35 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i149t-b9c59e41e3209585f16f9586f5aa6b6679528b04a7faeef3bd982b68d2be79263 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
PMID | 40643995 |
PQID | 3229500018 |
PQPubID | 23479 |
PageCount | 6 |
ParticipantIDs | ieee_primary_11063146 pubmed_primary_40643995 proquest_miscellaneous_3229500018 |
PublicationCentury | 2000 |
PublicationDate | 2025-May |
PublicationDateYYYYMMDD | 2025-05-01 |
PublicationDate_xml | – month: 05 year: 2025 text: 2025-May |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | IEEE International Conference on Rehabilitation Robotics |
PublicationTitleAbbrev | ICORR |
PublicationTitleAlternate | IEEE Int Conf Rehabil Robot |
PublicationYear | 2025 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0001286276 |
Score | 2.2927966 |
Snippet | The application of online reinforcement learning (RL) in lower limb exoskeleton control has the potential to improve gait rehabilitation for individuals with... |
SourceID | proquest pubmed ieee |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 577 |
SubjectTerms | Adaptation models Exoskeleton Device Exoskeletons Gait - physiology Humans Limbs Lower Extremity - physiology Real-time systems Reinforcement, Psychology Safety Testing Training Trajectory Trajectory tracking Tuning |
Title | Towards Safer Rehabilitation: Improving Gait Trajectory Tracking for Lower Limb Exoskeletons Using Offline Reinforcement Learning |
URI | https://ieeexplore.ieee.org/document/11063146 https://www.ncbi.nlm.nih.gov/pubmed/40643995 https://www.proquest.com/docview/3229500018 |
Volume | 2025 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1bS8MwFA66J5-8TZ03Iqhv3da0TVtfx7yhm8wJvo2c9kTmsBPXgfrmP_ckXecFBJ8aKAmh56Tny8l3vjB2GIeuDpvoOUJr7fhahQ6YKhBQEUCEnkAru3jdked3_uV9cD8rVre1MIhoyWdYN017lp-Ok6lJlTUoVEmPlvYiWyQ_K4q1viVUCJyHsmTrNOPGRavb6xkOp6EiiKBedp9dpPI3prSx5XSZdcpZFZSSUX2aQz15_yXY-O9pr7DqVxkfv5kHqFW2gNkaO_quK8z7hagAP-a9H5Ld6-yjbwm1E36raKxfr0_4PBnBz9Qw5xTyHm3-_800E5N_5wSH-ZW5hI1fDZ-At1_HkxEFOQKbE26ZCryrtUG5NLjVb01sqpLPJF8fquzutN1vnTuz-xqcIe2zcgfiJIjRd8nCBNyiQLtS01PqQCkJZIg4EBF5ggq1QtQepHEkQEapAAxjIb0NVsnGGW4xnib0LxEgQEnPT2mbDi6kboBugiHQFrPGquY7D54LSY5B-Ylr7KC06YDWiTn8UBmOp5OBZ-4tN24S1dhmYex5b9_isjjY_mPUHbZkHKjgOe6ySv4yxT3CIjnsWx_8BMIk4HE |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fT9swED5t7IE9AaOwjh_zpLG3lMaJnYRXBCtbKVNXJN4iX3KeClo7rakEe9t_ztlpCkNC2lMsRTlZvnPufP7uO4CPWRLapEtRIK21QWxNEqCrAkGTIqYUSfK0i-cD3buMv1ypq0Wxuq-FISIPPqOOG_q7_HJazF2q7JBdlY54a7-EV-z4Y1WXaz1KqXB4nugGr9PNDs-OL4ZDh-J0YASpOo2ARSuV56NK711O12DQzKsGldx05hV2ij9PKBv_e-Lr0Hoo5BPfli5qA17Q5A0cPGYWFqOaVkB8EsN_SLs34e_IQ2pn4rthWU9eH4llOkJ8NuNKsNO79jcAd25YuAy84IBY9F0bNtEf_0Rxcjud3bCb43BzJjxWQVxY6-JcFu4ZXAufrBQL0tcfLbg8PRkd94JFx4ZgzCetKsCsUBnFIeuYQ7dU2VBbfmqrjNHIisiUTNkWTGINkY2wzFKJOi0lUpJJHW3BymQ6obcgyoL_JhIlGh3FJR_UMcQyVBQWlCAfMtvQcuuc_6pJOfJmidvwodFpzjvFXX-YCU3nszxyncudmaRt2K6Vvfw69pFZpt49I_U9rPZG5_28fzb4ugOvnTHVqMddWKl-z2mPI5MK97093gOj3eO- |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE+International+Conference+on+Rehabilitation+Robotics&rft.atitle=Towards+Safer+Rehabilitation%3A+Improving+Gait+Trajectory+Tracking+for+Lower+Limb+Exoskeletons+Using+Offline+Reinforcement+Learning&rft.au=Sang%2C+Matthew+Wong&rft.au=Narayan%2C+Jyotindra&rft.au=Omarali%2C+Bukeikhan&rft.au=Faisal%2C+A.+Aldo&rft.date=2025-05-01&rft.pub=IEEE&rft.eissn=1945-7901&rft.spage=577&rft.epage=582&rft_id=info:doi/10.1109%2FICORR66766.2025.11063146&rft.externalDocID=11063146 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1945-7901&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1945-7901&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1945-7901&client=summon |