Methods And Apparatus For Implementing Reinforcement Learning

Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs actions in an environment in accordance with a policy generated by a RL agent, wherein the RL agent models the environment and encodes a state...

Full description

Saved in:
Bibliographic Details
Main Authors Nikou, Alexandros, Mujumdar, Anusha Pradeep
Format Patent
LanguageEnglish
Published 19.09.2024
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs actions in an environment in accordance with a policy generated by a RL agent, wherein the RL agent models the environment and encodes a state of the environment using a set of features, comprises obtaining an intent, wherein the intent specifies one or more criteria to be satisfied by the environment. The method further comprises determining a Companion Markov Decision Process (CMDP) that encodes states of the environment using a subset of the set of features used by the RL agent. The method further comprises generating a finite state automaton that represents the intent as a series of logic states, and computing a product of CMDP output states and logic states, wherein the product contains all of the potential combinations of a CMDP output state and a logic state. The method further comprises selecting an action to be performed on the environment from one or more suggested actions obtained from the policy, the selection being based on the product of CMDP output states and logic state.
AbstractList Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs actions in an environment in accordance with a policy generated by a RL agent, wherein the RL agent models the environment and encodes a state of the environment using a set of features, comprises obtaining an intent, wherein the intent specifies one or more criteria to be satisfied by the environment. The method further comprises determining a Companion Markov Decision Process (CMDP) that encodes states of the environment using a subset of the set of features used by the RL agent. The method further comprises generating a finite state automaton that represents the intent as a series of logic states, and computing a product of CMDP output states and logic states, wherein the product contains all of the potential combinations of a CMDP output state and a logic state. The method further comprises selecting an action to be performed on the environment from one or more suggested actions obtained from the policy, the selection being based on the product of CMDP output states and logic state.
Author Nikou, Alexandros
Mujumdar, Anusha Pradeep
Author_xml – fullname: Nikou, Alexandros
– fullname: Mujumdar, Anusha Pradeep
BookMark eNrjYmDJy89L5WSw9U0tychPKVZwzEtRcCwoSCxKLCktVnDLL1LwzC3ISc1NzSvJzEtXCErNzEvLL0oGCyj4pCYW5QGFeRhY0xJzilN5oTQ3g7Kba4izh25qQX58anFBYnJqXmpJfGiwkYGRibGhoZmFuaOhMXGqANzDMsM
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
ExternalDocumentID US2024311687A1
GroupedDBID EVB
ID FETCH-epo_espacenet_US2024311687A13
IEDL.DBID EVB
IngestDate Fri Nov 01 05:52:16 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_US2024311687A13
Notes Application Number: US202118272956
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240919&DB=EPODOC&CC=US&NR=2024311687A1
ParticipantIDs epo_espacenet_US2024311687A1
PublicationCentury 2000
PublicationDate 20240919
PublicationDateYYYYMMDD 2024-09-19
PublicationDate_xml – month: 09
  year: 2024
  text: 20240919
  day: 19
PublicationDecade 2020
PublicationYear 2024
RelatedCompanies Telefonaktiebolaget LM Ericsson (publ)
RelatedCompanies_xml – name: Telefonaktiebolaget LM Ericsson (publ)
Score 3.5600429
Snippet Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs...
SourceID epo
SourceType Open Access Repository
SubjectTerms CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC COMMUNICATION TECHNIQUE
ELECTRICITY
PHYSICS
TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHICCOMMUNICATION
Title Methods And Apparatus For Implementing Reinforcement Learning
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240919&DB=EPODOC&locale=&CC=US&NR=2024311687A1
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1La8MwDBale962bmOPbhg2egubGydpDmG0SUMZ9EEfo7fiOE4ZjLQ0Kfv7U0S69dSjJRDCIEuyP38CeBHWG4-T2DG44JEhpFSGxELDaNqRaGplO0lCaIuB3ZuJj7k1r8D37i8M8YT-EDkiRpTCeM_pvF7_X2IFhK3MXqMvFK3ew6kXNMruGNOTixEYdLzuaBgM_Ybve7NJYzAmncm53XLa2CsdYSHtUNv22Sn-paz3k0p4AccjtJfml1DRaQ3O_N3stRqc9ssn7xqcEEZTZSgs4zC7Aq9Pg58z1k5jhoVkwd-9zVi42jCi-yUMULpkY03EqIoErORSXV7Dc9id-j0DPVr8bcBiNtl337yBarpK9S0wKS0eJbFuCa4EF8p1Le3aypR2U8jIdO6gfsjS_WH1A5wXywIdwd06VPPNVj9iCs6jJ9q5X7vAihY
link.rule.ids 230,309,783,888,25577,76883
linkProvider European Patent Office
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LS8NAEB5KfdSbVsVH1QWlt6CbbJLmEKRNGqr2RR_SW9lsNkWQtDQp_n03Q6o99ToDyzAwOzO733wD8MTMFxrFka1RRkONcS40rgoNTbdCpkth2XGMaIu-1Zmy95k5K8H3dhYGeUJ_kBxRRZRQ8Z7hfb36f8TyEVuZPodfSrR8DSauXy-6Y5WeHBWBfsttDwf-wKt7njsd1_sj1BmUWg27qXqlA1VkN7BZ-mzlcymr3aQSnMLhUJ2XZGdQkkkVKt5291oVjnvFl3cVjhCjKVIlLOIwPQe3h4ufU9JMIqIKyZy_e5OSYLkmSPeLGKBkQUYSiVEFCkjBpbq4gMegPfE6mrJo_ueA-XS8a75xCeVkmcgrIJybNIwj2WBUMMqE45jSsYTBLZ3x0LCvobbvpJv96geodCa97rz71v-4hZNclSMlqFODcrbeyDuVjrPwHr34C11fjQY
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Methods+And+Apparatus+For+Implementing+Reinforcement+Learning&rft.inventor=Nikou%2C+Alexandros&rft.inventor=Mujumdar%2C+Anusha+Pradeep&rft.date=2024-09-19&rft.externalDBID=A1&rft.externalDocID=US2024311687A1