Methods And Apparatus For Implementing Reinforcement Learning
Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs actions in an environment in accordance with a policy generated by an RL agent, wherein the RL agent models the environment and encodes a state...
Main Authors | Nikou, Alexandros; Mujumdar, Anusha Pradeep |
---|---|
Format | Patent |
Language | English |
Published | 19.09.2024 |
Abstract | Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs actions in an environment in accordance with a policy generated by an RL agent, wherein the RL agent models the environment and encodes a state of the environment using a set of features, comprises obtaining an intent, wherein the intent specifies one or more criteria to be satisfied by the environment. The method further comprises determining a Companion Markov Decision Process (CMDP) that encodes states of the environment using a subset of the set of features used by the RL agent. The method further comprises generating a finite state automaton that represents the intent as a series of logic states, and computing a product of CMDP output states and logic states, wherein the product contains all of the potential combinations of a CMDP output state and a logic state. The method further comprises selecting an action to be performed on the environment from one or more suggested actions obtained from the policy, the selection being based on the product of CMDP output states and logic states. |
---|---|
Author | Nikou, Alexandros; Mujumdar, Anusha Pradeep |
DatabaseName | esp@cenet |
Discipline | Medicine; Chemistry; Sciences; Physics |
ExternalDocumentID | US2024311687A1 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Notes | Application Number: US202118272956 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240919&DB=EPODOC&CC=US&NR=2024311687A1 |
PublicationDateYYYYMMDD | 2024-09-19 |
RelatedCompanies | Telefonaktiebolaget LM Ericsson (publ) |
SubjectTerms | CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; ELECTRIC COMMUNICATION TECHNIQUE; ELECTRICITY; PHYSICS; TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION |
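
The steps named in the abstract (a CMDP over a subset of features, a finite state automaton for the intent, their product, and action selection among the policy's suggestions) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the patent: the single "load" feature, the deterministic transitions, the intent "load must never be high", the one-step lookahead rule, and all names such as `CMDP_TRANSITIONS` and `select_action`.

```python
from itertools import product

# Hypothetical CMDP: states use only a subset of the agent's features
# (here one coarse "load" feature). Deterministic transitions are an
# assumption made to keep the sketch short.
CMDP_STATES = ["low", "high"]
CMDP_TRANSITIONS = {
    ("low", "increase"): "high",
    ("low", "hold"): "low",
    ("low", "decrease"): "low",
    ("high", "increase"): "high",
    ("high", "hold"): "high",
    ("high", "decrease"): "low",
}

# Hypothetical intent "load must never be high", written as a
# two-state automaton whose states are the logic states.
LOGIC_STATES = ["ok", "violated"]


def automaton_step(logic_state, cmdp_state):
    """Advance the intent automaton after observing a CMDP state."""
    if logic_state == "violated" or cmdp_state == "high":
        return "violated"
    return "ok"


def product_states():
    """All combinations of a CMDP state and a logic state."""
    return list(product(CMDP_STATES, LOGIC_STATES))


def select_action(cmdp_state, logic_state, suggested_actions):
    """Pick the first suggested action whose successor product state
    does not violate the intent (one-step lookahead, an assumption).
    Returns None if every suggestion leads to a violation."""
    for action in suggested_actions:
        next_cmdp = CMDP_TRANSITIONS[(cmdp_state, action)]
        next_logic = automaton_step(logic_state, next_cmdp)
        if next_logic != "violated":
            return action
    return None


if __name__ == "__main__":
    print(product_states())
    # The policy's top suggestion is filtered out; the second is kept.
    print(select_action("low", "ok", ["increase", "hold"]))
```

In this sketch the RL policy still ranks the candidate actions; the product of CMDP and logic states only acts as a filter on them. A real system would presumably use stochastic transitions and a richer intent language, neither of which is modeled here.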