Methods And Apparatus For Implementing Reinforcement Learning

Methods and apparatus for implementing reinforcement learning (RL) are provided. A method of operation for a node implementing RL, wherein the node instructs actions in an environment in accordance with a policy generated by a RL agent, wherein the RL agent models the environment and encodes a state...

Bibliographic Details
Main Authors	Nikou, Alexandros; Mujumdar, Anusha Pradeep
Format	Patent
Language	English
Published	19.09.2024
Subjects	CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; ELECTRIC COMMUNICATION TECHNIQUE; ELECTRICITY; PHYSICS; TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
Abstract	Methods and apparatus for implementing reinforcement learning (RL) are provided. In a method of operation for a node implementing RL, the node instructs actions in an environment in accordance with a policy generated by an RL agent, and the RL agent models the environment and encodes a state of the environment using a set of features. The method comprises obtaining an intent that specifies one or more criteria to be satisfied by the environment. The method further comprises determining a Companion Markov Decision Process (CMDP) that encodes states of the environment using a subset of the set of features used by the RL agent. The method further comprises generating a finite state automaton that represents the intent as a series of logic states, and computing a product of CMDP output states and logic states, wherein the product contains all of the potential combinations of a CMDP output state and a logic state. The method further comprises selecting an action to be performed on the environment from one or more suggested actions obtained from the policy, the selection being based on the product of CMDP output states and logic states.
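The product construction and action selection described in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the two-state CMDP (`low_load`, `high_load`), the action names, the intent ("the load must never be high"), the two-state automaton, and the selection rule (take the first suggested action whose possible successors all avoid the violating logic state, falling back to the policy's first suggestion).

```python
from itertools import product

# Hypothetical CMDP over a reduced feature set (a single "load" feature).
CMDP_STATES = ["low_load", "high_load"]

# Hypothetical logic states of an automaton for the intent
# "the load must never be high".
LOGIC_STATES = ["q_ok", "q_violated"]

# Assumed CMDP transitions: (state, action) -> set of possible next states.
TRANSITIONS = {
    ("low_load", "hold"): {"low_load", "high_load"},
    ("low_load", "scale_up"): {"low_load"},
    ("high_load", "hold"): {"high_load"},
    ("high_load", "scale_up"): {"low_load"},
}


def automaton_step(logic_state, cmdp_state):
    """Advance the automaton after observing a CMDP state."""
    if logic_state == "q_violated" or cmdp_state == "high_load":
        return "q_violated"  # a violation is permanent in this toy intent
    return "q_ok"


# The product contains every combination of a CMDP state and a logic state.
PRODUCT_STATES = list(product(CMDP_STATES, LOGIC_STATES))


def successors(product_state, action):
    """Product states reachable from product_state by taking action."""
    cmdp_state, logic_state = product_state
    return {
        (nxt, automaton_step(logic_state, nxt))
        for nxt in TRANSITIONS[(cmdp_state, action)]
    }


def select_action(product_state, suggested_actions):
    """Pick the first suggested action that cannot reach a violating state.

    Falls back to the policy's top suggestion when no action is safe
    (an assumed tie-breaking rule, not one stated in the abstract).
    """
    for action in suggested_actions:
        if all(q != "q_violated" for _, q in successors(product_state, action)):
            return action
    return suggested_actions[0]
```

In this sketch, `suggested_actions` stands in for the ranked actions obtained from the RL policy. The product lets the selection step check each suggestion against the intent before it is applied to the environment.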
ExternalDocumentID	US2024311687A1
Notes	Application Number: US202118272956
OpenAccessLink	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240919&DB=EPODOC&CC=US&NR=2024311687A1
RelatedCompanies	Telefonaktiebolaget LM Ericsson (publ)