DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network th...
Saved in:
Main Authors | , , |
---|---|
Format | Patent |
Language | English |
Published |
05.12.2019
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network. |
---|---|
AbstractList | Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network. |
Author | Jaderberg, Maxwell Elliot Czarnecki, Wojciech Dunning, Iain Robert |
Author_xml | – fullname: Dunning, Iain Robert – fullname: Czarnecki, Wojciech – fullname: Jaderberg, Maxwell Elliot |
BookMark | eNqNyr0KgzAUQOEM7dC_d7jQuaANKB2DXmtomshNQkaRkk5FBX1_WqEP4HTg8O3Zph_6uGNzidgAodSVoQKfqB0oFKSlvkOQroZKWAe-KYVbFmHhiRal0ZNQv7hg6GFB6BKsMmGFPbLtu_tM8fTvgZ0rdEV9iePQxmnsXrGPc-vtNUlvPE8ynouUr1NfZOA8Tg |
ContentType | Patent |
DBID | EVB |
DatabaseName | esp@cenet |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Sciences Physics |
ExternalDocumentID | US2019370637A1 |
GroupedDBID | EVB |
ID | FETCH-epo_espacenet_US2019370637A13 |
IEDL.DBID | EVB |
IngestDate | Fri Jul 19 12:46:43 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-epo_espacenet_US2019370637A13 |
Notes | Application Number: US201916425717 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20191205&DB=EPODOC&CC=US&NR=2019370637A1 |
ParticipantIDs | epo_espacenet_US2019370637A1 |
PublicationCentury | 2000 |
PublicationDate | 20191205 |
PublicationDateYYYYMMDD | 2019-12-05 |
PublicationDate_xml | – month: 12 year: 2019 text: 20191205 day: 05 |
PublicationDecade | 2010 |
PublicationYear | 2019 |
RelatedCompanies | DeepMind Technologies Limited |
RelatedCompanies_xml | – name: DeepMind Technologies Limited |
Score | 3.244507 |
Snippet | Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes... |
SourceID | epo |
SourceType | Open Access Repository |
SubjectTerms | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS |
Title | DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS |
URI | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20191205&DB=EPODOC&locale=&CC=US&NR=2019370637A1 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjR1dS8Mw8Bjz802n4seUgNK34taPZXsY0rWpm860pK3b21i7DgTphqv4972EqXsaPoXcHQe5cLlcch8Ad9PM6KTN9lzPqGxhNjUsPbU6uW7nluy8l7Vnmar2yVv9xHoa2-MKvP_kwqg6oV-qOCJqVIb6Xqrzevn3iOWp2MrVffqGoMWDH3c9be0do_NhNGzN63VZGHiBq7luN4k0LhTOpGiPqYO-0g5epKkMAGOvPZmXstw0Kv4R7IbIryiPoZIXNThwf3qv1WD_Zf3lXYM9FaOZrRC41sPVCZQeYyERbMDRiXNVQX4yZI7gA_5IRoO4T3wnikkSek4sQYK5iRCSirNEOEMc4lEgniPicI9Ew2D0D9pTuPVZ7PZ1XMjkV26TJNpctXkG1WJR5OdAZqbRME06a7WM1MI96sytOeotmktqTZuUXkB9G6fL7egrOJRTFfNh16Fafnzm12i5y_RGCfwbQfaS3Q |
link.rule.ids | 230,309,783,888,25578,76884 |
linkProvider | European Patent Office |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dT8IwEL8Q_MA3RY0fqE00eyPCPhg8EDO2zk1Gt3Sb8EbYGImJGURm_Pe9Nag8EZ-a3DVN7prr9dq73wE8zFK5l7S7i2aqly3MZrLaTNRe1tQytey8l3bnqUD7ZB0nVl8m2qQC7z-1MAIn9EuAI6JFpWjvhTivV3-PWJbIrVw_Jm9IWj7ZUd-SNtExBh9yS5OsQZ8GvuWbkmn241BiXPAUHf2xbmCstIeX7G6JtE9fB2VdymrbqdjHsB_genlxApUsr0PN_Om9VofD0ebLuw4HIkczXSNxY4frUygsSgPCqcswiDMFID_xqMGZy57J2I0cYhthROLAMqKSxKkZc17OYjTmhodDNPb5MCQGs0jo-eN_zD2De5tGptNEQaa_epvG4bbUyjlU82WeXQCZK3JLUfR5pyMnKu5Rb6Eu0G7RXerqrK3rl9DYtdLVbvYd1Jxo5E09lw2v4ahkifwPrQHV4uMzu0EvXiS3QvnfToeVzQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=DEEP+REINFORCEMENT+LEARNING+WITH+FAST+UPDATING+RECURRENT+NEURAL+NETWORKS+AND+SLOW+UPDATING+RECURRENT+NEURAL+NETWORKS&rft.inventor=Dunning%2C+Iain+Robert&rft.inventor=Czarnecki%2C+Wojciech&rft.inventor=Jaderberg%2C+Maxwell+Elliot&rft.date=2019-12-05&rft.externalDBID=A1&rft.externalDocID=US2019370637A1 |