DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network th...

Full description

Saved in:
Bibliographic Details
Main Authors Dunning, Iain Robert, Czarnecki, Wojciech, Jaderberg, Maxwell Elliot
Format Patent
LanguageEnglish
Published 05.12.2019
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network.
AbstractList Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network.
Author Jaderberg, Maxwell Elliot
Czarnecki, Wojciech
Dunning, Iain Robert
Author_xml – fullname: Dunning, Iain Robert
– fullname: Czarnecki, Wojciech
– fullname: Jaderberg, Maxwell Elliot
BookMark eNqNyr0KgzAUQOEM7dC_d7jQuaANKB2DXmtomshNQkaRkk5FBX1_WqEP4HTg8O3Zph_6uGNzidgAodSVoQKfqB0oFKSlvkOQroZKWAe-KYVbFmHhiRal0ZNQv7hg6GFB6BKsMmGFPbLtu_tM8fTvgZ0rdEV9iePQxmnsXrGPc-vtNUlvPE8ynouUr1NfZOA8Tg
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
ExternalDocumentID US2019370637A1
GroupedDBID EVB
ID FETCH-epo_espacenet_US2019370637A13
IEDL.DBID EVB
IngestDate Fri Jul 19 12:46:43 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_US2019370637A13
Notes Application Number: US201916425717
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20191205&DB=EPODOC&CC=US&NR=2019370637A1
ParticipantIDs epo_espacenet_US2019370637A1
PublicationCentury 2000
PublicationDate 20191205
PublicationDateYYYYMMDD 2019-12-05
PublicationDate_xml – month: 12
  year: 2019
  text: 20191205
  day: 05
PublicationDecade 2010
PublicationYear 2019
RelatedCompanies DeepMind Technologies Limited
RelatedCompanies_xml – name: DeepMind Technologies Limited
Score 3.244507
Snippet Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes...
SourceID epo
SourceType Open Access Repository
SubjectTerms CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
Title DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20191205&DB=EPODOC&locale=&CC=US&NR=2019370637A1
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjR1dS8Mw8Bjz802n4seUgNK34taPZXsY0rWpm860pK3b21i7DgTphqv4972EqXsaPoXcHQe5cLlcch8Ad9PM6KTN9lzPqGxhNjUsPbU6uW7nluy8l7Vnmar2yVv9xHoa2-MKvP_kwqg6oV-qOCJqVIb6Xqrzevn3iOWp2MrVffqGoMWDH3c9be0do_NhNGzN63VZGHiBq7luN4k0LhTOpGiPqYO-0g5epKkMAGOvPZmXstw0Kv4R7IbIryiPoZIXNThwf3qv1WD_Zf3lXYM9FaOZrRC41sPVCZQeYyERbMDRiXNVQX4yZI7gA_5IRoO4T3wnikkSek4sQYK5iRCSirNEOEMc4lEgniPicI9Ew2D0D9pTuPVZ7PZ1XMjkV26TJNpctXkG1WJR5OdAZqbRME06a7WM1MI96sytOeotmktqTZuUXkB9G6fL7egrOJRTFfNh16Fafnzm12i5y_RGCfwbQfaS3Q
link.rule.ids 230,309,783,888,25578,76884
linkProvider European Patent Office
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dT8IwEL8Q_MA3RY0fqE00eyPCPhg8EDO2zk1Gt3Sb8EbYGImJGURm_Pe9Nag8EZ-a3DVN7prr9dq73wE8zFK5l7S7i2aqly3MZrLaTNRe1tQytey8l3bnqUD7ZB0nVl8m2qQC7z-1MAIn9EuAI6JFpWjvhTivV3-PWJbIrVw_Jm9IWj7ZUd-SNtExBh9yS5OsQZ8GvuWbkmn241BiXPAUHf2xbmCstIeX7G6JtE9fB2VdymrbqdjHsB_genlxApUsr0PN_Om9VofD0ebLuw4HIkczXSNxY4frUygsSgPCqcswiDMFID_xqMGZy57J2I0cYhthROLAMqKSxKkZc17OYjTmhodDNPb5MCQGs0jo-eN_zD2De5tGptNEQaa_epvG4bbUyjlU82WeXQCZK3JLUfR5pyMnKu5Rb6Eu0G7RXerqrK3rl9DYtdLVbvYd1Jxo5E09lw2v4ahkifwPrQHV4uMzu0EvXiS3QvnfToeVzQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=DEEP+REINFORCEMENT+LEARNING+WITH+FAST+UPDATING+RECURRENT+NEURAL+NETWORKS+AND+SLOW+UPDATING+RECURRENT+NEURAL+NETWORKS&rft.inventor=Dunning%2C+Iain+Robert&rft.inventor=Czarnecki%2C+Wojciech&rft.inventor=Jaderberg%2C+Maxwell+Elliot&rft.date=2019-12-05&rft.externalDBID=A1&rft.externalDocID=US2019370637A1