DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network th...

Full description

Saved in:

Bibliographic Details
Main Authors	Dunning, Iain Robert, Czarnecki, Wojciech, Jaderberg, Maxwell Elliot
Format	Patent
Language	English
Published	05.12.2019
Subjects	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online Access	Get full text

Cover

Loading…

Abstract	Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network.
AbstractList	Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes selecting an action to be performed by the agent using both a slow updating recurrent neural network and a fast updating recurrent neural network that receives a fast updating input that includes the hidden state of the slow updating recurrent neural network.
Author	Jaderberg, Maxwell Elliot Czarnecki, Wojciech Dunning, Iain Robert
Author_xml	– fullname: Dunning, Iain Robert – fullname: Czarnecki, Wojciech – fullname: Jaderberg, Maxwell Elliot
BookMark	eNqNyr0KgzAUQOEM7dC_d7jQuaANKB2DXmtomshNQkaRkk5FBX1_WqEP4HTg8O3Zph_6uGNzidgAodSVoQKfqB0oFKSlvkOQroZKWAe-KYVbFmHhiRal0ZNQv7hg6GFB6BKsMmGFPbLtu_tM8fTvgZ0rdEV9iePQxmnsXrGPc-vtNUlvPE8ynouUr1NfZOA8Tg
ContentType	Patent
DBID	EVB
DatabaseName	esp@cenet
DatabaseTitleList
Database_xml	– sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Chemistry Sciences Physics
ExternalDocumentID	US2019370637A1
GroupedDBID	EVB
ID	FETCH-epo_espacenet_US2019370637A13
IEDL.DBID	EVB
IngestDate	Fri Jul 19 12:46:43 EDT 2024
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-epo_espacenet_US2019370637A13
Notes	Application Number: US201916425717
OpenAccessLink	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20191205&DB=EPODOC&CC=US&NR=2019370637A1
ParticipantIDs	epo_espacenet_US2019370637A1
PublicationCentury	2000
PublicationDate	20191205
PublicationDateYYYYMMDD	2019-12-05
PublicationDate_xml	– month: 12 year: 2019 text: 20191205 day: 05
PublicationDecade	2010
PublicationYear	2019
RelatedCompanies	DeepMind Technologies Limited
RelatedCompanies_xml	– name: DeepMind Technologies Limited
Score	3.244507
Snippet	Methods, systems, and apparatus, including computer programs encoded on computer storage media, for reinforcement learning. One of the methods includes...
SourceID	epo
SourceType	Open Access Repository
SubjectTerms	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Title	DEEP REINFORCEMENT LEARNING WITH FAST UPDATING RECURRENT NEURAL NETWORKS AND SLOW UPDATING RECURRENT NEURAL NETWORKS
URI	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20191205&DB=EPODOC&locale=&CC=US&NR=2019370637A1
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjR1dS8Mw8Bjz802n4seUgNK34taPZXsY0rWpm860pK3b21i7DgTphqv4972EqXsaPoXcHQe5cLlcch8Ad9PM6KTN9lzPqGxhNjUsPbU6uW7nluy8l7Vnmar2yVv9xHoa2-MKvP_kwqg6oV-qOCJqVIb6Xqrzevn3iOWp2MrVffqGoMWDH3c9be0do_NhNGzN63VZGHiBq7luN4k0LhTOpGiPqYO-0g5epKkMAGOvPZmXstw0Kv4R7IbIryiPoZIXNThwf3qv1WD_Zf3lXYM9FaOZrRC41sPVCZQeYyERbMDRiXNVQX4yZI7gA_5IRoO4T3wnikkSek4sQYK5iRCSirNEOEMc4lEgniPicI9Ew2D0D9pTuPVZ7PZ1XMjkV26TJNpctXkG1WJR5OdAZqbRME06a7WM1MI96sytOeotmktqTZuUXkB9G6fL7egrOJRTFfNh16Fafnzm12i5y_RGCfwbQfaS3Q
link.rule.ids	230,309,783,888,25578,76884
linkProvider	European Patent Office
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3dT8IwEL8Q_MA3RY0fqE00eyPCPhg8EDO2zk1Gt3Sb8EbYGImJGURm_Pe9Nag8EZ-a3DVN7prr9dq73wE8zFK5l7S7i2aqly3MZrLaTNRe1tQytey8l3bnqUD7ZB0nVl8m2qQC7z-1MAIn9EuAI6JFpWjvhTivV3-PWJbIrVw_Jm9IWj7ZUd-SNtExBh9yS5OsQZ8GvuWbkmn241BiXPAUHf2xbmCstIeX7G6JtE9fB2VdymrbqdjHsB_genlxApUsr0PN_Om9VofD0ebLuw4HIkczXSNxY4frUygsSgPCqcswiDMFID_xqMGZy57J2I0cYhthROLAMqKSxKkZc17OYjTmhodDNPb5MCQGs0jo-eN_zD2De5tGptNEQaa_epvG4bbUyjlU82WeXQCZK3JLUfR5pyMnKu5Rb6Eu0G7RXerqrK3rl9DYtdLVbvYd1Jxo5E09lw2v4ahkifwPrQHV4uMzu0EvXiS3QvnfToeVzQ
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=DEEP+REINFORCEMENT+LEARNING+WITH+FAST+UPDATING+RECURRENT+NEURAL+NETWORKS+AND+SLOW+UPDATING+RECURRENT+NEURAL+NETWORKS&rft.inventor=Dunning%2C+Iain+Robert&rft.inventor=Czarnecki%2C+Wojciech&rft.inventor=Jaderberg%2C+Maxwell+Elliot&rft.date=2019-12-05&rft.externalDBID=A1&rft.externalDocID=US2019370637A1