Search Results - "FIROIU, Vlad" :: K.UTB vyhledávací portál

Loading…

At Human Speed: Deep Reinforcement Learning with Action Delay

by Firoiu, Vlad, Ju, Tina, Tenenbaum, Josh
Year of Publication 16.10.2018

Get full text

Journal Article

Loading…

Proving Theorems using Incremental Learning and Hindsight Experience Replay

by Aygün, Eser, Orseau, Laurent, Anand, Ankit, Glorot, Xavier, Firoiu, Vlad, Zhang, Lei M, Precup, Doina, Mourad, Shibl
Year of Publication 20.12.2021

Get full text

Journal Article

Loading…

Learning to Prove from Synthetic Theorems

by Aygün, Eser, Ahmed, Zafarali, Anand, Ankit, Firoiu, Vlad, Glorot, Xavier, Orseau, Laurent, Precup, Doina, Mourad, Shibl
Year of Publication 19.06.2020

Get full text

Journal Article

Loading…

Training a First-Order Theorem Prover from Synthetic Data

by Firoiu, Vlad, Aygun, Eser, Anand, Ankit, Ahmed, Zafarali, Glorot, Xavier, Orseau, Laurent, Zhang, Lei, Precup, Doina, Mourad, Shibl
Year of Publication 05.03.2021

Get full text

Journal Article

Loading…

Automated curricula through setter-solver interactions

by Racaniere, Sebastien, Lampinen, Andrew K, Santoro, Adam, Reichert, David P, Firoiu, Vlad, Lillicrap, Timothy P
Year of Publication 27.09.2019

Get full text

Journal Article

Loading…

Automatic Inference for Inverting Software Simulators via Probabilistic Programming

by Saeedi, Ardavan, Firoiu, Vlad, Mansinghka, Vikash
Year of Publication 31.05.2015

Get full text

Journal Article

Loading…

Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning

by Firoiu, Vlad, Whitney, William F, Tenenbaum, Joshua B
Year of Publication 20.02.2017

Get full text

Journal Article

Loading…

Improving alignment of dialogue agents via targeted human judgements

by Glaese, Amelia, McAleese, Nat, Trębacz, Maja, Aslanides, John, Firoiu, Vlad, Ewalds, Timo, Rauh, Maribeth, Weidinger, Laura, Chadwick, Martin, Thacker, Phoebe, Campbell-Gillingham, Lucy, Uesato, Jonathan, Huang, Po-Sen, Comanescu, Ramona, Yang, Fan, See, Abigail, Dathathri, Sumanth, Greig, Rory, Chen, Charlie, Fritz, Doug, Elias, Jaume Sanchez, Green, Richard, Mokrá, Soňa, Fernando, Nicholas, Wu, Boxi, Foley, Rachel, Young, Susannah, Gabriel, Iason, Isaac, William, Mellor, John, Hassabis, Demis, Kavukcuoglu, Koray, Hendricks, Lisa Anne, Irving, Geoffrey
Year of Publication 28.09.2022

Get full text

Journal Article

Loading…

DISTRIBUTED TRAINING USING ACTOR-CRITIC REINFORCEMENT LEARNING WITH OFF-POLICY CORRECTION FACTORS

by Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 18.04.2024

Get full text

Patent

Loading…

Distributed training using actor-critic reinforcement learning with off-policy correction factors

by Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 09.01.2024

Get full text

Patent

Loading…

DISTRIBUTED TRAINING USING ACTOR-CRITIC REINFORCEMENT LEARNING WITH OFF-POLICY CORRECTION FACTORS

by Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 18.05.2023

Get full text

Patent

Loading…

Distributed training using actor-critic reinforcement learning with off-policy correction factors

by Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 28.02.2023

Get full text

Patent

Loading…

Proving Theorems using Incremental Learning and Hindsight Experience Replay

by Aygün, Eser, Orseau, Laurent, Anand, Ankit, Glorot, Xavier, Firoiu, Vlad, Zhang, Lei M, Precup, Doina, Shibl Mourad
Published in arXiv.org (20.12.2021)

Get full text

Paper

Loading…

Learning to Prove from Synthetic Theorems

by Aygün, Eser, Zafarali Ahmed, Anand, Ankit, Firoiu, Vlad, Glorot, Xavier, Orseau, Laurent, Precup, Doina, Shibl Mourad
Published in arXiv.org (19.06.2020)

Get full text

Paper

Loading…

Training a First-Order Theorem Prover from Synthetic Data

by Firoiu, Vlad, Aygun, Eser, Anand, Ankit, Zafarali Ahmed, Glorot, Xavier, Orseau, Laurent, Zhang, Lei, Precup, Doina, Shibl Mourad
Published in arXiv.org (06.04.2021)

Get full text

Paper

Loading…

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

by Espeholt, Lasse, Soyer, Hubert, Munos, Remi, Simonyan, Karen, Mnih, Volodymir, Ward, Tom, Doron, Yotam, Firoiu, Vlad, Harley, Tim, Dunning, Iain, Legg, Shane, Kavukcuoglu, Koray
Year of Publication 05.02.2018

Get full text

Journal Article

Loading…

Automated curricula through setter-solver interactions

by Racaniere, Sebastien, Lampinen, Andrew K, Santoro, Adam, Reichert, David P, Firoiu, Vlad, Lillicrap, Timothy P
Published in arXiv.org (22.01.2020)

Get full text

Paper

Loading…

Automatic Inference for Inverting Software Simulators via Probabilistic Programming

by Saeedi, Ardavan, Firoiu, Vlad, Mansinghka, Vikash
Published in arXiv.org (31.05.2015)

Get full text

Paper

Loading…

Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning

by Firoiu, Vlad, Whitney, William F, Tenenbaum, Joshua B
Published in arXiv.org (08.05.2017)

Get full text

Paper

Loading…

DISTRIBUTED TRAINING USING ACTOR-CRITIC REINFORCEMENT LEARNING WITH OFF-POLICY CORRECTION FACTORS

by Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 04.02.2021

Get full text

Patent

Refine Results

Format

Subject Area

Topic

Language

Year of Publication

Database