Proving Theorems using Incremental Learning and Hindsight Experience Replay
Aygün, Eser, Orseau, Laurent, Anand, Ankit, Glorot, Xavier, Firoiu, Vlad, Zhang, Lei M, Precup, Doina, Mourad, Shibl
Year of Publication 20.12.2021
Journal Article
Learning to Prove from Synthetic Theorems
Aygün, Eser, Ahmed, Zafarali, Anand, Ankit, Firoiu, Vlad, Glorot, Xavier, Orseau, Laurent, Precup, Doina, Mourad, Shibl
Year of Publication 19.06.2020
Journal Article
Training a First-Order Theorem Prover from Synthetic Data
Firoiu, Vlad, Aygun, Eser, Anand, Ankit, Ahmed, Zafarali, Glorot, Xavier, Orseau, Laurent, Zhang, Lei, Precup, Doina, Mourad, Shibl
Year of Publication 05.03.2021
Journal Article
Automated curricula through setter-solver interactions
Racaniere, Sebastien, Lampinen, Andrew K, Santoro, Adam, Reichert, David P, Firoiu, Vlad, Lillicrap, Timothy P
Year of Publication 27.09.2019
Journal Article
Improving alignment of dialogue agents via targeted human judgements
Glaese, Amelia, McAleese, Nat, Trębacz, Maja, Aslanides, John, Firoiu, Vlad, Ewalds, Timo, Rauh, Maribeth, Weidinger, Laura, Chadwick, Martin, Thacker, Phoebe, Campbell-Gillingham, Lucy, Uesato, Jonathan, Huang, Po-Sen, Comanescu, Ramona, Yang, Fan, See, Abigail, Dathathri, Sumanth, Greig, Rory, Chen, Charlie, Fritz, Doug, Elias, Jaume Sanchez, Green, Richard, Mokrá, Soňa, Fernando, Nicholas, Wu, Boxi, Foley, Rachel, Young, Susannah, Gabriel, Iason, Isaac, William, Mellor, John, Hassabis, Demis, Kavukcuoglu, Koray, Hendricks, Lisa Anne, Irving, Geoffrey
Year of Publication 28.09.2022
Journal Article
DISTRIBUTED TRAINING USING ACTOR-CRITIC REINFORCEMENT LEARNING WITH OFF-POLICY CORRECTION FACTORS
Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 18.04.2024
Patent
Distributed training using actor-critic reinforcement learning with off-policy correction factors
Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 09.01.2024
Patent
DISTRIBUTED TRAINING USING ACTOR-CRITIC REINFORCEMENT LEARNING WITH OFF-POLICY CORRECTION FACTORS
Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 18.05.2023
Patent
Distributed training using actor-critic reinforcement learning with off-policy correction factors
Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 28.02.2023
Patent
Learning to Prove from Synthetic Theorems
Aygün, Eser, Ahmed, Zafarali, Anand, Ankit, Firoiu, Vlad, Glorot, Xavier, Orseau, Laurent, Precup, Doina, Mourad, Shibl
Published in arXiv.org (19.06.2020)
Paper
Training a First-Order Theorem Prover from Synthetic Data
Firoiu, Vlad, Aygun, Eser, Anand, Ankit, Ahmed, Zafarali, Glorot, Xavier, Orseau, Laurent, Zhang, Lei, Precup, Doina, Mourad, Shibl
Published in arXiv.org (06.04.2021)
Paper
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Espeholt, Lasse, Soyer, Hubert, Munos, Remi, Simonyan, Karen, Mnih, Volodymir, Ward, Tom, Doron, Yotam, Firoiu, Vlad, Harley, Tim, Dunning, Iain, Legg, Shane, Kavukcuoglu, Koray
Year of Publication 05.02.2018
Journal Article
Automated curricula through setter-solver interactions
Racaniere, Sebastien, Lampinen, Andrew K, Santoro, Adam, Reichert, David P, Firoiu, Vlad, Lillicrap, Timothy P
Published in arXiv.org (22.01.2020)
Paper
DISTRIBUTED TRAINING USING ACTOR-CRITIC REINFORCEMENT LEARNING WITH OFF-POLICY CORRECTION FACTORS
Soyer, Hubert Josef, Mnih, Volodymyr, Ward, Thomas, Doron, Yotam, Simonyan, Karen, Firoiu, Vlad, Harley, Timothy James Alexander, Dunning, Iain Robert, Munos, Remi, Espeholt, Lasse, Kavukcuoglu, Koray
Year of Publication 04.02.2021
Patent