Fast active learning for pure exploration in reinforcement learning
Ménard, Pierre, Domingues, Omar Darwiche, Jonsson, Anders, Kaufmann, Emilie, Leurent, Edouard, Valko, Michal
Year of Publication 27.07.2020
Year of Publication 27.07.2020
Get full text
Journal Article
Adaptive Reward-Free Exploration
Kaufmann, Emilie, Ménard, Pierre, Domingues, Omar Darwiche, Jonsson, Anders, Leurent, Edouard, Valko, Michal
Year of Publication 11.06.2020
Year of Publication 11.06.2020
Get full text
Journal Article
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Jonsson, Anders, Kaufmann, Emilie, Ménard, Pierre, Domingues, Omar Darwiche, Leurent, Edouard, Valko, Michal
Year of Publication 10.06.2020
Year of Publication 10.06.2020
Get full text
Journal Article
Optimizing Memory Mapping Using Deep Reinforcement Learning
Wang, Pengming, Sazanovich, Mikita, Ilbeyi, Berkin, Phothilimthana, Phitchaya Mangpo, Purohit, Manish, Tay, Han Yang, Vũ, Ngân, Wang, Miaosen, Paduraru, Cosmin, Leurent, Edouard, Zhernov, Anton, Huang, Po-Sen, Schrittwieser, Julian, Hubert, Thomas, Tung, Robert, Kurylowicz, Paula, Milan, Kieran, Vinyals, Oriol, Mankowitz, Daniel J
Year of Publication 11.05.2023
Year of Publication 11.05.2023
Get full text
Journal Article
Diversifying AI: Towards Creative Chess with AlphaZero
Zahavy, Tom, Veeriah, Vivek, Hou, Shaobo, Waugh, Kevin, Lai, Matthew, Leurent, Edouard, Tomasev, Nenad, Schut, Lisa, Hassabis, Demis, Singh, Satinder
Published in arXiv.org (31.07.2024)
Get full text
Published in arXiv.org (31.07.2024)
Paper
Budgeted Reinforcement Learning in Continuous State Space
Carrara, Nicolas, Leurent, Edouard, Laroche, Romain, Urvoy, Tanguy, Maillard, Odalric-Ambrym, Pietquin, Olivier
Year of Publication 03.03.2019
Year of Publication 03.03.2019
Get full text
Journal Article
Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning
Wang, Kaiwen, Kidambi, Rahul, Sullivan, Ryan, Agarwal, Alekh, Dann, Christoph, Michi, Andrea, Gelmi, Marco, Li, Yunxuan, Gupta, Raghav, Dubey, Avinava, Ramé, Alexandre, Ferret, Johan, Cideron, Geoffrey, Hou, Le, Yu, Hongkun, Ahmed, Amr, Mehta, Aranyak, Léonard Hussenot, Bachem, Olivier, Leurent, Edouard
Published in arXiv.org (23.10.2024)
Get full text
Published in arXiv.org (23.10.2024)
Paper
OPTIMIZING ALGORITHMS FOR TARGET PROCESSORS USING REPRESENTATION NEURAL NETWORKS
MICHI, Andrea, MANDHANE, Amol Balkishan, SELVI, Marco, KOHLI, Pushmeet, ZHERNOV, Anton, GELMI, Marco Oreste, IQBAL, Shariq Nadeem, SILVER, David, VINYALS, Oriol, PADURARU, Cosmin, RIEDMILLER, Martin, MANKOWITZ, Daniel J, LEURENT, Edouard
Year of Publication 25.01.2024
Get full text
Year of Publication 25.01.2024
Patent
Approximate Robust Control of Uncertain Dynamical Systems
Leurent, Edouard, Blanco, Yann, Efimov, Denis, Maillard, Odalric-Ambrym
Published in arXiv.org (01.03.2019)
Get full text
Published in arXiv.org (01.03.2019)
Paper