Multiagent off-screen behavior prediction in football
Omidshafiei, Shayegan, Hennes, Daniel, Garnelo, Marta, Wang, Zhe, Recasens, Adria, Tarassov, Eugene, Yang, Yi, Elie, Romuald, Connor, Jerome T, Muller, Paul, Mackraz, Natalie, Cao, Kris, Moreno, Pol, Sprechmann, Pablo, Hassabis, Demis, Graham, Ian, Spearman, William, Heess, Nicolas, Tuyls, Karl
Published in Scientific reports (23.05.2022)
Published in Scientific reports (23.05.2022)
Get full text
Journal Article
Mastering the game of Stratego with model-free multiagent reinforcement learning
Perolat, Julien, De Vylder, Bart, Hennes, Daniel, Tarassov, Eugene, Strub, Florian, de Boer, Vincent, Muller, Paul, Connor, Jerome T., Burch, Neil, Anthony, Thomas, McAleer, Stephen, Elie, Romuald, Cen, Sarah H., Wang, Zhe, Gruslys, Audrunas, Malysheva, Aleksandra, Khan, Mina, Ozair, Sherjil, Timbers, Finbarr, Pohlen, Toby, Eccles, Tom, Rowland, Mark, Lanctot, Marc, Lespiau, Jean-Baptiste, Piot, Bilal, Omidshafiei, Shayegan, Lockhart, Edward, Sifre, Laurent, Beauguerlange, Nathalie, Munos, Remi, Silver, David, Singh, Satinder, Hassabis, Demis, Tuyls, Karl
Published in Science (American Association for the Advancement of Science) (02.12.2022)
Published in Science (American Association for the Advancement of Science) (02.12.2022)
Get full text
Journal Article
Developing, evaluating and scaling learning agents in multi-agent environments
Gemp, Ian, Anthony, Thomas, Bachrach, Yoram, Bhoopchand, Avishkar, Bullard, Kalesha, Connor, Jerome, Dasagi, Vibhavari, De Vylder, Bart, Duéñez-Guzmán, Edgar A., Elie, Romuald, Everett, Richard, Hennes, Daniel, Hughes, Edward, Khan, Mina, Lanctot, Marc, Larson, Kate, Lever, Guy, Liu, Siqi, Marris, Luke, McKee, Kevin R., Muller, Paul, Pérolat, Julien, Strub, Florian, Tacchetti, Andrea, Tarassov, Eugene, Wang, Zhe, Tuyls, Karl
Published in Ai communications (01.01.2022)
Published in Ai communications (01.01.2022)
Get full text
Journal Article
Understanding the performance gap between online and offline alignment algorithms
Tang, Yunhao, Guo, Daniel Zhaohan, Zheng, Zeyu, Calandriello, Daniele, Cao, Yuan, Tarassov, Eugene, Munos, Rémi, Bernardo Ávila Pires, Valko, Michal, Cheng, Yong, Dabney, Will
Published in arXiv.org (14.05.2024)
Published in arXiv.org (14.05.2024)
Get full text
Paper
Journal Article
Offline Regularised Reinforcement Learning for Large Language Models Alignment
Richemond, Pierre Harvey, Tang, Yunhao, Guo, Daniel, Calandriello, Daniele, Azar, Mohammad Gheshlaghi, Rafailov, Rafael, Pires, Bernardo Avila, Tarassov, Eugene, Spangher, Lucas, Ellsworth, Will, Severyn, Aliaksei, Mallinson, Jonathan, Shani, Lior, Shamir, Gil, Joshi, Rishabh, Liu, Tianqi, Munos, Remi, Piot, Bilal
Year of Publication 29.05.2024
Year of Publication 29.05.2024
Get full text
Journal Article
Time-series Imputation of Temporally-occluded Multiagent Trajectories
Shayegan Omidshafiei, Hennes, Daniel, Garnelo, Marta, Tarassov, Eugene, Wang, Zhe, Elie, Romuald, Connor, Jerome T, Muller, Paul, Graham, Ian, Spearman, William, Tuyls, Karl
Published in arXiv.org (08.06.2021)
Published in arXiv.org (08.06.2021)
Get full text
Paper
Journal Article
Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments
Gemp, Ian, Thomas, Anthony, Bachrach, Yoram, Bhoopchand, Avishkar, Bullard, Kalesha, Connor, Jerome, Dasagi, Vibhavari, De Vylder, Bart, Duenez-Guzman, Edgar, Elie, Romuald, Everett, Richard, Hennes, Daniel, Hughes, Edward, Khan, Mina, Lanctot, Marc, Larson, Kate, Lever, Guy, Liu, Siqi, Marris, Luke, McKee, Kevin R, Muller, Paul, Perolat, Julien, Strub, Florian, Tacchetti, Andrea, Tarassov, Eugene, Wang, Zhe, Tuyls, Karl
Published in arXiv.org (22.09.2022)
Published in arXiv.org (22.09.2022)
Get full text
Paper
Journal Article
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Perolat, Julien, de Vylder, Bart, Hennes, Daniel, Tarassov, Eugene, Strub, Florian, de Boer, Vincent, Muller, Paul, Connor, Jerome T, Burch, Neil, Thomas, Anthony, McAleer, Stephen, Elie, Romuald, Cen, Sarah H, Wang, Zhe, Gruslys, Audrunas, Malysheva, Aleksandra, Khan, Mina, Ozair, Sherjil, Timbers, Finbarr, Pohlen, Toby, Eccles, Tom, Rowland, Mark, Lanctot, Marc, Jean-Baptiste Lespiau, Piot, Bilal, Shayegan Omidshafiei, Lockhart, Edward, Sifre, Laurent, Beauguerlange, Nathalie, Munos, Remi, Silver, David, Singh, Satinder, Hassabis, Demis, Tuyls, Karl
Published in arXiv.org (30.06.2022)
Published in arXiv.org (30.06.2022)
Get full text
Paper
Journal Article