Clicked: Curriculum Learning Connects Knowledge Distillation for Four-Player No-Limit Texas Hold'em Poker
Published in | Chinese Control and Decision Conference pp. 833 - 837 |
---|---|
Main Authors | , , , , |
Format | Conference Proceeding |
Language | English |
Published | IEEE 25.05.2024 |
ISSN | 1948-9447 |
DOI | 10.1109/CCDC62350.2024.10587560 |
Summary: | Equilibrium solving is the core problem of imperfect-information games. Although game-solving paradigms range from knowledge-guided tree search, through game reinforcement learning, to offline blueprint strategy pre-training with online adaptation, the key to game solving is strategy transferability, that is, how to train response strategies that are robust against various types of opponents. In this paper, we propose CLICKED, a method in which curriculum learning connects knowledge distillation, to solve multi-player imperfect-information games. Different kinds of four-player curricula are designed for curriculum knowledge distillation. Based on this framework, we designed a four-player Texas Hold'em poker agent that competed in the 2023 AAMAS Imperfect Information Game and won third place. In addition, the experimental results demonstrate the feasibility of the method. |
---|---|