Federated Best Arm Identification With Heterogeneous Clients

We study best arm identification in a federated multi-armed bandit setting with a central server and multiple clients, when each client has access to a subset of arms and each arm yields independent Gaussian observations. The goal is to identify the best arm of each client subject to an upper bound...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on information theory Vol. 70; no. 6; pp. 4258 - 4279
Main Authors Chen, Zhirui, Karthik, P. N., Tan, Vincent Y. F., Chee, Yeow Meng
Format Journal Article
LanguageEnglish
Published New York IEEE 01.06.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We study best arm identification in a federated multi-armed bandit setting with a central server and multiple clients, when each client has access to a subset of arms and each arm yields independent Gaussian observations. The goal is to identify the best arm of each client subject to an upper bound on the error probability; here, the best arm is one that has the largest average value of the means averaged across all clients having access to the arm. Our interest is in the asymptotics as the error probability vanishes. We provide an asymptotic lower bound on the growth rate of the expected stopping time of any algorithm. Furthermore, we show that for any algorithm whose upper bound on the expected stopping time matches with the lower bound up to a multiplicative constant (almost-optimal algorithm), the ratio of any two consecutive communication time instants must be bounded, a result that is of independent interest. We thereby infer that an algorithm can communicate no more sparsely than at exponential time instants in order to be almost-optimal. For the class of almost-optimal algorithms, we present the first-of-its-kind asymptotic lower bound on the expected number of communication rounds until stoppage. We propose a novel algorithm that communicates at exponential time instants, and demonstrate that it is asymptotically almost-optimal.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0018-9448
1557-9654
DOI:10.1109/TIT.2023.3338027