Minimizing the AoI in Resource-Constrained Multi-Source Relaying Systems: Dynamic and Learning-based Scheduling
We consider a multi-source relaying system where independent sources randomly generate status update packets which are sent to the destination with the aid of a relay through unreliable links. We develop transmission scheduling policies to minimize the weighted sum average age of information (AoI) s...
Saved in:
Main Authors | , , , |
---|---|
Format | Journal Article |
Language | English |
Published |
10.03.2022
|
Online Access | Get full text |
DOI | 10.48550/arxiv.2203.05656 |
Cover
Summary: | We consider a multi-source relaying system where independent sources randomly
generate status update packets which are sent to the destination with the aid
of a relay through unreliable links. We develop transmission scheduling
policies to minimize the weighted sum average age of information (AoI) subject
to transmission capacity and long-run average resource constraints. We
formulate a stochastic control optimization problem and solve it using a
constrained Markov decision process (CMDP) approach and a drift-plus-penalty
method. The CMDP problem is solved by transforming it into an MDP problem using
the Lagrangian relaxation method. We theoretically analyze the structure of
optimal policies for the MDP problem and subsequently propose a structure-aware
algorithm that returns a practical near-optimal policy. Using the
drift-plus-penalty method, we devise a near-optimal low-complexity policy that
performs the scheduling decisions dynamically. We also develop a model-free
deep reinforcement learning policy for which the Lyapunov optimization theory
and a dueling double deep Q-network are employed. The complexities of the
proposed policies are analyzed. Simulation results are provided to assess the
performance of our policies and validate the theoretical results. The results
show up to 91% performance improvement compared to a baseline policy. |
---|---|
DOI: | 10.48550/arxiv.2203.05656 |