A Reinforcement Learning Hyper-Heuristic with Cumulative Rewards for Dual-Peak Time-Varying Network Optimization in Heterogeneous Multi-Trip Vehicle Routing
Urban logistics face complexity due to traffic congestion, fleet heterogeneity, warehouse constraints, and driver workload balancing, especially in the Heterogeneous Multi-Trip Vehicle Routing Problem with Time Windows and Time-Varying Networks (HMTVRPTW-TVN). We develop a mixed-integer linear progr...
Saved in:
Published in | Algorithms Vol. 18; no. 9; p. 536 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
22.08.2025
|
Online Access | Get full text |
ISSN | 1999-4893 1999-4893 |
DOI | 10.3390/a18090536 |
Cover
Loading…