A Reinforcement Learning Hyper-Heuristic with Cumulative Rewards for Dual-Peak Time-Varying Network Optimization in Heterogeneous Multi-Trip Vehicle Routing

Urban logistics face complexity due to traffic congestion, fleet heterogeneity, warehouse constraints, and driver workload balancing, especially in the Heterogeneous Multi-Trip Vehicle Routing Problem with Time Windows and Time-Varying Networks (HMTVRPTW-TVN). We develop a mixed-integer linear progr...

Full description

Saved in:
Bibliographic Details
Published inAlgorithms Vol. 18; no. 9; p. 536
Main Authors Wang, Xiaochuan, Li, Na, Jin, Xingchen
Format Journal Article
LanguageEnglish
Published 22.08.2025
Online AccessGet full text
ISSN1999-4893
1999-4893
DOI10.3390/a18090536

Cover

Loading…