TY - JOUR
T1 - Anticipatory scheduling of synchromodal transport using approximate dynamic programming
AU - Rivera, Arturo E.Pérez
AU - Mes, Martijn R.K.
N1 - Funding Information:
This research has been partially funded by the Dutch Institute for Advanced Logistics, DINALOG, under the project SynchromodalIT.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/4/13
Y1 - 2022/4/13
N2 - We study the problem of scheduling container transport in synchromodal networks considering stochastic demand. In synchromodal networks, the transportation modes can be selected dynamically given the actual circumstances and performance is measured over the entire network and over time. We model this problem as a Markov Decision Process and propose a heuristic solution based on Approximate Dynamic Programming (ADP). Due to the multi-period nature of the problem, the one-step look-ahead perspective of the traditional approximate value-iteration approach can make the heuristic flounder and end in a local-optimum. To tackle this, we study the inclusion of Bayesian exploration using the Value of Perfect Information (VPI). In a series of numerical experiments, we show how VPI significantly improves a traditional ADP algorithm. Furthermore, we show how our proposed ADP–VPI combination achieves significant gains over common practice heuristics.
AB - We study the problem of scheduling container transport in synchromodal networks considering stochastic demand. In synchromodal networks, the transportation modes can be selected dynamically given the actual circumstances and performance is measured over the entire network and over time. We model this problem as a Markov Decision Process and propose a heuristic solution based on Approximate Dynamic Programming (ADP). Due to the multi-period nature of the problem, the one-step look-ahead perspective of the traditional approximate value-iteration approach can make the heuristic flounder and end in a local-optimum. To tackle this, we study the inclusion of Bayesian exploration using the Value of Perfect Information (VPI). In a series of numerical experiments, we show how VPI significantly improves a traditional ADP algorithm. Furthermore, we show how our proposed ADP–VPI combination achieves significant gains over common practice heuristics.
KW - UT-Hybrid-D
KW - Approximate dynamic programming
KW - Intermodal transport
KW - Reinforcement learning
KW - Synchromodal transport
KW - Anticipatory scheduling
UR - http://www.scopus.com/inward/record.url?scp=85127936126&partnerID=8YFLogxK
U2 - 10.1007/s10479-022-04668-6
DO - 10.1007/s10479-022-04668-6
M3 - Article
AN - SCOPUS:85127936126
SN - 0254-5330
JO - Annals of operations research
JF - Annals of operations research
ER -