TY - JOUR
T1 - A Monotone Approximate Dynamic Programming Approach for the Stochastic Scheduling, Allocation, and Inventory Replenishment Problem
T2 - Applications to Drone and Electric Vehicle Battery Swap Stations
AU - Asadi, Amin
AU - Pinkley, Sarah Nurre
N1 - Funding Information:
Funding: This research is supported by the Arkansas High Performance Computing Center, which is funded through multiple National Science Foundation grants and the Arkansas Economic Develop-ment Commission. Supplemental Material: The online appendix is available at https://doi.org/10.1287/trsc.2021.1108.
Publisher Copyright:
Copyright: © 2022 INFORMS.
PY - 2022/7/1
Y1 - 2022/7/1
N2 - There is a growing interest in using electric vehicles (EVs) and drones for many applications. However, battery-oriented issues, including range anxiety and battery degradation, impede adoption. Battery swap stations are one alternative to reduce these concerns that allow the swap of depleted for full batteries in minutes. We consider the problem of deriving actions at a battery swap station when explicitly considering the uncertain arrival of swap demand, battery degradation, and replacement. We model the operations at a battery swap station using a finite horizon Markov Decision Process model for the stochastic scheduling, allocation, and inventory replenishment problem (SAIRP), which determines when and how many batteries are charged, discharged, and replaced over time. We present theoretical proofs for the monotonicity of the value function and monotone structure of an optimal policy for special SAIRP cases. Due to the curses of dimensionality, we develop a new monotone approximate dynamic programming (ADP) method, which intelligently initializes a value function approximation using regression. In computational tests, we demonstrate the superior performance of the new regression-based monotone ADP method as compared to exact methods and other monotone ADP methods. Further, with the tests, we deduce policy insights for drone swap stations.
AB - There is a growing interest in using electric vehicles (EVs) and drones for many applications. However, battery-oriented issues, including range anxiety and battery degradation, impede adoption. Battery swap stations are one alternative to reduce these concerns that allow the swap of depleted for full batteries in minutes. We consider the problem of deriving actions at a battery swap station when explicitly considering the uncertain arrival of swap demand, battery degradation, and replacement. We model the operations at a battery swap station using a finite horizon Markov Decision Process model for the stochastic scheduling, allocation, and inventory replenishment problem (SAIRP), which determines when and how many batteries are charged, discharged, and replaced over time. We present theoretical proofs for the monotonicity of the value function and monotone structure of an optimal policy for special SAIRP cases. Due to the curses of dimensionality, we develop a new monotone approximate dynamic programming (ADP) method, which intelligently initializes a value function approximation using regression. In computational tests, we demonstrate the superior performance of the new regression-based monotone ADP method as compared to exact methods and other monotone ADP methods. Further, with the tests, we deduce policy insights for drone swap stations.
KW - math.OC
KW - cs.AI
KW - cs.LG
KW - math.PR
KW - Electric vehicles and drones
KW - Battery swap station
KW - Markov decision processes
KW - Battery degradation
KW - Monotone policy and value function
KW - Regression-based initialization
KW - Approximate dynamic programming
KW - 22/1 OA procedure
U2 - 10.1287/trsc.2021.1108
DO - 10.1287/trsc.2021.1108
M3 - Article
SN - 0041-1655
VL - 56
SP - 1085
EP - 1110
JO - Transportation science
JF - Transportation science
IS - 4
ER -