Reinforcement learning for Order Acceptance on a shared resource

M. Mainegra Hing, Aart van Harten, Peter Schuur

Research output: Contribution to conferencePaper

1 Citation (Scopus)
163 Downloads (Pure)


Order acceptance (OA) is one of the main functions in business control. Basically, OA involves for each order a reject/accept decision. Always accepting an order when capacity is available could disable the system to accept more convenient orders in the future with opportunity losses as a consequence. Another important aspect is the availability of information to the decision-maker. We use the stochastic modeling approach, Markov decision theory and learning methods from artificial intelligence to find decision policies, even under uncertain information. Reinforcement learning (RL) is a quite new approach in OA. It is capable of learning both the decision policy and incomplete information, simultaneously. It is shown here that RL works well compared with heuristics. Finding good heuristics in a complex situation is a delicate art. It is demonstrated that a RL trained agent can be used to support the detection of good heuristics.
Original languageUndefined
Publication statusPublished - 2002
Event9th International Conference on Neural Information Processing. ICONIP '02 -
Duration: 1 Jan 20021 Jan 2002


Conference9th International Conference on Neural Information Processing. ICONIP '02


  • Markov Processes
  • IR-55872
  • order processing
  • Optimisation
  • learning (artificial intelligence)
  • Decision theory
  • Resource Allocation

Cite this