Reinforcement learning for Order Acceptance on a shared resource

M. Mainegra Hing, Aart van Harten, Peter Schuur

Research output: Contribution to conferencePaperAcademic

62 Downloads (Pure)

Abstract

Order acceptance (OA) is one of the main functions in business control. Basically, OA involves for each order a reject/accept decision. Always accepting an order when capacity is available could disable the system to accept more convenient orders in the future with opportunity losses as a consequence. Another important aspect is the availability of information to the decision-maker. We use the stochastic modeling approach, Markov decision theory and learning methods from artificial intelligence to find decision policies, even under uncertain information. Reinforcement learning (RL) is a quite new approach in OA. It is capable of learning both the decision policy and incomplete information, simultaneously. It is shown here that RL works well compared with heuristics. Finding good heuristics in a complex situation is a delicate art. It is demonstrated that a RL trained agent can be used to support the detection of good heuristics.
Original languageUndefined
Pages1454-1458
DOIs
Publication statusPublished - 2002
Event9th International Conference on Neural Information Processing. ICONIP '02 -
Duration: 1 Jan 20021 Jan 2002

Conference

Conference9th International Conference on Neural Information Processing. ICONIP '02
Period1/01/021/01/02
Other2002

Keywords

  • Markov Processes
  • IR-55872
  • order processing
  • Optimisation
  • learning (artificial intelligence)
  • Decision theory
  • Resource Allocation

Cite this