Selecting treatment strategies with dynamic limited-memory influence diagrams

Marcel A.J. van Gerven*, Francisco J. Díez, Babs G. Taal, Peter J.F. Lucas

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

18 Citations (Scopus)

Abstract

Objective: The development of dynamic limited-memory influence diagrams as a framework for representing factorized infinite-horizon partially observable Markov decision processes (POMDPs), the introduction of algorithms for their (approximate) solution, and the application to a dynamic decision problem in clinical oncology. Materials and methods: A dynamic limited-memory influence diagram for high-grade carcinoid tumor pathophysiology was developed in collaboration with an expert physician. Three algorithms, known as single policy updating, single rule updating, and simulated annealing have been examined for approximating the optimal treatment strategy from a space of 1 019 possible strategies. Results: Single policy updating proved intractable for finding a treatment strategy for carcinoid tumors. Single rule updating and simulated annealing both found the treatment strategy that is applied by physicians in practice. Conclusions: Dynamic limited-memory influence diagrams are a suitable framework for the representation of factorized infinite-horizon POMDPs, and the developed algorithms find acceptable solutions under the assumption of limited memory about past observations. The framework allows for finding reasonable treatment strategies for complex dynamic decision problems in medicine.

Original languageEnglish
Pages (from-to)171-186
Number of pages16
JournalArtificial intelligence in medicine
Volume40
Issue number3
DOIs
Publication statusPublished - Jul 2007
Externally publishedYes

Keywords

  • n/a OA procedure
  • Limited-memory influence diagrams
  • Partially observable Markov decision processes
  • Planning
  • Simulated annealing
  • Carcinoid tumors

Fingerprint

Dive into the research topics of 'Selecting treatment strategies with dynamic limited-memory influence diagrams'. Together they form a unique fingerprint.

Cite this