The knowledge-gradient policy for correlated normal beliefs

Peter Frazier, Warren Buckler Powell, Savas Dayanik

Research output: Contribution to journalArticlepeer-review

304 Scopus citations

Abstract

We consider a Bayesian ranking and selection problem with independent normal rewards and a correlated multivariate normal belief on the mean values of these rewards. Because this formulation of the ranking and selection problem models dependence between alternatives' mean values, algorithms may use this dependence to perform efficiently even when the number of alternatives is very large. We propose a fully sequential sampling policy called the knowledge-gradient policy, which is provably optimal in some special cases and has bounded suboptimality in all others. We then demonstrate how this policy may be applied to efficiently maximize a continuous function on a continuous domain while constrained to a fixed number of noisy measurements.

Original languageEnglish (US)
Pages (from-to)599-613
Number of pages15
JournalINFORMS Journal on Computing
Volume21
Issue number4
DOIs
StatePublished - Sep 2009

All Science Journal Classification (ASJC) codes

  • Software
  • Information Systems
  • Computer Science Applications
  • Management Science and Operations Research

Keywords

  • Bayesian
  • Decision analysis: sequential
  • Simulation: design of experiments
  • Statistics

Fingerprint

Dive into the research topics of 'The knowledge-gradient policy for correlated normal beliefs'. Together they form a unique fingerprint.

Cite this