Abstract
We are interested in maximizing smooth functions where observations are noisy and expensive to compute, as might arise in computer simulations or laboratory experiments. We derive a knowledge gradient policy, which chooses measurements that maximize the expected value of information, while using a locally parametric belief model built from linear approximations with radial basis functions. The method uses a compact representation of the function that avoids storing the entire measurement history, as is typically required by nonparametric methods. Our technique values a measurement by its ability to improve our estimate of the optimum, capturing correlations in our beliefs about neighboring regions of the function, without imposing any a priori assumptions on the global shape of the underlying function. Experimental work suggests that the method adapts to a range of arbitrary, continuous functions and appears to reliably find the optimal solution. Moreover, the policy is shown to be asymptotically optimal.
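To make the idea of a knowledge gradient policy concrete, the sketch below estimates, by Monte Carlo, the expected improvement in the best posterior mean from one additional measurement under a correlated Gaussian belief over a discrete grid. This is a minimal illustration only, not the paper's method: it uses a generic covariance matrix in place of the locally parametric radial-basis-function belief model, and all names (`knowledge_gradient`, `noise_var`, the toy squared-exponential covariance) are assumptions for illustration.

```python
import numpy as np

def knowledge_gradient(mu, Sigma, noise_var, n_samples=2000, rng=None):
    """Monte Carlo estimate of the knowledge gradient for each candidate.

    mu        : (K,) prior means of the function at the candidate points
    Sigma     : (K, K) prior covariance capturing correlated beliefs
    noise_var : variance of the measurement noise
    Returns a (K,) array: expected increase in the maximum posterior mean
    obtained by measuring each candidate once.
    """
    rng = np.random.default_rng() if rng is None else rng
    K = len(mu)
    best_now = mu.max()
    kg = np.empty(K)
    z = rng.standard_normal(n_samples)
    for x in range(K):
        # Predictive posterior-mean update after measuring x:
        # mu_new = mu + Sigma[:, x] * Z / sqrt(Sigma[x, x] + noise_var), Z ~ N(0, 1)
        scale = np.sqrt(Sigma[x, x] + noise_var)
        mu_new = mu[:, None] + np.outer(Sigma[:, x], z) / scale  # (K, n_samples)
        kg[x] = mu_new.max(axis=0).mean() - best_now
    return kg

if __name__ == "__main__":
    # Toy example: squared-exponential covariance over a 1-D grid, standing in
    # for the correlated belief structure described in the abstract.
    grid = np.linspace(0.0, 1.0, 25)
    Sigma = np.exp(-((grid[:, None] - grid[None, :]) ** 2) / 0.02)
    mu = np.zeros_like(grid)
    kg = knowledge_gradient(mu, Sigma, noise_var=0.5)
    print("next measurement at x =", grid[np.argmax(kg)])
```

The policy would then measure the candidate with the largest knowledge gradient, update the belief with the observed value, and repeat.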
| Original language | English (US) |
|---|---|
| Pages (from-to) | 401-425 |
| Number of pages | 25 |
| Journal | Journal of Global Optimization |
| Volume | 63 |
| Issue number | 2 |
| DOIs | |
| State | Published - Oct 22 2015 |
All Science Journal Classification (ASJC) codes
- Control and Optimization
- Applied Mathematics
- Business, Management and Accounting (miscellaneous)
- Computer Science Applications
- Management Science and Operations Research
Keywords
- Local parametric model
- Optimal learning
- Ranking and selection
- Stochastic search