Predictive Entropy Search for Efficient Global Optimization of Black-box Functions (1406.2541v1)

Published 10 Jun 2014 in stat.ML and cs.LG

Abstract: We propose a novel information-theoretic approach for Bayesian optimization called Predictive Entropy Search (PES). At each iteration, PES selects the next evaluation point that maximizes the expected information gained with respect to the global maximum. PES codifies this intractable acquisition function in terms of the expected reduction in the differential entropy of the predictive distribution. This reformulation allows PES to obtain approximations that are both more accurate and efficient than other alternatives such as Entropy Search (ES). Furthermore, PES can easily perform a fully Bayesian treatment of the model hyperparameters while ES cannot. We evaluate PES in both synthetic and real-world applications, including optimization problems in machine learning, finance, biotechnology, and robotics. We show that the increased accuracy of PES leads to significant gains in optimization performance.

Authors (3)
Citations (615)

Summary

  • The paper introduces Predictive Entropy Search (PES) to efficiently maximize expected information gain about the global optimum of black-box functions.
  • PES employs Gaussian process approximations and random feature sampling to overcome computational challenges in entropy estimation.
  • Empirical results show that PES outperforms traditional methods like EI and ES in both synthetic and real-world optimization tasks.

Predictive Entropy Search for Efficient Global Optimization of Black-box Functions

The paper, "Predictive Entropy Search for Efficient Global Optimization of Black-box Functions," presents a new acquisition function for Bayesian optimization aimed at making each expensive function evaluation as informative as possible. The authors introduce Predictive Entropy Search (PES), which selects evaluation points to maximize the expected information gain about the location of the global maximum of a black-box function.

Bayesian Optimization Framework

Bayesian optimization is an effective method for optimizing costly, noisy, derivative-free functions, a setting common in fields such as robotics, biotechnology, and machine learning. This paper builds on that framework with an information-theoretic acquisition function that sequentially selects evaluation points so as to identify the function's global maximum efficiently. The key innovation of PES is that it measures progress by the expected reduction in uncertainty about the location of the global maximum, shifting the acquisition strategy from estimating the function's value directly to modeling the uncertainty about where its peak lies.
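As a schematic illustration, the outer loop of Bayesian optimization alternates between conditioning on the data gathered so far and maximizing an acquisition function to pick the next evaluation. The sketch below is a minimal, hypothetical 1-D version with a pluggable acquisition; the distance-based exploration rule stands in for a real GP-based acquisition such as PES, and the inner random search stands in for a gradient-based optimizer:

```python
import random

def bayesian_optimization(objective, acquisition, bounds, n_init=3, n_iter=10, seed=0):
    """Generic Bayesian-optimization outer loop (1-D, maximization).
    `acquisition(x, X, Y)` scores a candidate given the data so far;
    a real implementation would score candidates with a GP surrogate."""
    rng = random.Random(seed)
    lo, hi = bounds
    # Initial design: a few random evaluations.
    X = [rng.uniform(lo, hi) for _ in range(n_init)]
    Y = [objective(x) for x in X]
    for _ in range(n_iter):
        # Maximize the acquisition by dense random search
        # (placeholder for a proper inner optimizer).
        cands = [rng.uniform(lo, hi) for _ in range(200)]
        x_next = max(cands, key=lambda x: acquisition(x, X, Y))
        X.append(x_next)
        Y.append(objective(x_next))
    best = max(range(len(Y)), key=lambda i: Y[i])
    return X[best], Y[best]

# Toy run: a distance-to-data "uncertainty" score (pure exploration).
explore = lambda x, X, Y: min(abs(x - xi) for xi in X)
x_best, y_best = bayesian_optimization(lambda x: -(x - 0.3) ** 2, explore, (0.0, 1.0))
```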

Predictive Entropy Search (PES)

PES improves upon existing methods such as Entropy Search (ES) by offering a more accurate and computationally efficient approximation. Both methods aim to maximize the mutual information between the next observation and the location of the global maximum. Exploiting the symmetry of mutual information, PES rewrites this quantity as the expected reduction in the differential entropy of the predictive distribution over function values, which sidesteps the non-analytic entropy computations over the maximizer's location that make prior entropic approaches difficult to approximate accurately.
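In the Gaussian-process setting, each predictive distribution in this reformulation is (approximately) Gaussian, so every entropy term reduces to the closed-form entropy of a Gaussian, H = 1/2 log(2 pi e sigma^2), and the acquisition becomes a difference of such terms averaged over sampled maximizer locations. A minimal sketch, assuming the predictive variances have already been computed by some surrogate model:

```python
import math

def gaussian_entropy(var):
    # Differential entropy of a 1-D Gaussian with variance `var`:
    # H = 0.5 * log(2 * pi * e * var)
    return 0.5 * math.log(2 * math.pi * math.e * var)

def pes_acquisition(pred_var, conditioned_vars):
    """PES value at a candidate x: entropy of the unconditioned
    predictive p(y | D, x) minus the average entropy of the predictives
    p(y | D, x, x*_m) conditioned on M sampled maximizer locations x*_m.
    With Gaussian predictives, only the variances matter."""
    avg_conditioned = sum(gaussian_entropy(v) for v in conditioned_vars) / len(conditioned_vars)
    return gaussian_entropy(pred_var) - avg_conditioned
```

Conditioning on a maximizer location can only shrink the predictive variance, so the acquisition is largest where knowing x* would change the predictive distribution the most.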

Sampling and Practical Implementation

The paper provides a detailed methodology for approximating the intractable parts of the PES acquisition function. It introduces novel techniques for sampling from the posterior distribution over the global maximum using Gaussian process (GP) approximations with random features. This enables efficient computation of the necessary expectations, even over the high-dimensional feature space.
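A sketch of this sampling step, under simplifying assumptions (1-D inputs, a squared-exponential kernel whose spectral density yields Gaussian random frequencies, and a grid search in place of a gradient-based maximizer); the function name and defaults are illustrative, not the paper's implementation:

```python
import numpy as np

def sample_maximizer(X, y, lengthscale=0.2, noise=1e-2, M=200, n_grid=500, rng=None):
    """Draw one approximate sample of the global maximizer x* of a 1-D GP
    posterior via random Fourier features of the SE kernel:
        k(x, x') ~ phi(x).T @ phi(x'),  phi_i(x) = sqrt(2/M) cos(w_i x + b_i),
    with w_i ~ N(0, 1/lengthscale^2) and b_i ~ U[0, 2*pi]. The GP sample
    f(x) = theta.T @ phi(x) is then maximized on a grid over [0, 1]."""
    rng = rng or np.random.default_rng(0)
    X = np.asarray(X, float)
    y = np.asarray(y, float)
    w = rng.normal(0.0, 1.0 / lengthscale, M)
    b = rng.uniform(0.0, 2 * np.pi, M)
    phi = lambda x: np.sqrt(2.0 / M) * np.cos(np.outer(x, w) + b)  # (n, M)
    Phi = phi(X)
    # Bayesian linear regression in feature space: theta | D ~ N(mu, Sigma).
    A = Phi.T @ Phi / noise + np.eye(M)
    Sigma = np.linalg.inv(A)
    Sigma = (Sigma + Sigma.T) / 2  # symmetrize against round-off
    mu = Sigma @ Phi.T @ y / noise
    theta = rng.multivariate_normal(mu, Sigma)
    grid = np.linspace(0.0, 1.0, n_grid)
    return grid[np.argmax(phi(grid) @ theta)]
```

Because each sampled function is a finite linear model, drawing many maximizer samples only requires cheap linear algebra rather than repeated conditioning of the full GP.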

Moreover, PES differentiates itself by enabling a complete Bayesian treatment of the model hyperparameters, which is a significant advancement over similar methods like ES that typically require fixed hyperparameters. This fully Bayesian approach ensures robustness in scenarios with varying model uncertainties and parameter sensitivities.
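Because the PES objective is an expectation, marginalizing over hyperparameters amounts to averaging the acquisition over posterior hyperparameter samples rather than plugging in a single point estimate. A minimal sketch, with `acq` and the sample format as hypothetical placeholders:

```python
def marginalized_acquisition(acq, x, hyper_samples):
    """Fully Bayesian acquisition: average acq(x, theta) over posterior
    samples of the GP hyperparameters theta (e.g. drawn by MCMC),
    instead of evaluating it at one fixed hyperparameter setting."""
    return sum(acq(x, theta) for theta in hyper_samples) / len(hyper_samples)

# Toy usage with a made-up acquisition depending on one hyperparameter.
toy_acq = lambda x, theta: -theta * x ** 2
value = marginalized_acquisition(toy_acq, 2.0, [1.0, 2.0, 3.0])
```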

Experimental Results

The empirical evaluation demonstrates the effectiveness of PES across both synthetic and real-world optimization tasks. The paper highlights substantial improvements in optimization performance when using PES, as quantified by the reduction in immediate regret compared to ES and expected improvement (EI) methods. Synthetic experiments confirm PES's superiority, especially in multimodal landscapes where efficient exploration-exploitation trade-offs are crucial. In real-world scenarios, PES shows competitive performance across varied domains, such as neural network tuning and portfolio optimization.
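Immediate regret, the metric used in these comparisons, is simply the gap between the true global optimum and the objective value at the method's current recommendation:

```python
def immediate_regret(f_opt, f_recommended):
    """Immediate regret after t evaluations: |f(x*) - f(x_t)|, where x_t
    is the optimizer's current recommendation and x* the true maximizer.
    Lower is better; a faster decay means faster optimization."""
    return abs(f_opt - f_recommended)
```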

Implications and Future Directions

The introduction of PES extends the theoretical and practical landscape of Bayesian optimization. The methodology not only enhances convergence rates but also provides a pathway for more adaptive exploration strategies in complex optimization problems. This advancement implies significant potential for application in diverse areas like automated machine learning, experimental design, and derivative-free optimization.

Further research could explore enhancing PES for higher-dimensional and more complex decision spaces. Additionally, integrating PES into frameworks for adaptive experimental design could enhance its applicability to dynamic, data-driven environments where model parameters evolve over time.

Overall, this paper's contribution solidifies PES as a valuable tool in global optimization, providing both rigorous theoretical insights and practical benefits for data-driven decision-making processes.