
Bayesian Optimization

Bayesian optimization is built for situations where testing every option is slow or expensive.

It does not guess randomly or rely on gradients. Instead, it builds a model of what it already knows and uses that model to decide what to try next.

This helps you get better results in fewer steps.

It is widely used in machine learning, engineering, scientific research, and any case where you have a black box function with costly evaluations.

What Is Bayesian Optimization?

Bayesian optimization is a strategy for finding the best result of a function when you cannot directly observe its structure.

This function is treated as a black box. You can try inputs and record the results, but you do not know how the outputs are generated.

Bayesian optimization uses a surrogate model, often a Gaussian process, to estimate the function's behavior. This model is updated with each new observation.

At every step, the algorithm uses an acquisition function to decide where to sample next. This function scores candidate inputs based on two factors:

  • How likely the point is to improve on the current best
  • How uncertain the model is about that point

By balancing both goals, Bayesian optimization can explore new areas while still focusing on the most promising inputs.

The process continues until a stopping condition is reached. This might be a limit on the number of trials or a point where no further improvement is expected.

How Bayesian Optimization Works

The process can be broken into a few repeatable steps.

  1. Start with some initial data. Sample the objective function at a few points, either randomly or using a method like Latin Hypercube Sampling.
  2. Build a surrogate model. Use a Gaussian process to predict the function and how uncertain the predictions are. The model gives a mean and variance for each point.
  3. Update the posterior distribution. Based on current observations, update the model to better reflect what has already been learned.
  4. Use an acquisition function. Score potential inputs with a utility function. The most common are:
    • Expected Improvement (EI)
    • Probability of Improvement (PI)
    • Upper Confidence Bound (UCB)
  5. Select the next input to test. Choose the input with the highest score from the acquisition function.
  6. Evaluate the function. Run the test at the selected point and collect the result.
  7. Add the result to the data. Update the model with the new data and repeat the cycle.

This loop continues until you have enough data or reach a performance threshold.
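The steps above can be sketched in a few dozen lines of Python. This is a minimal illustration, not a production implementation: it uses scikit-learn's Gaussian process as the surrogate, Expected Improvement as the acquisition function, a fixed grid of candidates instead of a proper inner optimizer, and a cheap toy objective standing in for the expensive black box.

```python
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

def objective(x):
    # Toy stand-in for the black box; in practice each call is expensive.
    return np.sin(3 * x) + 0.5 * x

rng = np.random.default_rng(0)
candidates = np.linspace(0.0, 2.0, 201).reshape(-1, 1)

# Step 1: a few initial samples
X = rng.uniform(0.0, 2.0, size=(3, 1))
y = objective(X).ravel()

# alpha adds a small jitter so repeated points stay numerically stable
gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), alpha=1e-6, normalize_y=True)

for _ in range(10):
    # Steps 2-3: refit the surrogate to everything observed so far
    gp.fit(X, y)
    mu, sigma = gp.predict(candidates, return_std=True)

    # Step 4: Expected Improvement (maximization form)
    best = y.max()
    sigma = np.maximum(sigma, 1e-9)
    z = (mu - best) / sigma
    ei = (mu - best) * norm.cdf(z) + sigma * norm.pdf(z)

    # Steps 5-7: evaluate the best-scoring candidate and record the result
    x_next = candidates[np.argmax(ei)]
    X = np.vstack([X, [x_next]])
    y = np.append(y, objective(x_next)[0])

print("best input:", X[np.argmax(y)].item(), "best value:", y.max())
```

With only 13 evaluations in total, the loop typically lands near the true maximum of this toy function, which is the point of the method: most of the budget is spent in the promising region rather than spread uniformly.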

Why It Works Well on Black Box Problems

Bayesian optimization is useful when you do not have access to the function’s gradient or formula.

It works entirely through observation. Each function evaluation gives a single data point.

This is especially helpful when the function is:

  • Noisy
  • Expensive to evaluate
  • Non-differentiable
  • Unknown in shape

By treating the function as a distribution rather than a fixed formula, the algorithm can learn as it goes. It avoids wasting time in unpromising areas and focuses effort where it matters.

Surrogate Models

A surrogate model stands in for the real function. It provides predictions and a confidence score for each input.

In most cases, this model is a Gaussian process, which provides a probability distribution over possible functions.

This means it does not just estimate the function's value but also estimates how sure it is about that value.

That uncertainty helps the acquisition function balance between exploring new areas and exploiting known good ones.

Other models, such as random forests or neural networks, can also be used, but Gaussian processes are common due to their well-understood behavior and performance in low to moderate dimensions.
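A small illustration of the mean-plus-uncertainty behavior described above, using scikit-learn's `GaussianProcessRegressor` on three made-up observations (the kernel length scale is fixed here for reproducibility; normally it would be fit to the data):

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# Three observations of an unknown function
X = np.array([[0.0], [0.5], [1.0]])
y = np.array([0.0, 0.8, 0.1])

# optimizer=None keeps the hand-picked length scale fixed
gp = GaussianProcessRegressor(kernel=RBF(length_scale=0.3),
                              optimizer=None, normalize_y=True)
gp.fit(X, y)

# The surrogate returns both a prediction and its uncertainty
mean, std = gp.predict(np.array([[0.5], [0.25], [2.0]]), return_std=True)
```

The uncertainty is essentially zero at the observed point (0.5), small between observations (0.25), and large far from any data (2.0). It is exactly this pattern that lets the acquisition function trade off exploration against exploitation.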

Acquisition Functions

The acquisition function decides where to test next.

It is a separate function built on top of the surrogate model. It scores each possible input based on expected value and uncertainty.

The three most common types are:

  • Expected Improvement (EI): Picks points expected to improve on the current best result. It considers both high values and high uncertainty.
  • Probability of Improvement (PI): Focuses only on the likelihood that a point will beat the current best, regardless of how much better it is.
  • Upper Confidence Bound (UCB): Adds a weight to uncertainty. A higher weight means more exploration. A lower weight means more focus on known good areas.

Maximizing the acquisition function is cheap because it queries only the surrogate model, never the real function. Once the best-scoring input is found, that input is tested, and the result is used to improve the model.
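All three acquisition functions can be written directly in terms of the surrogate's predicted mean and standard deviation. A sketch of their standard maximization forms (the small floor on sigma guards against division by zero at observed points):

```python
import numpy as np
from scipy.stats import norm

def expected_improvement(mu, sigma, best):
    # EI: expected amount by which a point beats the current best
    sigma = np.maximum(sigma, 1e-9)
    z = (mu - best) / sigma
    return (mu - best) * norm.cdf(z) + sigma * norm.pdf(z)

def probability_of_improvement(mu, sigma, best):
    # PI: chance of beating the current best, ignoring the margin
    sigma = np.maximum(sigma, 1e-9)
    return norm.cdf((mu - best) / sigma)

def upper_confidence_bound(mu, sigma, kappa=2.0):
    # UCB: larger kappa weights uncertainty more, favoring exploration
    return mu + kappa * sigma
```

Note how the definitions encode the trade-offs in the list above: at a point whose predicted mean equals the current best, PI is exactly 0.5 no matter how uncertain the model is, while EI still grows with sigma, rewarding uncertainty.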

When To Use Bayesian Optimization

Bayesian optimization is not the fastest method for every problem. But it is effective when:

  • The function is expensive to test
  • You cannot access gradients
  • You do not know the shape of the function
  • You want to minimize the number of tests

Examples include:

  • Tuning hyperparameters in machine learning
  • Simulating physical systems
  • Testing configurations in engineering
  • Designing lab experiments
  • Optimizing black box processes in software

In these cases, eliminating even a few evaluations can yield significant savings in time or cost.

Tools and Libraries

Several libraries are available for applying Bayesian optimization in Python:

  • scikit-optimize: Simple and works well with scikit-learn.
  • bayes_opt: Lightweight, well-documented, and uses Gaussian processes.
  • GPyOpt: Built on top of GPy for advanced Gaussian process modeling.

Each lets you define an objective function, set parameter bounds, and run the optimization. The model and acquisition logic are handled behind the scenes.

FAQ

What is Bayesian optimization used for?

It is used for problems where each test is slow or expensive. Examples include tuning machine learning models, optimizing simulations, and running lab experiments.

How does it work?

It builds a surrogate model, often a Gaussian process, to estimate the function. It then uses an acquisition function to pick the next input based on prediction and uncertainty.

What is a surrogate model?

A surrogate model is a fast, probabilistic estimate of the real function. It helps reduce the number of actual evaluations needed to find a good solution.

What is the acquisition function?

It is a utility function that helps decide where to test next. It balances the desire to try high-performing areas and explore uncertain ones.

What are the common acquisition functions?

Expected Improvement (EI), Probability of Improvement (PI), and Upper Confidence Bound (UCB).

Why use Bayesian optimization over random search?

It uses past results to make smarter decisions, so it usually needs fewer tests to find a good answer.

What are its limitations?

It can struggle with very high-dimensional problems or when the surrogate model is not accurate. It also has some overhead from fitting the model and optimizing the acquisition function.

Is it only for machine learning?

No. It is also used in engineering, chemistry, physics, finance, and other fields where tests are expensive and slow.

What tools are available?

scikit-optimize, bayes_opt, and GPyOpt are common Python libraries that simplify the process.

When should I avoid it?

If your function is cheap to test or has thousands of parameters, simpler methods like random search or gradient descent may work better.

Summary

Bayesian optimization helps you find better results with fewer trials.

Instead of testing every input, it builds a model of the function and uses it to make informed choices. It chooses each new test based on both the expected outcome and the uncertainty.

It works best when:

  • Each test is expensive
  • You cannot see or define the function
  • You need to optimize with limited evaluations

By building and refining a surrogate model, Bayesian optimization focuses your resources on the most promising parts of the input space.

It is a useful tool in many fields, including machine learning, engineering, and experimental science.
