From 6da7ca52e42e8f4e5283a2326115188f4d5276fc Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 10:43:00 +0200 Subject: [PATCH 01/64] docs(installation): add install instructs for windows to be precise: point to the github issue bc there are probably many ways to Rome but not able to test it --- docs/1_installation.rst | 15 ++++++++++++++- 1 file changed, 14 insertions(+), 1 deletion(-) diff --git a/docs/1_installation.rst b/docs/1_installation.rst index a835d9cf1..fc3a1a6ac 100644 --- a/docs/1_installation.rst +++ b/docs/1_installation.rst @@ -13,7 +13,9 @@ environment or on your system directly. The command to install swig on linux mac apt-get install swig -SMAC is tested on Linux and Mac machines with python 3.8, 3.9 and 3.10. +.. note:: + + SMAC is tested on Linux and Mac machines with python >=3.8. Anaconda @@ -76,3 +78,14 @@ SMAC can be installed under Windows in a WSL (Windows Subsystem for Linux). You can find an instruction on how to do this here: :ref:`Experimental`. However, this is experimental and might not work in each case. If you would like to suggest any changes, please let us know. + +Windows (Experimental) +~~~~~~~~~~~~~~~~~~~~~~ + +SMAC can also be installed under Windows. + +.. warning:: + + This is an experimental feature and might not work in each case. + +Please refer to this `issue `_ for installation instructions for SMAC3-1.4 and SMAC3-2.x. \ No newline at end of file From 36d8597a95bb6bd36d1da833f1613248b19c9f78 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 10:46:19 +0200 Subject: [PATCH 02/64] docs(links): fix redirection --- docs/8_faq.rst | 2 +- docs/9_license.rst | 2 +- docs/index.rst | 2 +- 3 files changed, 3 insertions(+), 3 deletions(-) diff --git a/docs/8_faq.rst b/docs/8_faq.rst index 490d66272..111e84455 100644 --- a/docs/8_faq.rst +++ b/docs/8_faq.rst @@ -6,7 +6,7 @@ Should I use SMAC2 or SMAC3? SMAC3 is a reimplementation of the original SMAC tool (`Sequential Model-Based Optimization for General Algorithm Configuration `_, Hutter et al., 2021). However, the reimplementation slightly differs from the original SMAC. For comparisons against the original SMAC, we refer to a stable release of SMAC (v2) in Java - which can be found `here `_. + which can be found `here `_. Since SMAC3 is actively maintained, we recommend to use SMAC3 for any AutoML applications. diff --git a/docs/9_license.rst b/docs/9_license.rst index af0142ed4..c00628f58 100644 --- a/docs/9_license.rst +++ b/docs/9_license.rst @@ -6,4 +6,4 @@ This program is free software: you can redistribute it and/or modify it under th This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. You should have received a copy of the 3-clause BSD license along with this program -(see LICENSE file). If not, see ``_. \ No newline at end of file +(see LICENSE file). If not, see ``_. \ No newline at end of file diff --git a/docs/index.rst b/docs/index.rst index 736448d78..3d8f03520 100644 --- a/docs/index.rst +++ b/docs/index.rst @@ -52,6 +52,6 @@ For the original idea, we refer to: Contact ------- -SMAC3 is developed by ``_. +SMAC3 is developed by ``_. If you want to contribute or found an issue please visit our github page ``_. Our guidelines for contributing to this package can be found `here `_. 
From 80146962ae03ec227efdcb86269095f7858503a7 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 11:29:38 +0200 Subject: [PATCH 03/64] docs(installation): rst -> md --- docs/1_installation.md | 78 +++++++++++++++++++++++++++++++++++ docs/1_installation.rst | 91 ----------------------------------------- 2 files changed, 78 insertions(+), 91 deletions(-) create mode 100644 docs/1_installation.md delete mode 100644 docs/1_installation.rst diff --git a/docs/1_installation.md b/docs/1_installation.md new file mode 100644 index 000000000..52d78b6a8 --- /dev/null +++ b/docs/1_installation.md @@ -0,0 +1,78 @@ +# Installation + +## Requirements + +SMAC is written in python3 and therefore requires an environment with python>=3.8. +Furthermore, the Random Forest used in SMAC requires SWIG as a build dependency. Install it either in your +environment or on your system directly. The command to install swig on linux machines is the following: + +```bash +apt-get install swig +``` + +!!! info + + SMAC is tested on Linux and Mac machines with python >=3.8. + + +## Anaconda + +We recommend using Anaconda to create and activate an environment: + +```bash +conda create -n SMAC python=3.10 +conda activate SMAC +``` + +If you haven't installed swig yet, you can install it directly inside the Anaconda environment: + +```bash +conda install gxx_linux-64 gcc_linux-64 swig +``` + +Now install SMAC via PyPI: + +```bash +pip install smac +``` + +Or alternatively, clone the environment from GitHub directly: + +```bash +git clone https://github.com/automl/SMAC3.git && cd SMAC3 +pip install -e .[dev] +``` + +## Conda-forge + +Installing SMAC from the `conda-forge` channel can be achieved by adding `conda-forge` to your channels with: + +```bash +conda config --add channels conda-forge +conda config --set channel_priority strict +``` + +You must have `conda >= 4.9` installed. To update conda or check your current conda version, please follow the instructions from [the official anaconda documentation](https://docs.anaconda.com/anaconda/install/update-version/). Once the `conda-forge` channel has been enabled, SMAC can be installed with: + +```bash +conda install smac +``` + +Read [SMAC feedstock](https://github.com/conda-forge/smac-feedstock) for more details. + +## Windows via WSL (Experimental) + +SMAC can be installed under Windows in a WSL (Windows Subsystem for Linux). +You can find an instruction on how to do this here: [Experimental](#Experimental). +However, this is experimental and might not work in each case. +If you would like to suggest any changes, please let us know. + +## Windows (Experimental) + +SMAC can also be installed under Windows. + +!!! warning + + This is an experimental feature and might not work in each case. + +Please refer to this [issue](https://github.com/automl/SMAC3/issues/952) for installation instructions for SMAC3-1.4 and SMAC3-2.x. diff --git a/docs/1_installation.rst b/docs/1_installation.rst deleted file mode 100644 index fc3a1a6ac..000000000 --- a/docs/1_installation.rst +++ /dev/null @@ -1,91 +0,0 @@ -Installation -============ - -Requirements -~~~~~~~~~~~~ - -SMAC is written in python3 and therefore requires an environment with python>=3.8. -Furthermore, the Random Forest used in SMAC requires SWIG as a build dependency. Install it either in your -environment or on your system directly. The command to install swig on linux machines is the following: - -.. code-block:: - - apt-get install swig - - -.. 
note:: - - SMAC is tested on Linux and Mac machines with python >=3.8. - - -Anaconda -~~~~~~~~ - -We recommend using Anaconda to create and activate an environment: - -.. code-block:: - - conda create -n SMAC python=3.10 - conda activate SMAC - - -If you haven't installed swig yet, you can install it directly inside the Anaconda environment: - -.. code-block:: - - conda install gxx_linux-64 gcc_linux-64 swig - - -Now install SMAC via PyPI: - -.. code-block:: - - pip install smac - - -Or alternatively, clone the environment from GitHub directly: - -.. code-block:: - - git clone https://github.com/automl/SMAC3.git && cd SMAC3 - pip install -e .[dev] - - -Conda-forge -~~~~~~~~~~~ - -Installing SMAC from the `conda-forge` channel can be achieved by adding `conda-forge` to your channels with: - -.. code:: bash - - conda config --add channels conda-forge - conda config --set channel_priority strict - - -You must have `conda >= 4.9` installed. To update conda or check your current conda version, please follow the instructions from `the official anaconda documentation `_ . Once the `conda-forge` channel has been enabled, SMAC can be installed with: - -.. code:: bash - - conda install smac - - -Read `SMAC feedstock `_ for more details. - -Windows via WSL (Experimental) -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ - -SMAC can be installed under Windows in a WSL (Windows Subsystem for Linux). -You can find an instruction on how to do this here: :ref:`Experimental`. -However, this is experimental and might not work in each case. -If you would like to suggest any changes, please let us know. - -Windows (Experimental) -~~~~~~~~~~~~~~~~~~~~~~ - -SMAC can also be installed under Windows. - -.. warning:: - - This is an experimental feature and might not work in each case. - -Please refer to this `issue `_ for installation instructions for SMAC3-1.4 and SMAC3-2.x. \ No newline at end of file From 2c79a29f1c400b6f8607b12d0b93f182a902a502 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 11:34:51 +0200 Subject: [PATCH 04/64] docs(package_overview): rst -> md --- docs/2_package_overview.md | 48 +++++++++++++++++++++++++++ docs/2_package_overview.rst | 66 ------------------------------------- 2 files changed, 48 insertions(+), 66 deletions(-) create mode 100644 docs/2_package_overview.md delete mode 100644 docs/2_package_overview.rst diff --git a/docs/2_package_overview.md b/docs/2_package_overview.md new file mode 100644 index 000000000..07466da19 --- /dev/null +++ b/docs/2_package_overview.md @@ -0,0 +1,48 @@ +# Package Overview + +SMAC supports you in determining well-performing hyperparameter configurations for your algorithms. By being a robust and flexible framework for [Bayesian Optimization](#), SMAC can improve performance within a few function evaluations. It offers several entry points and pre-sets for typical use cases, such as optimizing hyperparameters, solving low dimensional continuous (artificial) global optimization problems and configuring algorithms to perform well across multiple problem [instances](#). + +## Features + +SMAC has the following characteristics and capabilities: + +#### Global Optimizer +[Bayesian Optimization](#) is used for sample-efficient optimization. + +#### Optimize [Black-Box](#) Functions +Optimization is only aware of input and output. It is agnostic to internals of the function. 
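+
+A minimal sketch of what such a black-box target function can look like (a toy quadratic cost, purely for illustration):
+
+```python
+from ConfigSpace import Configuration, ConfigurationSpace
+
+configspace = ConfigurationSpace({"x": (-5.0, 5.0)})
+
+
+def target_function(config: Configuration, seed: int = 0) -> float:
+    # Whatever happens in here (training, simulation, ...) is opaque to SMAC;
+    # only the returned cost is observed.
+    x = config["x"]
+    return (x - 2) ** 2  # toy cost, minimal at x = 2
+```
+
+SMAC proposes configurations, calls this function, and uses only the returned costs to decide what to try next.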
+ +#### Flexible Hyperparameters +Use categorical, continuous, hierarchical and/or conditional hyperparameters with the well-integrated [ConfigurationSpace](https://automl.github.io/ConfigSpace). SMAC can optimize *up to 100 hyperparameters* efficiently. + +#### Any Objectives +Optimization with any [objective](#) (e.g., accuracy, runtime, cross-validation, ...) is possible. + +#### [Multi-Objective](#) Optimization +Optimize arbitrary number of objectives using scalarized multi-objective algorithms. Both ParEGO [Know06] and mean aggregation strategies are supported. + +#### [Multi-Fidelity](#) Optimization +Judge configurations on multiple [budgets](#) to discard unsuitable configurations early on. This will result in a massive speed-up, depending on the budgets. + +#### [Instances](#) +Find well-performing hyperparameter configurations not only for one instance (e.g. dataset) of an algorithm, but for many. + +#### Command-Line Interface +SMAC can not only be executed within a python file but also from the command line. Consequently, not only algorithms in python can be optimized, but implementations in other languages as well. + +!!! note + Command-line interface has been temporarily disabled in v2.0. Please fall back to v1.4 if you need it. + +## Comparison + +The following table provides an overview of SMAC's capabilities in comparison with other optimization tools. + +| Package | Complex Hyperparameter Space | [Multi-Objective](#) | [Multi-Fidelity](#) | [Instances](#) | Command-Line Interface | Parallelism | +|--------------|------------------------------|----------------------|---------------------|----------------|------------------------|-------------| +| HyperMapper | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ | +| Optuna | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | +| Hyperopt | ✅ | ❌ | ❌ | ❌ | ✅ | ✅ | +| BoTorch | ❌ | ✅ | ✅ | ❌ | ❌ | ✅ | +| OpenBox | ✅ | ✅ | ❌ | ❌ | ❌ | ✅ | +| HpBandSter | ✅ | ❌ | ✅ | ❌ | ❌ | ✅ | +| SMAC | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | diff --git a/docs/2_package_overview.rst b/docs/2_package_overview.rst deleted file mode 100644 index fee310a3f..000000000 --- a/docs/2_package_overview.rst +++ /dev/null @@ -1,66 +0,0 @@ -Package Overview -================ - -SMAC supports you in determining well-performing hyperparameter configurations for your algorithms. By being a robust -and flexible framework for :term:`Bayesian Optimization`, SMAC can improve performance within a few function -evaluations. It offers several entry points and pre-sets for typical use cases, such as optimizing -hyperparameters, solving low dimensional continuous (artificial) global optimization problems and configuring algorithms -to perform well across multiple problem :term:`instances`. - - -Features --------- - -SMAC has the following characteristics and capabilities: - -Global Optimizer - :term:`Bayesian Optimization` is used for sample-efficient optimization. - -Optimize :term:`Black-Box` Functions - Optimization is only aware of input and output. It is agnostic to internals of the function. - -Flexible Hyperparameters - Use categorical, continuous, hierarchical and/or conditional hyperparameters with the well-integrated - `ConfigurationSpace `_. SMAC can optimize *up to 100 hyperparameters* - efficiently. - -Any Objectives - Optimization with any :term:`objective` (e.g., accuracy, runtime, cross-validation, ...) is possible. - -:ref:`Multi-Objective` - Optimize arbitrary number of objectives using scalarized multi-objective algorithms. Both ParEGO [Know06]_ and - mean aggregation strategies are supported. 
- -:ref:`Multi-Fidelity` Optimization - Judge configurations on multiple :term:`budgets` to discard unsuitable configurations - early on. This will result in a massive speed-up, depending on the budgets. - -:ref:`Instances` - Find well-performing hyperparameter configurations not only for one instance (e.g. dataset) of - an algorithm, but for many. - -Command-Line Interface - SMAC can not only be executed within a python file but also from the commandline. Consequently, - not only algorithms in python can be optimized, but implementations in other languages as well. - - .. note :: - - Command-line interface has been temporarily disabled in v2.0. Please fall back to v1.4 if you need it. - - -Comparison ----------- - -The following table provides an overview of SMAC's capabilities in comparison with other optimization tools. - -.. csv-table:: - :header: "Package", "Complex Hyperparameter Space", ":term:`Multi-Objective` ", ":term:`Multi-Fidelity`", ":term:`Instances`", "Command-Line Interface", "Parallelism" - :widths: 10, 10, 10, 10, 10, 10, 10 - - HyperMapper, ✅, ✅, ❌, ❌, ❌, ❌ - Optuna, ✅, ✅, ✅, ❌, ✅, ✅ - Hyperopt, ✅, ❌, ❌, ❌, ✅, ✅ - BoTorch, ❌, ✅, ✅, ❌, ❌, ✅ - OpenBox, ✅, ✅, ❌, ❌, ❌, ✅ - HpBandSter, ✅, ❌, ✅, ❌, ❌, ✅ - SMAC, ✅, ✅, ✅, ✅, ✅, ✅ From bc1989c4bad98a6215e6db97a5c122183c0e0b5a Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 11:48:35 +0200 Subject: [PATCH 05/64] docs(getting_started): rst -> md --- docs/3_getting_started.md | 131 +++++++++++++++++++++++++++++++++ docs/3_getting_started.rst | 145 ------------------------------------- 2 files changed, 131 insertions(+), 145 deletions(-) create mode 100644 docs/3_getting_started.md delete mode 100644 docs/3_getting_started.rst diff --git a/docs/3_getting_started.md b/docs/3_getting_started.md new file mode 100644 index 000000000..854d87971 --- /dev/null +++ b/docs/3_getting_started.md @@ -0,0 +1,131 @@ +# Getting Started + +SMAC needs four core components (configuration space, target function, scenario and a facade) to run an +optimization process, all of which are explained on this page. + +They interact in the following way: + +Interaction of SMAC's components + + +## Configuration Space + +The configuration space defines the search space of the hyperparameters and, therefore, the tunable parameters' legal +ranges and default values. + +```python +from ConfigSpace import ConfigSpace + +cs = ConfigurationSpace({ + "myfloat": (0.1, 1.5), # Uniform Float + "myint": (2, 10), # Uniform Integer + "species": ["mouse", "cat", "dog"], # Categorical +}) +``` + +Please see the documentation of `ConfigSpace `_ for more details. + + +## Target Function + +The target function takes a configuration from the configuration space and returns a performance value. +For example, you could use a Neural Network to predict on your data and get some validation performance. +If, for instance, you would tune the learning rate of the Network's optimizer, every learning rate will +change the final validation performance of the network. This is the target function. +SMAC tries to find the best performing learning rate by trying different values and evaluating the target function - +in an efficient way. + +```python + def train(self, config: Configuration, seed: int) -> float: + model = MultiLayerPerceptron(learning_rate=config["learning_rate"]) + model.fit(...) + accuracy = model.validate(...) + + return 1 - accuracy # SMAC always minimizes (the smaller the better) +``` + +!!! 
warning + SMAC *always* minimizes the value returned from the target function. + + +!!! note + In general, the arguments of the target function depend on the intensifier. However, + in all cases, the first argument must be the configuration (arbitrary argument name is possible here) and a seed. + If you specified instances in the scenario, SMAC requires ``instance`` as argument additionally. If you use + ``SuccessiveHalving`` or ``Hyperband`` as intensifier but you did not specify instances, SMAC passes `budget` as + argument to the target function. But don't worry: SMAC will tell you if something is missing or if something is not + used. + + +!!! warning + SMAC passes either `instance` or `budget` to the target function but never both. + + +## Scenario + +The :ref:`Scenario` is used to provide environment variables. For example, +if you want to limit the optimization process by a time limit or want to specify where to save the results. + +```python +from smac import Scenario + +scenario = Scenario( + configspace=cs, + output_directory=Path("your_output_directory") + walltime_limit=120, # Limit to two minutes + n_trials=500, # Evaluated max 500 trials + n_workers=8, # Use eight workers + ... +) +``` + + +## Facade + +A :ref:`facade` is the entry point to SMAC, which constructs a default optimization +pipeline for you. SMAC offers various facades, which satisfy many common use cases and are crucial to +achieving peak performance. The idea behind the facades is to provide a simple interface to all of SMAC's components, +which is easy to use and understand and without the need of deep diving into the material. However, experts are +invited to change the components to their specific hyperparameter optimization needs. The following +table (horizontally scrollable) shows you what is supported and reveals the default :ref:`components`: + +| | [Black-Box](smac.facade.blackbox_facade) | [Hyperparameter Optimization](smac.facade.hyperparameter_optimization_facade) | [Multi-Fidelity](smac.facade.multi_fidelity_facade) | [Algorithm Configuration](smac.facade.algorithm_configuration_facade) | [Random](smac.facade.random_facade) | [Hyperband](smac.facade.hyperband_facade) | +| --- | --- | --- | --- | --- | --- | --- | +| #Parameters | low | low/medium/high | low/medium/high | low/medium/high | low/medium/high | low/medium/high | +| Supports Instances | ❌ | ✅ | ✅ | ✅ | ❌ | ✅ | +| Supports Multi-Fidelity | ❌ | ❌ | ✅ | ✅ | ❌ | ✅ | +| Initial Design | [Sobol](smac.initial_design.sobol_design) | [Sobol](smac.initial_design.sobol_design) | [Random](smac.initial_design.random_design) | [Default](smac.initial_design.default_design) | [Default](smac.initial_design.default_design) | [Default](smac.initial_design.default_design) | +| Surrogate Model | [Gaussian Process](smac.model.gaussian_process.gaussian_process) | [Random Forest](smac.model.random_forest.random_forest) | [Random Forest](smac.model.random_forest.random_forest) | [Random Forest](smac.model.random_forest.random_forest) | Not used | Not used | +| Acquisition Function | [Expected Improvement](smac.acquisition.function.expected_improvement) | [Log Expected Improvement](smac.acquisition.function.expected_improvement) | [Log Expected Improvement](smac.acquisition.function.expected_improvement) | [Expected Improvement](smac.acquisition.function.expected_improvement) | Not used | Not used | +| Acquisition Maximizer | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random 
Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.random_search) | +| Intensifier | [Default](smac.intensifier.intensifier) | [Default](smac.intensifier.intensifier) | [Hyperband](smac.intensifier.hyperband) | [Hyperband](smac.intensifier.hyperband) | [Default](smac.intensifier.intensifier) | [Hyperband](smac.intensifier.hyperband) | +| Runhistory Encoder | [Default](smac.runhistory.encoder.encoder) | [Log](smac.runhistory.encoder.log_encoder) | [Log](smac.runhistory.encoder.log_encoder) | [Default](smac.runhistory.encoder.encoder) | [Default](smac.runhistory.encoder.encoder) | [Default](smac.runhistory.encoder.encoder) | +| Random Design Probability | 8.5% | 20% | 20% | 50% | Not used | Not used | + + +!!! info + The multi-fidelity facade is the closest implementation to `BOHB `_. + + +!!! note + We want to emphasize that SMAC is a highly modular optimization framework. + The facade accepts many arguments to specify components of the pipeline. Please also note, that in contrast + to previous versions, instantiated objects are passed instead of *kwargs*. + + +The facades can be imported directly from the ``smac`` module. + +```python +from smac import BlackBoxFacade as BBFacade +from smac import HyperparameterOptimizationFacade as HPOFacade +from smac import MultiFidelityFacade as MFFacade +from smac import AlgorithmConfigurationFacade as ACFacade +from smac import RandomFacade as RFacade +from smac import HyperbandFacade as HBFacade + +smac = HPOFacade(scenario=scenario, target_function=train) +smac = MFFacade(scenario=scenario, target_function=train) +smac = ACFacade(scenario=scenario, target_function=train) +smac = RFacade(scenario=scenario, target_function=train) +smac = HBFacade(scenario=scenario, target_function=train) +``` \ No newline at end of file diff --git a/docs/3_getting_started.rst b/docs/3_getting_started.rst deleted file mode 100644 index dbc287368..000000000 --- a/docs/3_getting_started.rst +++ /dev/null @@ -1,145 +0,0 @@ -Getting Started -=============== - -SMAC needs four core components (configuration space, target function, scenario and a facade) to run an -optimization process, all of which are explained on this page. - -They interact in the following way: - -.. image:: images/smac_components_interaction.jpg - :width: 400 - :alt: Interaction of SMAC's components - - -Configuration Space -------------------- - -The configuration space defines the search space of the hyperparameters and, therefore, the tunable parameters' legal -ranges and default values. - -.. code-block:: python - - from ConfigSpace import ConfigSpace - - cs = ConfigurationSpace({ - "myfloat": (0.1, 1.5), # Uniform Float - "myint": (2, 10), # Uniform Integer - "species": ["mouse", "cat", "dog"], # Categorical - }) - -Please see the documentation of `ConfigSpace `_ for more details. - - -Target Function ---------------- - -The target function takes a configuration from the configuration space and returns a performance value. -For example, you could use a Neural Network to predict on your data and get some validation performance. 
-If, for instance, you would tune the learning rate of the Network's optimizer, every learning rate will -change the final validation performance of the network. This is the target function. -SMAC tries to find the best performing learning rate by trying different values and evaluating the target function - -in an efficient way. - -.. code-block:: python - - def train(self, config: Configuration, seed: int) -> float: - model = MultiLayerPerceptron(learning_rate=config["learning_rate"]) - model.fit(...) - accuracy = model.validate(...) - - return 1 - accuracy # SMAC always minimizes (the smaller the better) - -.. warning:: - - SMAC *always* minimizes the value returned from the target function. - - -.. note:: - - In general, the arguments of the target function depend on the intensifier. However, - in all cases, the first argument must be the configuration (arbitrary argument name is possible here) and a seed. - If you specified instances in the scenario, SMAC requires ``instance`` as argument additionally. If you use - ``SuccessiveHalving`` or ``Hyperband`` as intensifier but you did not specify instances, SMAC passes `budget` as - argument to the target function. But don't worry: SMAC will tell you if something is missing or if something is not - used. - - -.. warning:: - - SMAC passes either `instance` or `budget` to the target function but never both. - - -Scenario --------- - -The :ref:`Scenario` is used to provide environment variables. For example, -if you want to limit the optimization process by a time limit or want to specify where to save the results. - -.. code-block:: python - - from smac import Scenario - - scenario = Scenario( - configspace=cs, - output_directory=Path("your_output_directory") - walltime_limit=120, # Limit to two minutes - n_trials=500, # Evaluated max 500 trials - n_workers=8, # Use eight workers - ... - ) - - -Facade ------- - -A :ref:`facade` is the entry point to SMAC, which constructs a default optimization -pipeline for you. SMAC offers various facades, which satisfy many common use cases and are crucial to -achieving peak performance. The idea behind the facades is to provide a simple interface to all of SMAC's components, -which is easy to use and understand and without the need of deep diving into the material. However, experts are -invited to change the components to their specific hyperparameter optimization needs. The following -table (horizontally scrollable) shows you what is supported and reveals the default :ref:`components`: - - -.. 
csv-table:: - :header: "", ":ref:`Black-Box`", ":ref:`Hyperparameter Optimization`", ":ref:`Multi-Fidelity`", ":ref:`Algorithm Configuration`", ":ref:`Random`", ":ref:`Hyperband`" - - "#Parameters", "low", "low/medium/high", "low/medium/high", "low/medium/high", "low/medium/high", "low/medium/high" - "Supports Instances", "❌", "✅", "✅", "✅", "❌", "✅" - "Supports Multi-Fidelity", "❌", "❌", "✅", "✅", "❌", "✅" - "Initial Design", ":ref:`Sobol`", ":ref:`Sobol`", ":ref:`Random`", ":ref:`Default`", ":ref:`Default`", ":ref:`Default`" - "Surrogate Model", ":ref:`Gaussian Process`", ":ref:`Random Forest`", ":ref:`Random Forest`", ":ref:`Random Forest`", "Not used", "Not used" - "Acquisition Function", ":ref:`Expected Improvement`", ":ref:`Log Expected Improvement`", ":ref:`Log Expected Improvement`", ":ref:`Expected Improvement`", "Not used", "Not used" - "Acquisition Maximizer", ":ref:`Local and Sorted Random Search`", ":ref:`Local and Sorted Random Search`", ":ref:`Local and Sorted Random Search`", ":ref:`Local and Sorted Random Search`", ":ref:`Local and Sorted Random Search`", ":ref:`Local and Sorted Random Search`" - "Intensifier", ":ref:`Default`", ":ref:`Default`", ":ref:`Hyperband`", ":ref:`Hyperband`", ":ref:`Default`", ":ref:`Hyperband`", - "Runhistory Encoder", ":ref:`Default`", ":ref:`Log`", ":ref:`Log`", ":ref:`Default`", ":ref:`Default`", ":ref:`Default`" - "Random Design Probability", "8.5%", "20%", "20%", "50%", "Not used", "Not used" - - -.. note:: - - The multi-fidelity facade is the closest implementation to `BOHB `_. - - -.. note:: - - We want to emphasize that SMAC is a highly modular optimization framework. - The facade accepts many arguments to specify components of the pipeline. Please also note, that in contrast - to previous versions, instantiated objects are passed instead of *kwargs*. - - -The facades can be imported directly from the ``smac`` module. - -.. code-block:: python - - from smac import BlackBoxFacade as BBFacade - from smac import HyperparameterOptimizationFacade as HPOFacade - from smac import MultiFidelityFacade as MFFacade - from smac import AlgorithmConfigurationFacade as ACFacade - from smac import RandomFacade as RFacade - from smac import HyperbandFacade as HBFacade - - smac = HPOFacade(scenario=scenario, target_function=train) - smac = MFFacade(scenario=scenario, target_function=train) - smac = ACFacade(scenario=scenario, target_function=train) - smac = RFacade(scenario=scenario, target_function=train) - smac = HBFacade(scenario=scenario, target_function=train) From 2e578ee0c16ada2a9b45c2d1f9a0c31e104874b3 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 11:49:48 +0200 Subject: [PATCH 06/64] docs(minimal_example): rst -> md --- docs/4_minimal_example.md | 32 ++++++++++++++++++++++++++++++++ docs/4_minimal_example.rst | 33 --------------------------------- 2 files changed, 32 insertions(+), 33 deletions(-) create mode 100644 docs/4_minimal_example.md delete mode 100644 docs/4_minimal_example.rst diff --git a/docs/4_minimal_example.md b/docs/4_minimal_example.md new file mode 100644 index 000000000..b8df179a4 --- /dev/null +++ b/docs/4_minimal_example.md @@ -0,0 +1,32 @@ +# Minimal Example + +The following code optimizes a support vector machine on the iris dataset. 
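+
+Besides SMAC itself, this example relies on scikit-learn for the dataset and the classifier; if it is not installed yet, it can be added with:
+
+```bash
+pip install scikit-learn
+```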
+ + +```python +from ConfigSpace import Configuration, ConfigurationSpace + +import numpy as np +from smac import HyperparameterOptimizationFacade, Scenario +from sklearn import datasets +from sklearn.svm import SVC +from sklearn.model_selection import cross_val_score + +iris = datasets.load_iris() + + +def train(config: Configuration, seed: int = 0) -> float: + classifier = SVC(C=config["C"], random_state=seed) + scores = cross_val_score(classifier, iris.data, iris.target, cv=5) + return 1 - np.mean(scores) + + +configspace = ConfigurationSpace({"C": (0.100, 1000.0)}) + +# Scenario object specifying the optimization environment +scenario = Scenario(configspace, deterministic=True, n_trials=200) + +# Use SMAC to find the best configuration/hyperparameters +smac = HyperparameterOptimizationFacade(scenario, train) +incumbent = smac.optimize() +``` \ No newline at end of file diff --git a/docs/4_minimal_example.rst b/docs/4_minimal_example.rst deleted file mode 100644 index c4eb2eb9a..000000000 --- a/docs/4_minimal_example.rst +++ /dev/null @@ -1,33 +0,0 @@ -Minimal Example -=============== - -The following code optimizes a support vector machine on the iris dataset. - - -.. code-block:: python - - from ConfigSpace import Configuration, ConfigurationSpace - - import numpy as np - from smac import HyperparameterOptimizationFacade, Scenario - from sklearn import datasets - from sklearn.svm import SVC - from sklearn.model_selection import cross_val_score - - iris = datasets.load_iris() - - - def train(config: Configuration, seed: int = 0) -> float: - classifier = SVC(C=config["C"], random_state=seed) - scores = cross_val_score(classifier, iris.data, iris.target, cv=5) - return 1 - np.mean(scores) - - - configspace = ConfigurationSpace({"C": (0.100, 1000.0)}) - - # Scenario object specifying the optimization environment - scenario = Scenario(configspace, deterministic=True, n_trials=200) - - # Use SMAC to find the best configuration/hyperparameters - smac = HyperparameterOptimizationFacade(scenario, train) - incumbent = smac.optimize() \ No newline at end of file From 8b850d31ca78bbbc2ac352f8dd5730315edead11 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 14:20:18 +0200 Subject: [PATCH 07/64] docs(references): rst -> md --- docs/{6_references.rst => 6_references.md} | 12 +++++------- 1 file changed, 5 insertions(+), 7 deletions(-) rename docs/{6_references.rst => 6_references.md} (69%) diff --git a/docs/6_references.rst b/docs/6_references.md similarity index 69% rename from docs/6_references.rst rename to docs/6_references.md index f75d7048b..91e73aeac 100644 --- a/docs/6_references.rst +++ b/docs/6_references.md @@ -1,22 +1,20 @@ -References -========== +# References - -.. [LJDR18] L. Li, K. Jamieson, G. DeSalvo, A. Rostamizadeh, A. Talwalkar; +* [LJDR18] L. Li, K. Jamieson, G. DeSalvo, A. Rostamizadeh, A. Talwalkar; Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization; https://jmlr.org/papers/v18/16-558.html -.. [HSSL22] Carl Hvarfner, Danny Stoll, Artur Souza, Marius Lindauer, Frank Hutter, Luigi Nardi; +* [HSSL22] Carl Hvarfner, Danny Stoll, Artur Souza, Marius Lindauer, Frank Hutter, Luigi Nardi; πBO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization; https://arxiv.org/pdf/2204.11051.pdf -.. [Know06] J. Knowles; +* [Know06] J. 
Knowles; ParEGO: A Hybrid Algorithm with on-Line Landscape Approximation for Expensive Multiobjective Optimization Problems; https://www.semanticscholar.org/paper/ParEGO%3A-a-hybrid-algorithm-with-on-line-landscape-Knowles/73b5b196b35fb23e1f908d73b787c2c2942fadb5 -.. [SKKS10] N. Srinivas, S. M. Kakade, A. Krause, M. Seeger; +* [SKKS10] N. Srinivas, S. M. Kakade, A. Krause, M. Seeger; Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design; https://arxiv.org/pdf/0912.3995.pdf \ No newline at end of file From 96e353606e69c5525aeda87747291be15782d191 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:18:56 +0200 Subject: [PATCH 08/64] docs(glossary): rst -> md but links do not work yet --- docs/7_glossary.md | 29 +++++++++++++ docs/7_glossary.rst | 101 -------------------------------------------- 2 files changed, 29 insertions(+), 101 deletions(-) create mode 100644 docs/7_glossary.md delete mode 100644 docs/7_glossary.rst diff --git a/docs/7_glossary.md b/docs/7_glossary.md new file mode 100644 index 000000000..912cd5a10 --- /dev/null +++ b/docs/7_glossary.md @@ -0,0 +1,29 @@ +# Glossary + +- **BB**: See `Black-Box`. +- **BO**: See `Bayesian Optimization`. +- **BOHB**: [Bayesian optimization and Hyperband](https://arxiv.org/abs/1807.01774). +- **CLI**: Command-Line Interface. +- **CV**: Cross-Validation. +- **GP**: Gaussian Process. +- **GP-MCMC**: Gaussian Process with Markov-Chain Monte-Carlo. +- **HB**: See `Hyperband`. +- **HP**: Hyperparameter. +- **MF**: See `Multi-Fidelity`. +- **RF**: Random Forest. +- **ROAR**: See `Random Online Adaptive Racing`. +- **SMAC**: Sequential Model-Based Algorithm Configuration. +- **SMBO**: Sequential Mode-Based Optimization. +- **Bayesian Optimization**: Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions. A Bayesian optimization weights exploration and exploitation to find the minimum of its objective. +- **Black-Box**: Refers to an algorithm being optimized, where only input and output are observable. +- **Budget**: Budget is another word for fidelity. Examples are the number of training epochs or the size of the data subset the algorithm is trained on. However, budget can also be used in the context of instances. For example, if you have 100 instances (let's say we optimize across datasets) and you want to run your algorithm on 10 of them, then the budget is 10. +- **Hyperband**: [Hyperband](https://arxiv.org/abs/1603.06560). A novel bandit-based algorithm for hyperparameter optimization. Hyperband is an extension of successive halving and therefore works with multi-fidelities. +- **Incumbent**: The incumbent is the current best known configuration. +- **Instances**: Often you want to optimize across different datasets, subsets, or even different transformations (e.g. augmentation). In general, each of these is called an instance. Configurations are evaluated on multiple instances so that a configuration is found which performs superior on all instances instead of only a few. +- **Intensification**: A mechanism that governs how many evaluations to perform with each configuration and when to trust a configuration enough to make it the new current best known configuration (the incumbent). 
+- **Multi-Fidelity**: Multi-fidelity refers to running an algorithm on multiple budgets (such as number of epochs or subsets of data) and thereby evaluating the performance prematurely. +- **Multi-Objective**: A multi-objective optimization problem is a problem with more than one objective. The goal is to find a solution that is optimal or at least a good compromise in all objectives. +- **Objective**: An objective is a metric to evaluate the quality or performance of an algorithm. +- **Random Online Adaptive Racing**: Random Online Adaptive Racing. A simple model-free instantiation of the general `SMBO` framework. It selects configurations uniformly at random and iteratively compares them against the current incumbent using the intensification mechanism. See [SMAC extended](https://ai.dmi.unibas.ch/research/reading_group/hutter-et-al-tr2010.pdf) chapter 3.2 for details. +- **Target Function**: Your model, which returns a cost based on the given config, seed, budget, and/or instance. +- **Trial**: Trial is a single run of a target function on a combination of configuration, seed, budget and/or instance. diff --git a/docs/7_glossary.rst b/docs/7_glossary.rst deleted file mode 100644 index 6e60e7095..000000000 --- a/docs/7_glossary.rst +++ /dev/null @@ -1,101 +0,0 @@ -Glossary -======== - -.. glossary:: - - BB - See :term:`Black-Box`. - - BO - See :term:`Bayesian Optimization`. - - BOHB - `Bayesian optimization and Hyperband `_. - - CLI - Command-Line Interface. - - CV - Cross-Validation. - - GP - Gaussian Process. - - GP-MCMC - Gaussian Process with Markov-Chain Monte-Carlo. - - HB - See :term:`Hyperband`. - - HP - Hyperparameter. - - MF - See :term:`Multi-Fidelity`. - - RF - Random Forest. - - ROAR - See :term:`Random Online Adaptive Racing`. - - SMAC - Sequential Model-Based Algorithm Configuration. - - SMBO - Sequential Mode-Based Optimization. - - Bayesian Optimization - Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does - not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions. - A Bayesian optimization weights exploration and exploitation to find the minimum of its objective. - - Black-Box - Refers to an algorithm being optimized, where only input and output are observable. - - Budget - Budget is another word for fidelity. Examples are the number of training epochs or the size of - the data subset the algorithm is trained on. However, budget can also be used in the context of - instances. For example, if you have 100 instances (let's say we optimize across datasets) and you want to run - your algorithm on 10 of them, then the budget is 10. - - Hyperband - `Hyperband `_. A novel bandit-based algorithm for hyperparameter - optimization. Hyperband is an extension of successive halving and therefore works with - multi-fidelities. - - Incumbent - The incumbent is the current best known configuration. - - Instances - Often you want to optimize across different datasets, subsets, or even different transformations (e.g. - augmentation). In general, each of these is called an instance. Configurations are evaluated on multiple - instances so that a configuration is found which performs superior on all instances instead of only - a few. - - Intensification - A mechanism that governs how many evaluations to perform with each configuration and when to trust a - configuration enough to make it the new current best known configuration (the incumbent). 
- - Multi-Fidelity - Multi-fidelity refers to running an algorithm on multiple budgets (such as number of epochs or - subsets of data) and thereby evaluating the performance prematurely. - - Multi-Objective - A multi-objective optimization problem is a problem with more than one objective. - The goal is to find a solution that is optimal or at least a good compromise in all objectives. - - Objective - An objective is a metric to evaluate the quality or performance of an algorithm. - - Random Online Adaptive Racing - Random Online Adaptive Racing. A simple model-free instantiation of the general :term:`SMBO` framework. - It selects configurations uniformly at random and iteratively compares them against the current incumbent - using the intensification mechanism. See `SMAC extended `_ - chapter 3.2 for details. - - Target Function - Your model, which returns a cost based on the given config, seed, budget, and/or instance. - - Trial - Trial is a single run of a target function on a combination of configuration, seed, budget and/or instance. From 163233f664b13c20a0eb7c71dce6b1ebbc65e162 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:25:16 +0200 Subject: [PATCH 09/64] docs(faq): rst -> md --- docs/{8_faq.rst => 8_faq.md} | 33 +++++++++++++++------------------ 1 file changed, 15 insertions(+), 18 deletions(-) rename docs/{8_faq.rst => 8_faq.md} (61%) diff --git a/docs/8_faq.rst b/docs/8_faq.md similarity index 61% rename from docs/8_faq.rst rename to docs/8_faq.md index 111e84455..250b3482e 100644 --- a/docs/8_faq.rst +++ b/docs/8_faq.md @@ -1,30 +1,27 @@ -F.A.Q. -====== +# F.A.Q. - -Should I use SMAC2 or SMAC3? - SMAC3 is a reimplementation of the original SMAC tool (`Sequential Model-Based Optimization for - General Algorithm Configuration `_, Hutter et al., 2021). However, the reimplementation slightly differs from the original +#### Should I use SMAC2 or SMAC3? + SMAC3 is a reimplementation of the original SMAC tool ([Sequential Model-Based Optimization for General Algorithm Configuration](https://ml.informatik.uni-freiburg.de/wp-content/uploads/papers/11-LION5-SMAC.pdf), Hutter et al., 2021). However, the reimplementation slightly differs from the original SMAC. For comparisons against the original SMAC, we refer to a stable release of SMAC (v2) in Java - which can be found `here `_. + which can be found [here](https://www.cs.ubc.ca/labs/algorithms/Projects/SMAC/). Since SMAC3 is actively maintained, we recommend to use SMAC3 for any AutoML applications. -SMAC cannot be imported. +#### SMAC cannot be imported. Try to either run SMAC from SMAC's root directory or try to run the installation first. -pyrfr raises cryptic import errors. +#### pyrfr raises cryptic import errors. Ensure that the gcc used to compile the pyrfr is the same as used for linking during execution. This often happens with Anaconda. See - :ref:`Installation ` for a solution. + [Installation](1_installation.md) for a solution. -How can I use :term:`BOHB` and/or `HpBandSter `_ with SMAC? - The facade MultiFidelityFacade is the closest implementation to :term:`BOHB` and/or `HpBandSter `_. +#### How can I use :term:`BOHB` and/or [HpBandSter](https://github.com/automl/HpBandSter) with SMAC? + The facade MultiFidelityFacade is the closest implementation to :term:`BOHB` and/or [HpBandSter](https://github.com/automl/HpBandSter). -I discovered a bug or SMAC does not behave as expected. Where should I report to? +#### I discovered a bug or SMAC does not behave as expected. Where should I report to? 
Open an issue in our issue list on GitHub. Before you report a bug, please make sure that: * Your bug hasn't already been reported in our issue tracker. @@ -38,13 +35,13 @@ I discovered a bug or SMAC does not behave as expected. Where should I report to * Feel free to add a screenshot showing the issue. -I want to contribute code or discuss a new idea. Where should I report to? - SMAC uses the `GitHub issue-tracker `_ to also take care +#### I want to contribute code or discuss a new idea. Where should I report to? + SMAC uses the [GitHub issue-tracker](https://github.com/automl/SMAC3/issues) to also take care of questions and feedback and is the preferred location for discussing new features and ongoing work. Please also have a look at our - `contribution guide `_. + [contribution guide](https://github.com/automl/SMAC3/blob/main/CONTRIBUTING.md). -What is the meaning of *deterministic*? +#### What is the meaning of *deterministic*? If the ``deterministic`` flag is set to `False` the target function is assumed to be non-deterministic. To evaluate a configuration of a non-deterministic algorithm, multiple runs with different seeds will be evaluated to determine the performance of that configuration on one instance. @@ -53,6 +50,6 @@ What is the meaning of *deterministic*? target function. -Why does SMAC not run on Colab/Mac and crashes with the error "Child process not yet created"? +#### Why does SMAC not run on Colab/Mac and crashes with the error "Child process not yet created"? SMAC uses pynisher to enforce time and memory limits on the target function runner. However, pynisher may not always work on specific setups. To overcome this error, it is recommended to remove limitations to make SMAC run. From 31140b8fc56fbfbb1dd9fe779e260474de207750 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:27:19 +0200 Subject: [PATCH 10/64] docs(license): rst -> md --- docs/{9_license.rst => 9_license.md} | 7 +++---- 1 file changed, 3 insertions(+), 4 deletions(-) rename docs/{9_license.rst => 9_license.md} (59%) diff --git a/docs/9_license.rst b/docs/9_license.md similarity index 59% rename from docs/9_license.rst rename to docs/9_license.md index c00628f58..a1f2f4ca7 100644 --- a/docs/9_license.rst +++ b/docs/9_license.md @@ -1,9 +1,8 @@ -License -======= +# License This program is free software: you can redistribute it and/or modify it under the terms of the 3-clause BSD license -(please see the LICENSE file). +(please see the [LICENSE](https://github.com/automl/SMAC3/blob/main/LICENSE.txt) file). This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. You should have received a copy of the 3-clause BSD license along with this program -(see LICENSE file). If not, see ``_. \ No newline at end of file +(see [LICENSE](https://github.com/automl/SMAC3/blob/main/LICENSE.txt) file). If not, see [BSD-3-Clause license](https://opensource.org/license/BSD-3-Clause). 
\ No newline at end of file From a4e8454035bbb5eb579a72f489406ae7ea570911 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:30:56 +0200 Subject: [PATCH 11/64] docs(experimental): rst -> md --- ...10_experimental.rst => 10_experimental.md} | 43 ++++++++++--------- docs/1_installation.md | 14 +----- 2 files changed, 24 insertions(+), 33 deletions(-) rename docs/{10_experimental.rst => 10_experimental.md} (52%) diff --git a/docs/10_experimental.rst b/docs/10_experimental.md similarity index 52% rename from docs/10_experimental.rst rename to docs/10_experimental.md index a075498c4..984586505 100644 --- a/docs/10_experimental.rst +++ b/docs/10_experimental.md @@ -1,12 +1,10 @@ -Experimental -============ +# Experimental -.. warning:: +!!! warning This part is experimental and might not work in each case. If you would like to suggest any changes, please let us know. -Installation in Windows via WSL ------------------------------- +## Installation in Windows via WSL SMAC can be installed in a WSL (Windows Subsystem for Linux) under Windows. @@ -21,28 +19,31 @@ Download an Anaconda Linux version to drive D under Windows, e.g. D:\\Anaconda3- In the WSL, Windows resources are mounted under /mnt: -.. code:: bash - - cd /mnt/d - bash Anaconda3-2023.03-1-Linux-x86_64 +```bash +cd /mnt/d +bash Anaconda3-2023.03-1-Linux-x86_64 +``` Enter this command to create the environment variable: -.. code:: bash - - export PATH="$PATH:/home/${USER}/anaconda3/bin +```bash +export PATH="$PATH:/home/${USER}/anaconda3/bin +``` -Input 'python' to check if the installation was successful. +Input `python` to check if the installation was successful. **3) Install SMAC** Change to your home folder and install the general software there: -.. code:: bash - - cd /home/${USER} - sudo apt-get install software-properties-common - sudo apt-get update - sudo apt-get install build-essential swig - conda install gxx_linux-64 gcc_linux-64 swig - curl https://raw.githubusercontent.com/automl/smac3/master/requirements.txt | xargs -n 1 -L 1 pip install +```bash +cd /home/${USER} +sudo apt-get install software-properties-common +sudo apt-get update +sudo apt-get install build-essential swig +conda install gxx_linux-64 gcc_linux-64 swig +curl https://raw.githubusercontent.com/automl/smac3/master/requirements.txt | xargs -n 1 -L 1 pip install +``` + +## Installation in Pure Windows +Please refer to this [issue](https://github.com/automl/SMAC3/issues/952) for installation instructions for SMAC3-1.4 and SMAC3-2.x. \ No newline at end of file diff --git a/docs/1_installation.md b/docs/1_installation.md index 52d78b6a8..056f0f5aa 100644 --- a/docs/1_installation.md +++ b/docs/1_installation.md @@ -60,19 +60,9 @@ conda install smac Read [SMAC feedstock](https://github.com/conda-forge/smac-feedstock) for more details. -## Windows via WSL (Experimental) +## Windows (native or via WSL, experimental) SMAC can be installed under Windows in a WSL (Windows Subsystem for Linux). -You can find an instruction on how to do this here: [Experimental](#Experimental). +You can find an instruction on how to do this here: [Experimental](./10_experimental.md). However, this is experimental and might not work in each case. If you would like to suggest any changes, please let us know. - -## Windows (Experimental) - -SMAC can also be installed under Windows. - -!!! warning - - This is an experimental feature and might not work in each case. 
- -Please refer to this [issue](https://github.com/automl/SMAC3/issues/952) for installation instructions for SMAC3-1.4 and SMAC3-2.x. From c44abd0b0269b18eb0423a32fcecd913edd0a27a Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:40:34 +0200 Subject: [PATCH 12/64] Rename file --- smac/facade/old/{boing_facade.py => _boing_facade.py} | 0 1 file changed, 0 insertions(+), 0 deletions(-) rename smac/facade/old/{boing_facade.py => _boing_facade.py} (100%) diff --git a/smac/facade/old/boing_facade.py b/smac/facade/old/_boing_facade.py similarity index 100% rename from smac/facade/old/boing_facade.py rename to smac/facade/old/_boing_facade.py From 9da916e62c2913ce9481c13249da128c26778856 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:41:45 +0200 Subject: [PATCH 13/64] renaem old files --- smac/main/old/{boing.py => _boing.py} | 0 smac/main/old/{turbo.py => _turbo.py} | 0 2 files changed, 0 insertions(+), 0 deletions(-) rename smac/main/old/{boing.py => _boing.py} (100%) rename smac/main/old/{turbo.py => _turbo.py} (100%) diff --git a/smac/main/old/boing.py b/smac/main/old/_boing.py similarity index 100% rename from smac/main/old/boing.py rename to smac/main/old/_boing.py diff --git a/smac/main/old/turbo.py b/smac/main/old/_turbo.py similarity index 100% rename from smac/main/old/turbo.py rename to smac/main/old/_turbo.py From cc657c8675c46e6e22dddc50d095499a13f36ae4 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:43:30 +0200 Subject: [PATCH 14/64] docs: add hooks --- docs/api_generator.py | 44 +++ docs/example_runner.py | 295 ++++++++++++++++++ docs/hooks/cleanup_log_output.py | 38 +++ .../debug_which_page_is_being_rendered.py | 34 ++ docs/hooks/disable_markdown_exec.py | 45 +++ 5 files changed, 456 insertions(+) create mode 100644 docs/api_generator.py create mode 100644 docs/example_runner.py create mode 100644 docs/hooks/cleanup_log_output.py create mode 100644 docs/hooks/debug_which_page_is_being_rendered.py create mode 100644 docs/hooks/disable_markdown_exec.py diff --git a/docs/api_generator.py b/docs/api_generator.py new file mode 100644 index 000000000..c6db813ec --- /dev/null +++ b/docs/api_generator.py @@ -0,0 +1,44 @@ +"""Generate the code reference pages and navigation. + +# https://mkdocstrings.github.io/recipes/ +""" +from __future__ import annotations + +import logging +from pathlib import Path + +import mkdocs_gen_files + +logger = logging.getLogger(__name__) + +source_path = "smac" + +# Modules whose members should not include inherited attributes or methods +# NOTE: Given the current setup, we can only operate at a module level. +# Ideally we specify options (at least at a module level) and we render +# them into strings using a yaml parser. 
For now this is fine though +NO_INHERITS = ("sklearn.evaluation",) +TAB = " " + +for path in sorted(Path(source_path).rglob("*.py")): + module_path = path.relative_to(source_path).with_suffix("") + doc_path = path.relative_to(source_path).with_suffix(".md") + full_doc_path = Path("api", doc_path) + + parts = tuple(module_path.parts) + + if parts[-1] in ("__main__", "__version__", "__init__"): + continue + + if any(part.startswith("_") for part in parts): + continue + + with mkdocs_gen_files.open(full_doc_path, "w") as fd: + ident = ".".join(parts) + fd.write(f"::: {ident}") + + if ident.endswith(NO_INHERITS): + fd.write(f"\n{TAB}options:") + fd.write(f"\n{TAB}{TAB}inherited_members: false") + + mkdocs_gen_files.set_edit_path(full_doc_path, path) \ No newline at end of file diff --git a/docs/example_runner.py b/docs/example_runner.py new file mode 100644 index 000000000..d0ba74e92 --- /dev/null +++ b/docs/example_runner.py @@ -0,0 +1,295 @@ +"""Generates the examples pages.""" +from __future__ import annotations + +import logging +import os +import textwrap +from dataclasses import dataclass +from itertools import takewhile +from pathlib import Path +from typing import Any +from typing_extensions import override + +import mkdocs_gen_files +from more_itertools import first_true, peekable + +logger = logging.getLogger("mkdocs") + +RUN_EXAMPLES_ENV_VAR = "SMAC_DOC_RENDER_EXAMPLES" + + +@dataclass +class CodeSegment: + lines: list[str] + session: str + exec: bool + + def code(self, code: list[str]) -> str: + points_start = first_true(code, pred=lambda _l: _l.startswith("# 1.")) + if points_start is not None: + points_start_index = code.index(points_start) + + points = [""] + points.extend([_l.lstrip("#")[1:] for _l in code[points_start_index:]]) + points.append("") + + body = code[:points_start_index] + else: + points = [] + body = code + + # Trim off any excess leading lines which only have whitespace + while body and body[0].strip() == "": + body.pop(0) + + hl_lines = [] + for i, line in enumerate(body): + _l = line.rstrip() + if "" in _l: + hl_lines.append(str(i + 1)) + _l = _l.replace("", "").rstrip() + + for strip in ["# !", "#!", "#"]: + if _l.endswith(strip): + _l = _l[: -len(strip)] + + body[i] = _l + + if any(hl_lines): + hl_lines = " ".join(hl_lines) + hl_string = f'hl_lines="{hl_lines}"' + else: + hl_string = "" + + # We generate two tabs if executing + if self.exec: + indented_body = "\n".join(f" {_l}" for _l in body) + + code_annotations = " ".join( + [ + "{", + ".python", + ".annotate", + hl_string, + "}", + ], + ) + tab1 = "\n".join( + [ + '=== "Code"', + "", + f" ``` {code_annotations}", + indented_body, + " ```", + *[f" {point}" for point in points], + "", + ], + ) + + run_annotations = " ".join( + [ + "{", + ".python", + f"session='{self.session}'", + 'exec="True"', + 'result="python"', + "}", + ], + ) + + tab2 = "\n".join( + [ + '=== "Run"', + "", + f" ``` {run_annotations}", + indented_body, + " ```", + ], + ) + + return "\n".join([tab1, "", tab2]) + + annotations = " ".join(["{", ".python", ".annotate", hl_string, "}"]) + top = f"```{annotations}" + bottom = "```" + + s = [top, *body, bottom, *points] + body = "\n".join(s) + return body + + @override + def __str__(self) -> str: + return self.code(self.lines) + + +@dataclass +class CommentSegment: + lines: list[str] + + @override + def __str__(self) -> str: + return "\n".join(self.lines) + + +@dataclass +class Example: + name: str + filepath: Path + description: str + segments: list[CodeSegment | CommentSegment] + + 
@classmethod + def should_execute(cls, *, name: str, runnable: bool) -> bool: + if not runnable: + return False + + env_var = os.environ.get(RUN_EXAMPLES_ENV_VAR, "all") + if env_var in ("false", "", "0", "no", "off"): + return False + + if env_var == "all": + return True + + examples_to_exec = [ + example.lstrip().rstrip() for example in env_var.lower().split(",") + ] + return name.lower() in examples_to_exec + + @classmethod + def header_flags(cls, line: str) -> dict[str, Any] | None: + prefix = "# Flags:" + if not line.startswith(prefix): + return None + + line = line[len(prefix) :] + flags = [line.strip() for line in line.split(",")] + + results = {} + + results["doc-runnable"] = any(flag.lower() == "doc-runnable" for flag in flags) + return results + + @classmethod + def from_file(cls, path: Path) -> Example: + with path.open() as f: + lines = f.readlines() + + lines = iter(lines) + + # First line is the name of the example to show + name = next(lines).strip().replace('"""', "") + potential_flag_line = next(lines) + flags = cls.header_flags(potential_flag_line) + if flags is None: + # Prepend the potential flag line back to the lines + lines = iter([potential_flag_line, *lines]) + flags = {} + + # Lines leading up to the second triple quote are the description + description = "".join(takewhile(lambda _l: not _l.startswith('"""'), lines)) + + segments: list[CodeSegment | CommentSegment] = [] + + # The rest is interspersed with triple quotes and code blocks + # We need to wrap the code blocks in triple backticks while + # removing the triple quotes for the comment blocks + remaining = peekable(lines) + while remaining.peek(None) is not None: + # If we encounter triple backticks we remove them and just add the lines + # in, up until the point we hit the next set of backticks + if remaining.peek().startswith('"""'): + # Skip the triple quotes + next(remaining) + ls = list(takewhile(lambda _l: not _l.startswith('"""'), remaining)) + comment_segment = CommentSegment([line.rstrip() for line in ls]) + segments.append(comment_segment) + + # Otherwise we wrap the line in triple backticks until we hit the next + # set of triple quotes + else: + ls = list(takewhile(lambda _l: not _l.startswith('"""'), remaining)) + code_segment = CodeSegment( + [line.rstrip() for line in ls], + session=name, + exec=cls.should_execute( + name=name, + runnable=flags.get("doc-runnable", False), + ), + ) + segments.append(code_segment) + + remaining.prepend('"""') # Stick back in so we can find it next itr + + return cls(name, path, description, segments) + + def header(self) -> str: + return f"# {self.name}" + + def description_header(self) -> str: + return "\n".join( + [ + "## Description", + self.description, + ], + ) + + def generate_doc(self) -> str: + return "\n".join( + [ + self.header(), + self.copy_section(), + self.description_header(), + *map(str, self.segments), + ], + ) + + def copy_section(self) -> str: + body = "\n".join( + [ + "```python", + *[ + "\n".join(segment.lines) + for segment in self.segments + if isinstance(segment, CodeSegment) + ], + "```", + ], + ) + indented_body = textwrap.indent(body, " " * 4) + header = ( + f'??? quote "Expand to copy' + f' `{self.filepath}` :material-content-copy: (top right)"' + ) + return "\n".join( + [ + header, + "", + indented_body, + "", + ], + ) + + +if os.environ.get(RUN_EXAMPLES_ENV_VAR, "all") in ("false", "", "0", "no", "off"): + logger.warning( + f"Env variable {RUN_EXAMPLES_ENV_VAR} not set - not running examples." 
+ " Use `just docs-full` to run and render examples.", + ) + +for path in sorted(Path("examples").rglob("*.py")): + module_path = path.relative_to("examples").with_suffix("") + doc_path = path.relative_to("examples").with_suffix(".md") + full_doc_path = Path("examples", doc_path) + + parts = tuple(module_path.parts) + filename = parts[-1] + + if filename.startswith("_"): + continue + + example = Example.from_file(path) + with mkdocs_gen_files.open(full_doc_path, "w") as f: + f.write(example.generate_doc()) + + toc_name = example.name + mkdocs_gen_files.set_edit_path(full_doc_path, path) \ No newline at end of file diff --git a/docs/hooks/cleanup_log_output.py b/docs/hooks/cleanup_log_output.py new file mode 100644 index 000000000..2ce95ce72 --- /dev/null +++ b/docs/hooks/cleanup_log_output.py @@ -0,0 +1,38 @@ +"""The module is a hook which disables warnings and log messages which pollute the +doc build output. + +One possible downside is if one of these modules ends up giving an actual +error, such as OpenML failing to retrieve a dataset. I tried to make sure ERROR +log message are still allowed through. +""" +import logging +import warnings +from typing import Any + +import mkdocs +import mkdocs.plugins +import mkdocs.structure.pages + +log = logging.getLogger("mkdocs") + + +@mkdocs.plugins.event_priority(-50) +def on_startup(**kwargs: Any): + # We get a load of deprecation warnings from SMAC + warnings.filterwarnings("ignore", category=DeprecationWarning) + + # ConvergenceWarning from sklearn + warnings.filterwarnings("ignore", module="sklearn") + + +def on_pre_page( + page: mkdocs.structure.pages.Page, + config: Any, + files: Any, +) -> mkdocs.structure.pages.Page | None: + # NOTE: mkdocs says they're always normalized to be '/' seperated + # which means this should work on windows as well. + + logging.getLogger("smac").setLevel(logging.ERROR) + logging.getLogger("openml").setLevel(logging.ERROR) + return page \ No newline at end of file diff --git a/docs/hooks/debug_which_page_is_being_rendered.py b/docs/hooks/debug_which_page_is_being_rendered.py new file mode 100644 index 000000000..9c4aec84a --- /dev/null +++ b/docs/hooks/debug_which_page_is_being_rendered.py @@ -0,0 +1,34 @@ +"""This module is a hook that when any code is being rendered, it will +print the path to the file being rendered. + +This makes it easier to identify which file is being rendered when an error happens. +""" +from __future__ import annotations + +import logging +import os +from typing import TYPE_CHECKING, Any + +import mkdocs +import mkdocs.plugins + +if TYPE_CHECKING: + import mkdocs.structure.pages + +log = logging.getLogger("mkdocs") + +RENDER_EXAMPLES_ENV_VAR = "SMAC_DOC_RENDER_EXAMPLES" +EXEC_DOCS_ENV_VAR = "SMAC_EXEC_DOCS" + +truthy_values = {"yes", "on", "true", "1", "all"} + + +def on_pre_page( + page: mkdocs.structure.pages.Page, + config: Any, + files: Any, +) -> mkdocs.structure.pages.Page | None: + render_examples = os.environ.get(RENDER_EXAMPLES_ENV_VAR, "true") + render_code = os.environ.get(EXEC_DOCS_ENV_VAR, "true") + if render_examples.lower() in truthy_values or render_code.lower() in truthy_values: + log.info(f"{page.file.src_path}") diff --git a/docs/hooks/disable_markdown_exec.py b/docs/hooks/disable_markdown_exec.py new file mode 100644 index 000000000..ee6ae10bd --- /dev/null +++ b/docs/hooks/disable_markdown_exec.py @@ -0,0 +1,45 @@ +"""This disable markdown_exec based on an environment variable. +This speeds up the build of the docs for faster iteration. 
+ +This is done by overwriting the module responsible for compiling and executing the code +by overriding the `exec(...)` global variable that is used to run the code. +We hijack it and print a helpful message about how to run the code cell instead. + +https://github.com/pawamoy/markdown-exec/blob/adff40b2928dbb2d22f27684e085f02d39a07291/src/markdown_exec/formatters/python.py#L42-L70 +""" +from __future__ import annotations + +import logging +import os +from typing import Any + +import mkdocs +import mkdocs.plugins +import mkdocs.structure.pages + +RUN_CODE_BLOCKS_ENV_VAR = "SMAC_EXEC_DOCS" + +logger = logging.getLogger("mkdocs") + + +def _print_msg(compiled_code: Any, code_block_id: int, exec_globals: dict) -> None: + _print = exec_globals["print"] + _print( + f"Env variable {RUN_CODE_BLOCKS_ENV_VAR}=0 - No code to display." + "\nUse `just docs-code` (or `just docs-full` for examples) to run" + " the code block and display output." + ) + +truthy_values = {"yes", "on", "true", "1"} + +@mkdocs.plugins.event_priority(100) +def on_startup(**kwargs: Any): + run_code_blocks = os.environ.get(RUN_CODE_BLOCKS_ENV_VAR, "true") + if run_code_blocks.lower() not in truthy_values: + logger.warning( + f"Disabling markdown-exec due to {RUN_CODE_BLOCKS_ENV_VAR}={run_code_blocks}" + "\n.Use `just docs-full` to run and render examples.", + ) + from markdown_exec.formatters import python + + setattr(python, "exec_python", _print_msg) From 73da94520ec9c483c6fe5b69039f93dba19917e2 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 15:43:49 +0200 Subject: [PATCH 15/64] docs: add mkdocs config --- docs/stylesheets/custom.css | 32 ++++++ mkdocs.yaml | 214 ++++++++++++++++++++++++++++++++++++ 2 files changed, 246 insertions(+) create mode 100644 docs/stylesheets/custom.css create mode 100644 mkdocs.yaml diff --git a/docs/stylesheets/custom.css b/docs/stylesheets/custom.css new file mode 100644 index 000000000..ae7ed5498 --- /dev/null +++ b/docs/stylesheets/custom.css @@ -0,0 +1,32 @@ +:root { + --md-primary-fg-color: #00b2ff; + --md-accent-fg-color: #FF00B2; +} + +/* Change the color of the slider track */ +input[type="range"]::-webkit-slider-runnable-track { + background: #00b2ff; /* Change this to your desired color */ +} + +input[type="range"]::-moz-range-track { + background: #00b2ff; /* Change this to your desired color */ +} + +input[type="range"]::-ms-track { + background: #00b2ff; /* Change this to your desired color */ + border-color: transparent; + color: transparent; +} + +/* Change the color of the slider thumb */ +input[type="range"]::-webkit-slider-thumb { + background: #00b2ff; /* Change this to your desired color */ +} + +input[type="range"]::-moz-range-thumb { + background: #00b2ff; /* Change this to your desired color */ +} + +input[type="range"]::-ms-thumb { + background: #00b2ff; /* Change this to your desired color */ +} \ No newline at end of file diff --git a/mkdocs.yaml b/mkdocs.yaml new file mode 100644 index 000000000..5f011098c --- /dev/null +++ b/mkdocs.yaml @@ -0,0 +1,214 @@ +# This project uses mkdocs to generate the documentation. 
+# Specifically it uses the mkdocs-material theme, which provides a whole +# host of nice features and customization +# +# mkdocs: https://www.mkdocs.org/getting-started/#getting-started-with-mkdocs +# mkdocs-material: https://squidfunk.github.io/mkdocs-material/ +# +# Please refer to these links for more information on how to use mkdocs +# +# For serving the docs locally, you can take a look at the `justfile` at +# the root of this repository, it contains a few commands for generating the docs +# with different levels of execution. +# +# Please refer to individual sections for any additional notes +site_name: "SMAC3" +repo_url: https://github.com/automl/SMAC3/ +repo_name: automl/SMAC3 + +theme: + name: material + logo: images/logo.png + favicon: images/logo.png + icon: + repo: fontawesome/brands/github + features: + - content.code.annotate + - content.code.copy + - navigation.footer + - navigation.sections + - toc.follow + - toc.integrate + - navigation.tabs + - navigation.tabs.sticky + - header.autohide + - search.suggest + - search.highlight + - search.share + font: + text: Roboto + code: Roboto Mono + palette: + - scheme: slate + media: "(prefers-color-scheme: dark)" + primary: custom + accent: custom + toggle: + icon: material/eye-outline + name: Switch to light mode + + # Palette toggle for light mode + - scheme: default + media: "(prefers-color-scheme: light)" + primary: custom + accent: custom + toggle: + icon: material/eye + name: Switch to dark mode + + +# The `mike` versioning provider +# https://github.com/jimporter/mike +# +# This is what allows us to create versioned docs in the github cli +extra: + version: + provider: mike + social: + - icon: fontawesome/brands/github + link: https://github.com/automl + - icon: fontawesome/brands/twitter + link: https://twitter.com/automl_org + +# We do have some extra custom css +# If for whatever reason you think this is breaking something, +# please feel free to remove it. +extra_css: + - stylesheets/custom.css + +watch: + - smac + - docs + - examples + - CONTRIBUTING.md + +markdown_extensions: + - admonition + - tables + - attr_list + - md_in_html + - toc: + permalink: "#" + - pymdownx.highlight: + anchor_linenums: true + - pymdownx.magiclink: + hide_protocol: true + repo_url_shortener: true + repo_url_shorthand: true + user: automl + repo: SMAC3 + - pymdownx.highlight + - pymdownx.inlinehilite + - pymdownx.snippets + - pymdownx.details + - pymdownx.tabbed: + alternate_style: true + - pymdownx.superfences: + custom_fences: + - name: mermaid + class: mermaid + format: !!python/name:pymdownx.superfences.fence_code_format + - pymdownx.emoji: + emoji_index: !!python/name:material.extensions.emoji.twemoji + emoji_generator: !!python/name:material.extensions.emoji.to_svg + +# These are files that are run when serving the docs. +hooks: + # This prevents logging messages from polluting the doc build + - docs/hooks/cleanup_log_output.py + # This prevents markdown_exec (plugin) from executing code blocks + # dependant on environment variables. 
These env variables are + # automatically set with the `justfile` commands to build docs + - docs/hooks/disable_markdown_exec.py + # This hook simply prints the page being rendered for an easier time debugging + # any issues with code in docs + - docs/hooks/debug_which_page_is_being_rendered.py + +plugins: + - search + - autorefs + - glightbox + - offline: + enabled: !ENV [SMAC_DOCS_OFFLINE, false] + - markdown-exec + - mike: + version_selector: true + css_dir: css + javascript_dir: js + canonical_version: latest + - gen-files: + scripts: + - docs/api_generator.py + - docs/example_runner.py + - literate-nav: + nav_file: SUMMARY.md + # - mkdocs-glossary-plugin: + # glossary_dirs: ["docs/glossary"] # This plugin considers the md files in "docs/glossary" as glossary files. + - mkdocstrings: + default_handler: python + enable_inventory: true + handlers: + python: + paths: [smac] + # Extra objects which allow for linking to external docs + import: + - 'https://docs.python.org/3/objects.inv' + - 'https://numpy.org/doc/stable/objects.inv' + - 'https://pandas.pydata.org/docs/objects.inv' + - 'https://optuna.readthedocs.io/en/stable/objects.inv' + - 'https://scikit-learn.org/stable/objects.inv' + - 'https://pytorch.org/docs/stable/objects.inv' + - 'https://jobqueue.dask.org/en/latest/objects.inv' + # Please do not try to change these without having + # looked at all of the documentation and seeing if it + # causes the API docs to look weird anywhere. + options: # https://mkdocstrings.github.io/python/usage/ + docstring_section_style: spacy + docstring_options: + ignore_init_summary: true + trim_doctest_flags: true + returns_multiple_items: false + show_docstring_attributes: true + show_docstring_description: true + show_root_heading: true + show_root_toc_entry: true + show_object_full_path: false + show_root_members_full_path: false + signature_crossrefs: true + merge_init_into_class: true + show_symbol_type_heading: true + show_symbol_type_toc: true + docstring_style: google + inherited_members: true + show_if_no_docstring: false + show_bases: true + show_source: true + members_order: "alphabetical" + group_by_category: true + show_signature: true + separate_signature: true + show_signature_annotations: true + filters: + - "!^_[^_]" + +nav: + - Home: "index.md" + - Installation: "installation.md" + # Auto generated with docs/examples_runner.py + - Examples: "examples/" + # Auto generated with docs/api_generator.py + - API: "api/" + # - Guides: + # - "guides/first-steps.md" + # - "guides/hydra.md" + # - "guides/database.md" + # - "guides/containers.md" + # - "guides/large-scale-benchmarking.md" + # - "guides/using-carps.md" + # - Commands: "commands.md" + # - Contributing: + # - "contributing/index.md" + # - "contributing/contributing-a-benchmark.md" + # - "contributing/contributing-an-optimizer.md" + # - What's New?: "changelog.md" + From 1a97979bfd8ca28efcdb541cd2f6b59426847aa7 Mon Sep 17 00:00:00 2001 From: benjamc Date: Wed, 23 Oct 2024 16:32:29 +0200 Subject: [PATCH 16/64] docs(components): rst -> md API links not yet working --- .../{1_components.rst => 1_components.md} | 197 ++++++++---------- 1 file changed, 86 insertions(+), 111 deletions(-) rename docs/advanced_usage/{1_components.rst => 1_components.md} (72%) diff --git a/docs/advanced_usage/1_components.rst b/docs/advanced_usage/1_components.md similarity index 72% rename from docs/advanced_usage/1_components.rst rename to docs/advanced_usage/1_components.md index 9b3235e7e..e23b0f40e 100644 --- a/docs/advanced_usage/1_components.rst 
+++ b/docs/advanced_usage/1_components.md @@ -1,5 +1,4 @@ -Components -========== +# Components In addition to the basic components mentioned in :ref:`Getting Started`, all other components are explained in the following paragraphs to give a better picture of SMAC. These components are all used to guide @@ -15,8 +14,7 @@ occupied, SMAC will wait until a new worker is available again. Moreover, limita trials are checked in every iteration. -:ref:`Surrogate Model` ---------------------------------------------------- +## [Surrogate Model][smac.facade.abstract_facade] The surrogate model is used to approximate the objective function of configurations. In previous versions, the model was referred to as the Empirical Performance Model (EPM). Mostly, Bayesian optimization is used/associated with Gaussian @@ -32,42 +30,38 @@ If you are using instances, it is recommended to use instance features. The mode associated with its features. Imagine you have two hyperparameters, two instances and no instance features, the model would be trained on: -.. csv-table:: - :header: "HP 1", "HP 2", "Objective Value" - - "0.1", "0.8", "0.5" - "0.1", "0.8", "0.75" - "505", "7", "2.4" - "505", "7", "1.3" +| HP 1 | HP 2 | Objective Value | +|-------|-------|-----------------| +| 0.1 | 0.8 | 0.5 | +| 0.1 | 0.8 | 0.75 | +| 505 | 7 | 2.4 | +| 505 | 7 | 1.3 | You can see that the same inputs lead to different objective values because of two instances. If you associate each instance with a feature, you would end-up with the following data points: -.. csv-table:: - :header: "HP 1", "HP 2", "Instance Feature", "Objective Value" - - "0.1", "0.8", "0", "0.5" - "0.1", "0.8", "1", "0.75" - "505", "7", "0", "2.4" - "505", "7", "1", "1.3" +| HP 1 | HP 2 | Instance Feature | Objective Value | +|-------|-------|------------------|-----------------| +| 0.1 | 0.8 | 0 | 0.5 | +| 0.1 | 0.8 | 1 | 0.75 | +| 505 | 7 | 0 | 2.4 | +| 505 | 7 | 1 | 1.3 | The steps to receiving data are as follows: -#. The intensifier requests new configurations via ``next(self.config_generator)``. -#. The config selector collects the data via the runhistory encoder which iterates over the runhistory trials. -#. The runhistory encoder only collects trials which are in ``considered_states`` and timeout trials. Also, only the +* The intensifier requests new configurations via ``next(self.config_generator)``. +* The config selector collects the data via the runhistory encoder which iterates over the runhistory trials. +* The runhistory encoder only collects trials which are in ``considered_states`` and timeout trials. Also, only the highest budget is considered if budgets are used. In this step, multi-objective values are scalarized using the ``normalize_costs`` function (uses ``objective_bounds`` from the runhistory) and the multi-objective algorithm. For example, when ParEGO is used, the scalarization would be different in each training. -#. The selected trial objectives are transformed (e.g., log-transformed, depending on the selected +* The selected trial objectives are transformed (e.g., log-transformed, depending on the selected encoder). -#. The hyperparameters might still have inactive values. The model takes care of that after the collected data +* The hyperparameters might still have inactive values. The model takes care of that after the collected data are passed to the model. 
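To make the instance-feature tables above a little more concrete: a minimal sketch, assuming made-up instance names and the feature values from the table, of how (config, instance) pairs could be stacked into the data the surrogate model is trained on. This mirrors the tables only; it is not SMAC's runhistory encoder.

```python
# Hedged sketch: identical configurations become distinguishable rows once the
# instance feature column is appended. All values below come from the tables above
# or are illustrative.
import numpy as np

instance_features = {"instance-0": [0.0], "instance-1": [1.0]}

# (hyperparameters, instance, observed cost) for the four trials shown above
trials = [
    ({"hp1": 0.1, "hp2": 0.8}, "instance-0", 0.5),
    ({"hp1": 0.1, "hp2": 0.8}, "instance-1", 0.75),
    ({"hp1": 505.0, "hp2": 7.0}, "instance-0", 2.4),
    ({"hp1": 505.0, "hp2": 7.0}, "instance-1", 1.3),
]

# Each row: hyperparameter values plus the feature(s) of the instance it ran on
X = np.array([[cfg["hp1"], cfg["hp2"], *instance_features[inst]] for cfg, inst, _ in trials])
y = np.array([cost for *_, cost in trials])

print(X)  # shape (4, 3): hyperparameters + instance feature column
print(y)  # one objective value per (config, instance) pair
```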
- -:ref:`Acquisition Function` ------------------------------------------------------- +## [Acquisition Function][smac.acquisition.function] Acquisition functions are mathematical techniques that guide how the parameter space should be explored during Bayesian optimization. They use the predicted mean and predicted variance generated by the surrogate model. @@ -77,32 +71,28 @@ a bunch of different acquisition functions (Lower Confidence Bound, Expected Imp Thompson, integrated acquisition functions and prior acquisition functions). We refer to literature for more information about acquisition functions. -.. note :: - +!!! note The acquisition function calculates the acquisition value for each configuration. However, the configurations are provided by the acquisition maximizer. Therefore, the acquisition maximizer is responsible for receiving the next configurations. -:ref:`Acquisition Maximizer` -------------------------------------------------------- +## [Acquisition Maximize][smac.acquisition.maximizer] The acquisition maximizer is a wrapper for the acquisition function. It returns the next configurations. SMAC supports local search, (sorted) random search, local and (sorted) random search, and differential evolution. While local search checks neighbours of the best configurations, random search makes sure to explore the configuration space. When using sorted random search, random configurations are sorted by the value of the acquisition function. -.. warning :: - +!!! warning Pay attention to the number of challengers: If you experience RAM issues or long computational times in the acquisition function, you might lower the number of challengers. -The acquisition maximizer also incorporates the `Random Design`_. Please see the -:ref:`ChallengerList` for more information. +The acquisition maximizer also incorporates the [Random Design][#Random Design]. Please see the +[ChallengerList][smac.acquisition.maximizer.helpers] for more information. -:ref:`Initial Design` ------------------------------------------- +## [Initial Design][smac.initial_design] The surrogate model needs data to be trained. Therefore, the initial design is used to generate the initial data points. We provide random, latin hypercube, sobol, factorial and default initial designs. The default initial design uses @@ -114,10 +104,8 @@ a multidimensional distribution. The initial design configurations are yielded by the config selector first. Moreover, the config selector keeps track of which configurations already have been returned to make sure a configuration is not returned twice. -.. _Random Design: -:ref:`Random Design` ------------------------------------------- +## [Random Design][smac.random_design] The random design is used in the acquisition maximizer to tell whether the next configuration should be random or sampled from the acquisition function. For example, if we use a random design with a probability of @@ -129,15 +117,13 @@ are *guaranteed* to find the best configuration over time. In addition to simple probability random design, we also provide annealing and modulus random design. -:ref:`Intensifier` ------------------------------------- +## [Intensifier][smac.intensifier] The intensifier compares different configurations based on evaluated :term:`trial` so far. It decides which configuration should be `intensified` or, in other words, if a configuration is worth to spend more time on (e.g., evaluate another seed pair, evaluate on another instance, or evaluate on a higher budget). -.. warning :: - +!!! 
warning Always pay attention to ``max_config_calls`` or ``n_seeds``: If this argument is set high, the intensifier might spend a lot of time on a single configuration. @@ -157,8 +143,7 @@ All intensifiers support multi-objective, multi-fidelity, and multi-threading: repeated as often as needed. Intensifier are not required to receive results as the results are directly taken from the runhistory. -.. note :: - +!!! note All intensifiers are working on the runhistory and recognize previous logged trials (e.g., if the user already evaluated something beforehand). Previous configurations (in the best case, also complete trials) are added to the queue/tracker again so that they are integrated into the intensification process. @@ -166,8 +151,7 @@ All intensifiers support multi-objective, multi-fidelity, and multi-threading: That means continuing a run as well as incorporating user inputs are natively supported. -:ref:`Configuration Selector` ----------------------------------------------------------- +## [Configuration Selector][smac.main.config_selector] The configuration selector uses the initial design, surrogate model, acquisition maximizer/function, runhistory, runhistory encoder, and random design to select the next configuration. The configuration selector is directly @@ -175,113 +159,104 @@ used by the intensifier and is called everytime a new configuration is requested The idea behind the configuration selector is straight forward: -#. Yield the initial design configurations. -#. Train the surrogate model with the data from the runhistory encoder. -#. Get the next ``retrain_after`` configurations from the acquisition function/maximizer and yield them. -#. After all ``retrain_after`` configurations were yield, go back to step 2. - -.. note :: +* Yield the initial design configurations. +* Train the surrogate model with the data from the runhistory encoder. +* Get the next ``retrain_after`` configurations from the acquisition function/maximizer and yield them. +* After all ``retrain_after`` configurations were yield, go back to step 2. +!!! note The configuration selector is a generator and yields configurations. Therefore, the current state of the selector is saved and when the intensifier calls ``next``, the selector continues there where it stopped. -.. note :: - +!!! note Everytime the surrogate model is trained, the multi-objective algorithm is updated via ``update_on_iteration_start``. -:ref:`Multi-Objective Algorithm` --------------------------------------------------------- +## [Multi-Objective Algorithm][smac.multi_objective] The multi-objective algorithm is used to scalarize multi-objective values. The multi-objective algorithm gets normalized objective values passed and returns a single value. The resulting value (called by the runhistory encoder) is then used to train the surrogate model. -.. warning :: - +!!! warning Depending on the multi-objective algorithm, the values for the runhistory encoder might differ each time the surrogate model is trained. Let's take ParEGO for example: Everytime a new configuration is sampled (see ConfigSelector), the objective weights are updated. Therefore, the scalarized values are different and the acquisition maximizer might return completely different configurations. -:ref:`RunHistory` ---------------------------------------------- +## [RunHistory][smac.runhistory.runhistory] The runhistory holds all (un-)evaluated trials of the optimization run. 
You can use the runhistory to get (running) configs, (running) trials, trials of a specific config, and more. The runhistory encoder iterates over the runhistory to receive data for the surrogate model. The following code shows how to iterate over the runhistory: -.. code-block:: python - - smac = HPOFacade(...) - - # Iterate over all trials - for trial_info, trial_value in smac.runhistory.items(): - # Trial info - config = trial_info.config - instance = trial_info.instance - budget = trial_info.budget - seed = trial_info.seed - - # Trial value - cost = trial_value.cost - time = trial_value.time - status = trial_value.status - starttime = trial_value.starttime - endtime = trial_value.endtime - additional_info = trial_value.additional_info - - # Iterate over all configs - for config in smac.runhistory.get_configs(): - # Get the cost of all trials of this config - average_cost = smac.runhistory.average_cost(config) - -.. warning :: - +```python +smac = HPOFacade(...) + +# Iterate over all trials +for trial_info, trial_value in smac.runhistory.items(): + # Trial info + config = trial_info.config + instance = trial_info.instance + budget = trial_info.budget + seed = trial_info.seed + + # Trial value + cost = trial_value.cost + time = trial_value.time + status = trial_value.status + starttime = trial_value.starttime + endtime = trial_value.endtime + additional_info = trial_value.additional_info + +# Iterate over all configs +for config in smac.runhistory.get_configs(): + # Get the cost of all trials of this config + average_cost = smac.runhistory.average_cost(config) +``` + +!!! warning The intensifier uses a callback to update the incumbent everytime a new trial is added to the runhistory. -:ref:`RunHistory Encoder` --------------------------------------------------- - +## [RunHistory Encoder][smac.runhistory.encoder] The runhistory encoder is used to encode the runhistory data into a format that can be used by the surrogate model. Only trials with the status ``considered_states`` and timeout trials are considered. Multi-objective values are scalarized using the ``normalize_costs`` function (uses ``objective_bounds`` from the runhistory). Afterwards, the normalized value is processed by the multi-objective algorithm. -:ref:`Callback` ------------------------------- +## [Callback][smac.callback] Callbacks provide the ability to easily execute code before, inside, and after the Bayesian optimization loop. To add a callback, you have to inherit from ``smac.Callback`` and overwrite the methods (if needed). Afterwards, you can pass the callbacks to any facade. -.. code-block:: python - - from smac import MultiFidelityFacade, Callback +```python +from smac import MultiFidelityFacade, Callback - class CustomCallback(Callback): - def on_start(self, smbo: SMBO) -> None: - pass +class CustomCallback(Callback): + def on_start(self, smbo: SMBO) -> None: + pass - def on_end(self, smbo: SMBO) -> None: - pass + def on_end(self, smbo: SMBO) -> None: + pass - def on_iteration_start(self, smbo: SMBO) -> None: - pass + def on_iteration_start(self, smbo: SMBO) -> None: + pass - def on_iteration_end(self, smbo: SMBO, info: RunInfo, value: RunValue) -> bool | None: - # We just do a simple printing here - print(info, value) + def on_iteration_end(self, smbo: SMBO, info: RunInfo, value: RunValue) -> bool | None: + # We just do a simple printing here + print(info, value) - smac = MultiFidelityFacade( - ... 
- callbacks=[CustomCallback()] - ) - smac.optimize() \ No newline at end of file +smac = MultiFidelityFacade( + ... + callbacks=[CustomCallback()] +) +smac.optimize() +``` \ No newline at end of file From c0eccb6c384565ba6a876fe15d509fa2086464cd Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 10:21:29 +0200 Subject: [PATCH 17/64] refactor(acquisition_maximizer): fix typo --- smac/acquisition/maximizer/__init__.py | 2 +- ...usition_maximizer.py => abstract_acquisition_maximizer.py} | 0 smac/acquisition/maximizer/local_and_random_search.py | 4 ++-- smac/acquisition/maximizer/local_search.py | 2 +- smac/acquisition/maximizer/random_search.py | 2 +- 5 files changed, 5 insertions(+), 5 deletions(-) rename smac/acquisition/maximizer/{abstract_acqusition_maximizer.py => abstract_acquisition_maximizer.py} (100%) diff --git a/smac/acquisition/maximizer/__init__.py b/smac/acquisition/maximizer/__init__.py index 5e3756190..b97bd2bbb 100644 --- a/smac/acquisition/maximizer/__init__.py +++ b/smac/acquisition/maximizer/__init__.py @@ -1,4 +1,4 @@ -from smac.acquisition.maximizer.abstract_acqusition_maximizer import ( +from smac.acquisition.maximizer.abstract_acquisition_maximizer import ( AbstractAcquisitionMaximizer, ) from smac.acquisition.maximizer.differential_evolution import DifferentialEvolution diff --git a/smac/acquisition/maximizer/abstract_acqusition_maximizer.py b/smac/acquisition/maximizer/abstract_acquisition_maximizer.py similarity index 100% rename from smac/acquisition/maximizer/abstract_acqusition_maximizer.py rename to smac/acquisition/maximizer/abstract_acquisition_maximizer.py diff --git a/smac/acquisition/maximizer/local_and_random_search.py b/smac/acquisition/maximizer/local_and_random_search.py index 71c7f86c4..a409996d3 100644 --- a/smac/acquisition/maximizer/local_and_random_search.py +++ b/smac/acquisition/maximizer/local_and_random_search.py @@ -5,7 +5,7 @@ from ConfigSpace import Configuration, ConfigurationSpace from smac.acquisition.function import AbstractAcquisitionFunction -from smac.acquisition.maximizer.abstract_acqusition_maximizer import ( +from smac.acquisition.maximizer.abstract_acquisition_maximizer import ( AbstractAcquisitionMaximizer, ) from smac.acquisition.maximizer.local_search import LocalSearch @@ -184,6 +184,6 @@ def _maximize( next_configs_by_acq_value.sort(reverse=True, key=lambda x: x[0]) first_five = [f"{_[0]} ({_[1].origin})" for _ in next_configs_by_acq_value[:5]] - logger.debug(f"First 5 acquisition function values of selected configurations:\n{', '.join(first_five)}") + logger.debug(f"First 5 acquisition function values of selected configurations: \n{', '.join(first_five)}") return next_configs_by_acq_value diff --git a/smac/acquisition/maximizer/local_search.py b/smac/acquisition/maximizer/local_search.py index ff0d4b373..bc26d1131 100644 --- a/smac/acquisition/maximizer/local_search.py +++ b/smac/acquisition/maximizer/local_search.py @@ -10,7 +10,7 @@ from ConfigSpace.exceptions import ForbiddenValueError from smac.acquisition.function import AbstractAcquisitionFunction -from smac.acquisition.maximizer.abstract_acqusition_maximizer import ( +from smac.acquisition.maximizer.abstract_acquisition_maximizer import ( AbstractAcquisitionMaximizer, ) from smac.utils.configspace import ( diff --git a/smac/acquisition/maximizer/random_search.py b/smac/acquisition/maximizer/random_search.py index c5f87fda0..35115a9e4 100644 --- a/smac/acquisition/maximizer/random_search.py +++ b/smac/acquisition/maximizer/random_search.py @@ -2,7 +2,7 @@ 
from ConfigSpace import Configuration -from smac.acquisition.maximizer.abstract_acqusition_maximizer import ( +from smac.acquisition.maximizer.abstract_acquisition_maximizer import ( AbstractAcquisitionMaximizer, ) from smac.utils.logging import get_logger From 6cbcaec1d4aed3e5ce70b26b0a31927f0b1c2d98 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 10:24:31 +0200 Subject: [PATCH 18/64] docs(component): fix links --- docs/advanced_usage/1_components.md | 28 ++++++++++++++-------------- 1 file changed, 14 insertions(+), 14 deletions(-) diff --git a/docs/advanced_usage/1_components.md b/docs/advanced_usage/1_components.md index e23b0f40e..9cec83490 100644 --- a/docs/advanced_usage/1_components.md +++ b/docs/advanced_usage/1_components.md @@ -14,7 +14,7 @@ occupied, SMAC will wait until a new worker is available again. Moreover, limita trials are checked in every iteration. -## [Surrogate Model][smac.facade.abstract_facade] +## [Surrogate Model](../../api/smac/facade/abstract_facade) The surrogate model is used to approximate the objective function of configurations. In previous versions, the model was referred to as the Empirical Performance Model (EPM). Mostly, Bayesian optimization is used/associated with Gaussian @@ -61,7 +61,7 @@ The steps to receiving data are as follows: * The hyperparameters might still have inactive values. The model takes care of that after the collected data are passed to the model. -## [Acquisition Function][smac.acquisition.function] +## [Acquisition Function](../../api/smac/acquisition/function/abstract_acquisition_function) Acquisition functions are mathematical techniques that guide how the parameter space should be explored during Bayesian optimization. They use the predicted mean and predicted variance generated by the surrogate model. @@ -77,7 +77,7 @@ for more information about acquisition functions. the next configurations. -## [Acquisition Maximize][smac.acquisition.maximizer] +## [Acquisition Maximize](../../api/smac/acquisition/maximizer/abstract_acquisition_maximizer) The acquisition maximizer is a wrapper for the acquisition function. It returns the next configurations. SMAC supports local search, (sorted) random search, local and (sorted) random search, and differential evolution. @@ -88,11 +88,11 @@ space. When using sorted random search, random configurations are sorted by the Pay attention to the number of challengers: If you experience RAM issues or long computational times in the acquisition function, you might lower the number of challengers. -The acquisition maximizer also incorporates the [Random Design][#Random Design]. Please see the -[ChallengerList][smac.acquisition.maximizer.helpers] for more information. +The acquisition maximizer also incorporates the [Random Design][random-design]. Please see the +[ChallengerList](../../api/smac/acquisition/maximizer/helpers) for more information. -## [Initial Design][smac.initial_design] +## [Initial Design](../../api/smac/initial_design/abstract_initial_design) The surrogate model needs data to be trained. Therefore, the initial design is used to generate the initial data points. We provide random, latin hypercube, sobol, factorial and default initial designs. The default initial design uses @@ -104,8 +104,8 @@ a multidimensional distribution. The initial design configurations are yielded by the config selector first. Moreover, the config selector keeps track of which configurations already have been returned to make sure a configuration is not returned twice. 
- -## [Random Design][smac.random_design] +[](){#random-design} +## [Random Design](../../api/smac/initial_design/random_design) The random design is used in the acquisition maximizer to tell whether the next configuration should be random or sampled from the acquisition function. For example, if we use a random design with a probability of @@ -117,7 +117,7 @@ are *guaranteed* to find the best configuration over time. In addition to simple probability random design, we also provide annealing and modulus random design. -## [Intensifier][smac.intensifier] +## [Intensifier](../../api/smac/intensifier/abstract_intensifier) The intensifier compares different configurations based on evaluated :term:`trial` so far. It decides which configuration should be `intensified` or, in other words, if a configuration is worth to spend more time on (e.g., @@ -151,7 +151,7 @@ All intensifiers support multi-objective, multi-fidelity, and multi-threading: That means continuing a run as well as incorporating user inputs are natively supported. -## [Configuration Selector][smac.main.config_selector] +## [Configuration Selector](../../api/smac/main/config_selector) The configuration selector uses the initial design, surrogate model, acquisition maximizer/function, runhistory, runhistory encoder, and random design to select the next configuration. The configuration selector is directly @@ -173,7 +173,7 @@ The idea behind the configuration selector is straight forward: ``update_on_iteration_start``. -## [Multi-Objective Algorithm][smac.multi_objective] +## [Multi-Objective Algorithm](../../api/smac/multi_objective/abstract_multi_objective_algorithm) The multi-objective algorithm is used to scalarize multi-objective values. The multi-objective algorithm gets normalized objective values passed and returns a single value. The resulting value (called by the @@ -186,7 +186,7 @@ runhistory encoder) is then used to train the surrogate model. the scalarized values are different and the acquisition maximizer might return completely different configurations. -## [RunHistory][smac.runhistory.runhistory] +## [RunHistory](../../api/smac/runhistory/runhistory) The runhistory holds all (un-)evaluated trials of the optimization run. You can use the runhistory to get (running) configs, (running) trials, trials of a specific config, and more. @@ -222,14 +222,14 @@ for config in smac.runhistory.get_configs(): The intensifier uses a callback to update the incumbent everytime a new trial is added to the runhistory. -## [RunHistory Encoder][smac.runhistory.encoder] +## [RunHistory Encoder](../../api/smac/runhistory/encoder/abstract_encoder) The runhistory encoder is used to encode the runhistory data into a format that can be used by the surrogate model. Only trials with the status ``considered_states`` and timeout trials are considered. Multi-objective values are scalarized using the ``normalize_costs`` function (uses ``objective_bounds`` from the runhistory). Afterwards, the normalized value is processed by the multi-objective algorithm. -## [Callback][smac.callback] +## [Callback](../../api/smac/callback) Callbacks provide the ability to easily execute code before, inside, and after the Bayesian optimization loop. To add a callback, you have to inherit from ``smac.Callback`` and overwrite the methods (if needed). 
From 9b963ddd14d55ab10e282786ff38cb534d8d9825 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 10:45:45 +0200 Subject: [PATCH 19/64] docs(2_multi_fidelity): rst -> md --- .../{2_multi_fidelity.rst => 2_multi_fidelity.md} | 11 +++++------ 1 file changed, 5 insertions(+), 6 deletions(-) rename docs/advanced_usage/{2_multi_fidelity.rst => 2_multi_fidelity.md} (66%) diff --git a/docs/advanced_usage/2_multi_fidelity.rst b/docs/advanced_usage/2_multi_fidelity.md similarity index 66% rename from docs/advanced_usage/2_multi_fidelity.rst rename to docs/advanced_usage/2_multi_fidelity.md index 4bcf11c64..d3c62c3ab 100644 --- a/docs/advanced_usage/2_multi_fidelity.rst +++ b/docs/advanced_usage/2_multi_fidelity.md @@ -1,11 +1,10 @@ -Multi-Fidelity Optimization -=========================== +# Multi-Fidelity Optimization Multi-fidelity refers to running an algorithm on multiple budgets (such as number of epochs or subsets of data) and thereby evaluating the performance prematurely. You can run a multi-fidelity optimization -when using :ref:`Successive Halving` or -:ref:`Hyperband`. `Hyperband` is the default intensifier in the -:ref:`multi-fidelity facade` and requires the arguments +when using [Successive Halving][smac.intensifier.successive_halving] or +[Hyperband][smac.intensifier.hyperband]. `Hyperband` is the default intensifier in the +[multi-fidelity facade][smac.facade.multi_fidelity_facade] and requires the arguments ``min_budget`` and ``max_budget`` in the scenario if no instances are used. In general, multi-fidelity works for both real-valued and instance budgets. In the real-valued case, @@ -14,5 +13,5 @@ target function but ``min_budget`` and ``max_budget`` are used internally to det each stage. That's also the reason why ``min_budget`` and ``max_budget`` are *not required* when using instances: The ``max_budget`` is simply the max number of instances, whereas the ``min_budget`` is simply 1. -Please have a look into our :ref:`multi-fidelity examples` to see how to use +Please have a look into our [multi-fidelity examples](Multi-Fidelity and Multi-Instances) to see how to use multi-fidelity optimization in real-world applications. \ No newline at end of file From 42a625953838a752d9667a9fbb8047ff1bc6736d Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 10:49:19 +0200 Subject: [PATCH 20/64] docs(1_components): fix links --- docs/advanced_usage/1_components.md | 24 ++++++++++++------------ 1 file changed, 12 insertions(+), 12 deletions(-) diff --git a/docs/advanced_usage/1_components.md b/docs/advanced_usage/1_components.md index 9cec83490..1a07c02eb 100644 --- a/docs/advanced_usage/1_components.md +++ b/docs/advanced_usage/1_components.md @@ -14,7 +14,7 @@ occupied, SMAC will wait until a new worker is available again. Moreover, limita trials are checked in every iteration. -## [Surrogate Model](../../api/smac/facade/abstract_facade) +## [Surrogate Model][smac.facade.abstract_facade] The surrogate model is used to approximate the objective function of configurations. In previous versions, the model was referred to as the Empirical Performance Model (EPM). Mostly, Bayesian optimization is used/associated with Gaussian @@ -61,7 +61,7 @@ The steps to receiving data are as follows: * The hyperparameters might still have inactive values. The model takes care of that after the collected data are passed to the model. 
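Patch 19 above converts the multi-fidelity page. As a hedged companion sketch, assuming epochs as the budget and a toy cost formula (neither of which comes from that page), this is roughly how a budget-aware target function and scenario fit together with the multi-fidelity facade:

```python
# Minimal sketch: the intensifier picks a real-valued budget between min_budget and
# max_budget and passes it to the target function; interpreting it (here: epochs) is
# up to the user. The cost formula below is purely illustrative.
from ConfigSpace import Configuration, ConfigurationSpace, Float
from smac import MultiFidelityFacade, Scenario

cs = ConfigurationSpace(seed=0)
cs.add([Float("learning_rate", (1e-5, 1e-1), log=True)])

def train(config: Configuration, seed: int = 0, budget: float = 25) -> float:
    epochs = int(budget)  # e.g. train for `epochs` epochs before reporting a cost
    return 1.0 / (epochs * config["learning_rate"] * 1e3 + 1.0)

scenario = Scenario(cs, n_trials=50, min_budget=5, max_budget=50)
smac = MultiFidelityFacade(scenario, train, overwrite=True)  # Hyperband is the default intensifier
incumbent = smac.optimize()
```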
-## [Acquisition Function](../../api/smac/acquisition/function/abstract_acquisition_function) +## [Acquisition Function][smac.acquisition.function.abstract_acquisition_function] Acquisition functions are mathematical techniques that guide how the parameter space should be explored during Bayesian optimization. They use the predicted mean and predicted variance generated by the surrogate model. @@ -77,7 +77,7 @@ for more information about acquisition functions. the next configurations. -## [Acquisition Maximize](../../api/smac/acquisition/maximizer/abstract_acquisition_maximizer) +## [Acquisition Maximize][smac.acquisition.maximizer.abstract_acquisition_maximizer] The acquisition maximizer is a wrapper for the acquisition function. It returns the next configurations. SMAC supports local search, (sorted) random search, local and (sorted) random search, and differential evolution. @@ -89,10 +89,10 @@ space. When using sorted random search, random configurations are sorted by the acquisition function, you might lower the number of challengers. The acquisition maximizer also incorporates the [Random Design][random-design]. Please see the -[ChallengerList](../../api/smac/acquisition/maximizer/helpers) for more information. +[ChallengerList][smac.acquisition.maximizer.helpers] for more information. -## [Initial Design](../../api/smac/initial_design/abstract_initial_design) +## [Initial Design][smac.initial_design.abstract_initial_design] The surrogate model needs data to be trained. Therefore, the initial design is used to generate the initial data points. We provide random, latin hypercube, sobol, factorial and default initial designs. The default initial design uses @@ -105,7 +105,7 @@ The initial design configurations are yielded by the config selector first. More track of which configurations already have been returned to make sure a configuration is not returned twice. [](){#random-design} -## [Random Design](../../api/smac/initial_design/random_design) +## [Random Design][smac.initial_design.random_design] The random design is used in the acquisition maximizer to tell whether the next configuration should be random or sampled from the acquisition function. For example, if we use a random design with a probability of @@ -117,7 +117,7 @@ are *guaranteed* to find the best configuration over time. In addition to simple probability random design, we also provide annealing and modulus random design. -## [Intensifier](../../api/smac/intensifier/abstract_intensifier) +## [Intensifier][smac.intensifier.abstract_intensifier] The intensifier compares different configurations based on evaluated :term:`trial` so far. It decides which configuration should be `intensified` or, in other words, if a configuration is worth to spend more time on (e.g., @@ -151,7 +151,7 @@ All intensifiers support multi-objective, multi-fidelity, and multi-threading: That means continuing a run as well as incorporating user inputs are natively supported. -## [Configuration Selector](../../api/smac/main/config_selector) +## [Configuration Selector][smac.main.config_selector] The configuration selector uses the initial design, surrogate model, acquisition maximizer/function, runhistory, runhistory encoder, and random design to select the next configuration. The configuration selector is directly @@ -173,7 +173,7 @@ The idea behind the configuration selector is straight forward: ``update_on_iteration_start``. 
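The configuration-selector paragraphs above describe a generator: the initial design is yielded first, then batches of model-based suggestions, and the selector's state is kept between calls. A toy stand-in for that control flow (not SMAC internals, with placeholder strings instead of real configurations):

```python
# Toy sketch of the yield-the-initial-design, then-retrain-and-yield-batches loop.
from typing import Iterator

def config_selector(initial_design: list, retrain_after: int = 3) -> Iterator[str]:
    yield from initial_design              # 1. initial design configurations first
    iteration = 0
    while True:
        iteration += 1                     # 2. (re)train the surrogate model here
        for i in range(retrain_after):     # 3. yield `retrain_after` suggestions, then back to 2.
            yield f"model-based config {iteration}.{i}"

selector = config_selector(["default", "random-0", "random-1"])
for _ in range(8):
    print(next(selector))  # the generator continues exactly where it stopped
```

Keeping the selector as a generator is what lets the intensifier call ``next(self.config_generator)`` repeatedly and have the selection pick up where it left off.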
-## [Multi-Objective Algorithm](../../api/smac/multi_objective/abstract_multi_objective_algorithm) +## [Multi-Objective Algorithm][smac.multi_objective.abstract_multi_objective_algorithm] The multi-objective algorithm is used to scalarize multi-objective values. The multi-objective algorithm gets normalized objective values passed and returns a single value. The resulting value (called by the @@ -186,7 +186,7 @@ runhistory encoder) is then used to train the surrogate model. the scalarized values are different and the acquisition maximizer might return completely different configurations. -## [RunHistory](../../api/smac/runhistory/runhistory) +## [RunHistory][smac.runhistory.runhistory] The runhistory holds all (un-)evaluated trials of the optimization run. You can use the runhistory to get (running) configs, (running) trials, trials of a specific config, and more. @@ -222,14 +222,14 @@ for config in smac.runhistory.get_configs(): The intensifier uses a callback to update the incumbent everytime a new trial is added to the runhistory. -## [RunHistory Encoder](../../api/smac/runhistory/encoder/abstract_encoder) +## [RunHistory Encoder][smac.runhistory.encoder.abstract_encoder] The runhistory encoder is used to encode the runhistory data into a format that can be used by the surrogate model. Only trials with the status ``considered_states`` and timeout trials are considered. Multi-objective values are scalarized using the ``normalize_costs`` function (uses ``objective_bounds`` from the runhistory). Afterwards, the normalized value is processed by the multi-objective algorithm. -## [Callback](../../api/smac/callback) +## [Callback][smac.callback.callback] Callbacks provide the ability to easily execute code before, inside, and after the Bayesian optimization loop. To add a callback, you have to inherit from ``smac.Callback`` and overwrite the methods (if needed). 
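Patch 20 ends here. Picking up the multi-objective section it touched above, the sketch below shows the kind of scalarization a multi-objective algorithm applies to normalized costs before the surrogate model is trained. A plain weighted sum stands in for simplicity (ParEGO, for example, redraws its weights every time the model is retrained and uses a Tchebycheff-style scalarization); the bounds, costs, and weights are all invented.

```python
# Hedged sketch: normalize per-objective costs with the runhistory's objective bounds,
# then collapse them into the single value the surrogate model is trained on.
import numpy as np

objective_bounds = [(0.05, 0.30), (1.0, 9.0)]      # assumed (min, max) per objective
raw_costs = np.array([[0.10, 4.0], [0.25, 2.0]])   # two trials, two objectives

lows = np.array([low for low, _ in objective_bounds])
highs = np.array([high for _, high in objective_bounds])
normalized = (raw_costs - lows) / (highs - lows)   # each objective mapped to [0, 1]

weights = np.array([0.7, 0.3])                     # assumed weights for this iteration
scalarized = normalized @ weights                  # one training target per trial
print(scalarized)
```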
From d2f89fdb88ce9f5e90705995bf1e5c7905faed66 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 10:49:53 +0200 Subject: [PATCH 21/64] docs(api_generator): fix link generation --- docs/api_generator.py | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/docs/api_generator.py b/docs/api_generator.py index c6db813ec..f36290f89 100644 --- a/docs/api_generator.py +++ b/docs/api_generator.py @@ -23,7 +23,7 @@ for path in sorted(Path(source_path).rglob("*.py")): module_path = path.relative_to(source_path).with_suffix("") doc_path = path.relative_to(source_path).with_suffix(".md") - full_doc_path = Path("api", doc_path) + full_doc_path = Path("api/smac", doc_path) parts = tuple(module_path.parts) @@ -34,6 +34,8 @@ continue with mkdocs_gen_files.open(full_doc_path, "w") as fd: + if parts[0] != source_path: + parts = (source_path,) + parts ident = ".".join(parts) fd.write(f"::: {ident}") From 7815b3f35f7cea2c41ee20ead611274e9ec868e3 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 10:56:39 +0200 Subject: [PATCH 22/64] docs(3_multi_objective): rst -> md --- ...lti_objective.rst => 3_multi_objective.md} | 29 +++++++++---------- 1 file changed, 14 insertions(+), 15 deletions(-) rename docs/advanced_usage/{3_multi_objective.rst => 3_multi_objective.md} (61%) diff --git a/docs/advanced_usage/3_multi_objective.rst b/docs/advanced_usage/3_multi_objective.md similarity index 61% rename from docs/advanced_usage/3_multi_objective.rst rename to docs/advanced_usage/3_multi_objective.md index 4fecde06d..d6fc5e01a 100644 --- a/docs/advanced_usage/3_multi_objective.rst +++ b/docs/advanced_usage/3_multi_objective.md @@ -1,9 +1,8 @@ -Multi-Objective Optimization -============================ +# Multi-Objective Optimization Often we do not only want to optimize just a single objective, but multiple instead. SMAC offers a multi-objective optimization interface to do exactly that. Right now, the algorithm used for this is a mean aggregation strategy or -ParEGO [Know06]_. In both cases, multiple objectives are aggregated into a single scalar objective, which is then +ParEGO [[Know06][Know06]]. In both cases, multiple objectives are aggregated into a single scalar objective, which is then optimized by SMAC. However, the run history still keeps the original objectives. @@ -18,24 +17,24 @@ The basic recipe is as follows: multi-objective algorithm default. -.. warning :: +!!! warning - The multi-objective algorithm influences which configurations are sampled next. More specifically, - since only one surrogate model is trained, multiple objectives have to be scalarized into a single objective. - This scalarized value is used to train the surrogate model, which is used by the acquisition function/maximizer - to sample the next configurations. + The multi-objective algorithm influences which configurations are sampled next. More specifically, + since only one surrogate model is trained, multiple objectives have to be scalarized into a single objective. + This scalarized value is used to train the surrogate model, which is used by the acquisition function/maximizer + to sample the next configurations. You receive the incumbents (points on the Pareto front) after the optimization process directly. Alternatively, you can use the method ``get_incumbents`` in the intensifier. -.. code-block:: python +```python - smac = ... - incumbents = smac.optimize() + smac = ... 
+ incumbents = smac.optimize() - # Or you use the intensifier - incumbents = smac.intensifier.get_incumbents() + # Or you use the intensifier + incumbents = smac.intensifier.get_incumbents() +``` - -We show an example of how to use multi-objective with plots in our :ref:`examples`. +We show an example of how to use multi-objective with plots in our [here][examples Date: Thu, 24 Oct 2024 10:58:18 +0200 Subject: [PATCH 23/64] docs(4_instances): rst -> md --- .../{4_instances.rst => 4_instances.md} | 51 +++++++++---------- 1 file changed, 25 insertions(+), 26 deletions(-) rename docs/advanced_usage/{4_instances.rst => 4_instances.md} (55%) diff --git a/docs/advanced_usage/4_instances.rst b/docs/advanced_usage/4_instances.md similarity index 55% rename from docs/advanced_usage/4_instances.rst rename to docs/advanced_usage/4_instances.md index b9d185b1a..4c1156ff2 100644 --- a/docs/advanced_usage/4_instances.rst +++ b/docs/advanced_usage/4_instances.md @@ -1,5 +1,4 @@ -Optimization across Instances -============================= +# Optimization across Instances Often you want to optimize the cost across different datasets, subsets, or even different augmentations. For this purpose, you can use instances. @@ -7,33 +6,33 @@ augmentations. For this purpose, you can use instances. To work with instances, you need to add your pre-defined instance names to the scenario object. In the following example, we want to use five different subsets, identified by its id: -.. code-block:: python - - instances = ["d0", "d1", "d2", "d3", "d4"] - scenario = Scenario( - ... - "instances": instances, - ... - ) +```python +instances = ["d0", "d1", "d2", "d3", "d4"] +scenario = Scenario( + ... + "instances": instances, + ... +) +``` Additionally to the instances, there is the option to define ``instance_features``. Those instance features are used to expand the internal X matrix and thus play a role in training the underlying surrogate model. For example, if I want to add the number of samples and the mean of each subset, I can do as follows: -.. code-block:: bash - - instance_features = { - "d0": [121, 0.6], - "d1": [140, 0.65], - "d2": [99, 0.45], - "d3": [102, 0.59], - "d4": [132, 0.48], - } - - scenario = Scenario( - ... - "instances": instances, - "instance_features": instance_features - ... - ) +```python + instance_features = { + "d0": [121, 0.6], + "d1": [140, 0.65], + "d2": [99, 0.45], + "d3": [102, 0.59], + "d4": [132, 0.48], + } + + scenario = Scenario( + ... + instances=instances, + instance_features=instance_features, + ... + ) +``` From 8c2a6ad7b549c689e4cd7c8ea01bb32dce43d47d Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:00:43 +0200 Subject: [PATCH 24/64] docs(5.1_warmstarting): rst -> md --- docs/advanced_usage/5.1_warmstarting.md | 117 ++++++++++++++++++++++ docs/advanced_usage/5.1_warmstarting.rst | 120 ----------------------- 2 files changed, 117 insertions(+), 120 deletions(-) create mode 100644 docs/advanced_usage/5.1_warmstarting.md delete mode 100644 docs/advanced_usage/5.1_warmstarting.rst diff --git a/docs/advanced_usage/5.1_warmstarting.md b/docs/advanced_usage/5.1_warmstarting.md new file mode 100644 index 000000000..0b69abcde --- /dev/null +++ b/docs/advanced_usage/5.1_warmstarting.md @@ -0,0 +1,117 @@ +# Warmstarting SMAC + +With the ask and tell interface, we can support warmstarting SMAC. We can communicate rich +information about the previous trials to SMAC using `TrialInfo` and `TrialValue` instances. 
+ +We can communicate using the following objects: + +```python +class TrialValue: + """Values of a trial. + + Parameters + ---------- + cost : float | list[float] + time : float, defaults to 0.0 + status : StatusType, defaults to StatusType.SUCCESS + starttime : float, defaults to 0.0 + endtime : float, defaults to 0.0 + additional_info : dict[str, Any], defaults to {} + """ + +class TrialInfo: + """Information about a trial. + + Parameters + ---------- + config : Configuration + instance : str | None, defaults to None + seed : int | None, defaults to None + budget : float | None, defaults to None + """ +``` + +## Usage Example +See [`examples/1_basics/8_warmstart.py`](../../examples/1_basics/8_warmstart.py). + + +```python +from __future__ import annotations + +from smac.scenario import Scenario +from smac.facade import HyperparameterOptimizationFacade +from ConfigSpace import Configuration, ConfigurationSpace, Float +from smac.runhistory.dataclasses import TrialValue, TrialInfo + + +class Rosenbrock2D: + @property + def configspace(self) -> ConfigurationSpace: + cs = ConfigurationSpace(seed=0) + x0 = Float("x0", (-5, 10), default=-3) + x1 = Float("x1", (-5, 10), default=-4) + cs.add([x0, x1]) + + return cs + + def evaluate(self, config: Configuration, seed: int = 0) -> float: + """The 2-dimensional Rosenbrock function as a toy model. + The Rosenbrock function is well know in the optimization community and + often serves as a toy problem. It can be defined for arbitrary + dimensions. The minimium is always at x_i = 1 with a function value of + zero. All input parameters are continuous. The search domain for + all x's is the interval [-5, 10]. + """ + x1 = config["x0"] + x2 = config["x1"] + + cost = 100.0 * (x2 - x1**2.0) ** 2.0 + (1 - x1) ** 2.0 + return cost + + +if __name__ == "__main__": + SEED = 12345 + task = Rosenbrock2D() + + # Previous evaluations + # X vectors need to be connected to the configuration space + configurations = [ + Configuration(task.configspace, {'x0':1, 'x1':2}), + Configuration(task.configspace, {'x0':-1, 'x1':3}), + Configuration(task.configspace, {'x0':5, 'x1':5}), + ] + costs = [task.evaluate(c, seed=SEED) for c in configurations] + + # Define optimization problem and budget + scenario = Scenario(task.configspace, deterministic=False, n_trials=30) + intensifier = HyperparameterOptimizationFacade.get_intensifier(scenario, max_config_calls=1) + smac = HyperparameterOptimizationFacade( + scenario, + task.evaluate, + intensifier=intensifier, + overwrite=True, + + # Modify the initial design to use our custom initial design + initial_design=HyperparameterOptimizationFacade.get_initial_design( + scenario, + n_configs=0, # Do not use the default initial design + additional_configs=configurations # Use the configurations previously evaluated as initial design + # This only passes the configurations but not the cost! + # So in order to actually use the custom, pre-evaluated initial design + # we need to tell those trials, like below. + ) + ) + + # Convert previously evaluated configurations into TrialInfo and TrialValue instances to pass to SMAC + trial_infos = [TrialInfo(config=c, seed=SEED) for c in configurations] + trial_values = [TrialValue(cost=c) for c in costs] + + # Warmstart SMAC with the trial information and values + for info, value in zip(trial_infos, trial_values): + smac.tell(info, value) + + # Optimize as usual + smac.optimize() +``` + +For more details on ask and tell consult [`advanced_usage/5_ask_and_tell`](../advanced_usage/5_ask_and_tell.md). 
diff --git a/docs/advanced_usage/5.1_warmstarting.rst b/docs/advanced_usage/5.1_warmstarting.rst deleted file mode 100644 index fb255681e..000000000 --- a/docs/advanced_usage/5.1_warmstarting.rst +++ /dev/null @@ -1,120 +0,0 @@ -Warmstarting SMAC -====================================== - -With the ask and tell interface, we can support warmstarting SMAC. We can communicate rich -information about the previous trials to SMAC using `TrialInfo` and `TrialValue` instances. - -We can communicate using the following objects: - -.. code-block:: python - - class TrialValue: - """Values of a trial. - - Parameters - ---------- - cost : float | list[float] - time : float, defaults to 0.0 - status : StatusType, defaults to StatusType.SUCCESS - starttime : float, defaults to 0.0 - endtime : float, defaults to 0.0 - additional_info : dict[str, Any], defaults to {} - """ - - class TrialInfo: - """Information about a trial. - - Parameters - ---------- - config : Configuration - instance : str | None, defaults to None - seed : int | None, defaults to None - budget : float | None, defaults to None - """ - - -Usage Example -~~~~~~~~~~~~~ -See `examples/1_basics/8_warmstart.py`. - - -.. code-block:: python - - from __future__ import annotations - - from smac.scenario import Scenario - from smac.facade import HyperparameterOptimizationFacade - from ConfigSpace import Configuration, ConfigurationSpace, Float - from smac.runhistory.dataclasses import TrialValue, TrialInfo - - - class Rosenbrock2D: - @property - def configspace(self) -> ConfigurationSpace: - cs = ConfigurationSpace(seed=0) - x0 = Float("x0", (-5, 10), default=-3) - x1 = Float("x1", (-5, 10), default=-4) - cs.add([x0, x1]) - - return cs - - def evaluate(self, config: Configuration, seed: int = 0) -> float: - """The 2-dimensional Rosenbrock function as a toy model. - The Rosenbrock function is well know in the optimization community and - often serves as a toy problem. It can be defined for arbitrary - dimensions. The minimium is always at x_i = 1 with a function value of - zero. All input parameters are continuous. The search domain for - all x's is the interval [-5, 10]. - """ - x1 = config["x0"] - x2 = config["x1"] - - cost = 100.0 * (x2 - x1**2.0) ** 2.0 + (1 - x1) ** 2.0 - return cost - - - if __name__ == "__main__": - SEED = 12345 - task = Rosenbrock2D() - - # Previous evaluations - # X vectors need to be connected to the configuration space - configurations = [ - Configuration(task.configspace, {'x0':1, 'x1':2}), - Configuration(task.configspace, {'x0':-1, 'x1':3}), - Configuration(task.configspace, {'x0':5, 'x1':5}), - ] - costs = [task.evaluate(c, seed=SEED) for c in configurations] - - # Define optimization problem and budget - scenario = Scenario(task.configspace, deterministic=False, n_trials=30) - intensifier = HyperparameterOptimizationFacade.get_intensifier(scenario, max_config_calls=1) - smac = HyperparameterOptimizationFacade( - scenario, - task.evaluate, - intensifier=intensifier, - overwrite=True, - - # Modify the initial design to use our custom initial design - initial_design=HyperparameterOptimizationFacade.get_initial_design( - scenario, - n_configs=0, # Do not use the default initial design - additional_configs=configurations # Use the configurations previously evaluated as initial design - # This only passes the configurations but not the cost! - # So in order to actually use the custom, pre-evaluated initial design - # we need to tell those trials, like below. 
- ) - ) - - # Convert previously evaluated configurations into TrialInfo and TrialValue instances to pass to SMAC - trial_infos = [TrialInfo(config=c, seed=SEED) for c in configurations] - trial_values = [TrialValue(cost=c) for c in costs] - - # Warmstart SMAC with the trial information and values - for info, value in zip(trial_infos, trial_values): - smac.tell(info, value) - - # Optimize as usual - smac.optimize() - -For more details on ask and tell consult `advanced_usage/5_ask_and_tell`. From 8d6c8c3876d25996ac809c1db3378a0ca8ccb4e7 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:04:22 +0200 Subject: [PATCH 25/64] Fix link --- docs/advanced_usage/3_multi_objective.md | 2 +- docs/advanced_usage/{5_ask_and_tell.rst => 5_ask_and_tell.md} | 0 2 files changed, 1 insertion(+), 1 deletion(-) rename docs/advanced_usage/{5_ask_and_tell.rst => 5_ask_and_tell.md} (100%) diff --git a/docs/advanced_usage/3_multi_objective.md b/docs/advanced_usage/3_multi_objective.md index d6fc5e01a..9698cbda2 100644 --- a/docs/advanced_usage/3_multi_objective.md +++ b/docs/advanced_usage/3_multi_objective.md @@ -37,4 +37,4 @@ use the method ``get_incumbents`` in the intensifier. incumbents = smac.intensifier.get_incumbents() ``` -We show an example of how to use multi-objective with plots in our [here][examples Date: Thu, 24 Oct 2024 11:04:44 +0200 Subject: [PATCH 26/64] docs(5_ask_and_tell): rst -> md --- docs/advanced_usage/5_ask_and_tell.md | 9 ++++----- 1 file changed, 4 insertions(+), 5 deletions(-) diff --git a/docs/advanced_usage/5_ask_and_tell.md b/docs/advanced_usage/5_ask_and_tell.md index c62c78606..3d86b40f6 100644 --- a/docs/advanced_usage/5_ask_and_tell.md +++ b/docs/advanced_usage/5_ask_and_tell.md @@ -1,15 +1,14 @@ -Ask-and-Tell Interface -====================== +# Ask-and-Tell Interface SMAC provides an ask-and-tell interface in v2.0, giving the user the opportunity to ask for the next trial and report the results of the trial. -.. warning :: +!!! warning When specifying ``n_trials`` in the scenario and trials have been registered by the user, SMAC will count the users trials as well. However, the wallclock time will first start when calling ``optimize``. -.. warning :: +!!! warning It might be the case that not all user-provided trials can be considered. Take Successive Halving, for example, when specifying the min and max budget, intermediate budgets are calculated. If the user provided trials with @@ -17,4 +16,4 @@ and report the results of the trial. into the intensification process. -Please have a look at our :ref:`ask-and-tell example`. +Please have a look at our [ask-and-tell example](../../examples/1_basics/3_ask_and_tell). 
From b6de74d238e78ced2958d9f2a0f77d81c47e77b0 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:07:32 +0200 Subject: [PATCH 27/64] docs(6_commandline): rst -> md --- .../{6_commandline.rst => 6_commandline.md} | 80 +++++++++---------- 1 file changed, 38 insertions(+), 42 deletions(-) rename docs/advanced_usage/{6_commandline.rst => 6_commandline.md} (61%) diff --git a/docs/advanced_usage/6_commandline.rst b/docs/advanced_usage/6_commandline.md similarity index 61% rename from docs/advanced_usage/6_commandline.rst rename to docs/advanced_usage/6_commandline.md index e33658ba5..1a7d60628 100644 --- a/docs/advanced_usage/6_commandline.rst +++ b/docs/advanced_usage/6_commandline.md @@ -1,25 +1,24 @@ -Command-Line Interface -====================== +# Command-Line Interface The command-line interface enables the user to run target functions which are non-python code. The passed and further called script (using `Popen`) needs to return a standard output which is then interpreted to perform the optimization process. -.. note :: +!!! note In SMAC v2.0, SMAC can not be called from the command-line directly. Instead, the user should use the python interface to call SMAC. The command-line interface is still available in SMAC v1.4. -Call of the Target Function ---------------------------- +## Call of the Target Function -The following example shows how the script is called: -.. code-block:: bash +The following example shows how the script is called: - filename --instance=test --instance_features=test --seed=0 --hyperparameter1=5323 +```bash +filename --instance=test --instance_features=test --seed=0 --hyperparameter1=5323 +``` However, as for the python target function, the arguments like instance or budget are depending on which components are used. The hyperparameters are depending on the configuration space. The variable ``filename`` could be @@ -28,65 +27,62 @@ something like ``./path/to/your/script.sh``. We recommend using the following code to receive the arguments in a bash script. Please note that the user is not limited to bash scripts but can also use executables, python scripts or anything else. -.. note :: +!!! note Since the script is called wih the filename only, make sure to mark the type of the file (e.g., ``#!/bin/bash`` or ``#!/usr/bin/env python``). -.. warning :: +!!! warning Everytime an instance is passed, also an instance feature in form of a comma-separated list (no spaces) of floats is passed. If no instance feature for the instance is given, an empty list is passed. -.. code-block:: bash - - #!/bin/bash +```bash +#!/bin/bash - # Set arguments first - for argument in "$@" - do - key=$(echo $argument | cut -f1 -d=) - value=$(echo $argument | cut -f2 -d=) +# Set arguments first +for argument in "$@" +do + key=$(echo $argument | cut -f1 -d=) + value=$(echo $argument | cut -f2 -d=) - if [[ $key == *"--"* ]]; then - v="${key/--/}" - declare $v="${value}" - fi - done + if [[ $key == *"--"* ]]; then + v="${key/--/}" + declare $v="${value}" + fi +done - echo $instance - echo $hyperparameter1 +echo $instance +echo $hyperparameter1 +``` -Return of the Target Function ------------------------------ +## Return of the Target Function The script must return an stdout (echo or print) in the following form (white-spaces are ignored): -.. 
code-block::
-
-    cost=0.5; runtime=0.01; status=SUCCESS; additional_info=test (single-objective)
-    cost=0.5, 0.4; runtime=0.01; status=SUCCESS; additional_info=test (multi-objective)
+```
+cost=0.5; runtime=0.01; status=SUCCESS; additional_info=test (single-objective)
+cost=0.5, 0.4; runtime=0.01; status=SUCCESS; additional_info=test (multi-objective)
+```
 
 All arguments are optional except cost and are separated by a semicolon. The string of the status must match
-one of the values from :ref:`StatusType`.
+one of the values from [`StatusType`][smac.runhistory.enumerations].
 
 
-Start the Optimization
-----------------------
+## Start the Optimization
 
 The optimization will be started by the normal python interface. The only difference is that you need to pass a string
 as target function instead of a python function.
 
-.. warning ::
+!!! warning
 
     Your script needs to have rights to be executed (e.g., update the rights with ``chmod``).
 
-.. code-block:: python
-
-    ...
-    smac = BlackBoxFacade(scenario, target_function="./path/to/your/script.sh")
-    incumbent = smac.optimize()
-    ...
-
+```python
+...
+smac = BlackBoxFacade(scenario, target_function="./path/to/your/script.sh")
+incumbent = smac.optimize()
+...
+```

From d9c9236c0c2425c9c9186778fba8af62d8b5e4cc Mon Sep 17 00:00:00 2001
From: benjamc 
Date: Thu, 24 Oct 2024 11:08:54 +0200
Subject: [PATCH 28/64] docs(7_stopping_criteria): rst -> md

---
 ...ng_criteria.rst => 7_stopping_criteria.md} | 27 +++++++++----------
 1 file changed, 12 insertions(+), 15 deletions(-)
 rename docs/advanced_usage/{7_stopping_criteria.rst => 7_stopping_criteria.md} (74%)

diff --git a/docs/advanced_usage/7_stopping_criteria.rst b/docs/advanced_usage/7_stopping_criteria.md
similarity index 74%
rename from docs/advanced_usage/7_stopping_criteria.rst
rename to docs/advanced_usage/7_stopping_criteria.md
index 508046d74..06212e01f 100644
--- a/docs/advanced_usage/7_stopping_criteria.rst
+++ b/docs/advanced_usage/7_stopping_criteria.md
@@ -1,12 +1,10 @@
-Stopping Criteria
-=================
+# Stopping Criteria
 
 In addition to the standard stopping criteria like number of trials or wallclock time, SMAC also provides more
 advanced criteria.
 
 
-Termination Cost Threshold
---------------------------
+## Termination Cost Threshold
 
 SMAC can stop the optimization process after a user-defined cost was reached. In each iteration, the average cost
 (using ``average_cost`` from the run history) from the incumbent is compared to the termination cost threshold. If one
@@ -16,20 +14,19 @@ In other words, the process can be stopped even if the incumbent has not been ev
 highest fidelity, or on all seeds.
 
 
-.. code-block:: python
-
-    scenario = Scenario(
-        ...
-        objectives=["accuracy", "runtime"],
-        termination_cost_threshold=[0.1, np.inf]
-        ...
-    )
+```python
+scenario = Scenario(
+    ...
+    objectives=["accuracy", "runtime"],
+    termination_cost_threshold=[0.1, np.inf]
+    ...
+)
+```
 
 In the code above, the optimization process is stopped if the average accuracy of the incumbent is below 0.1. The
 runtime is ignored completely as it is set to infinity. Note here again that SMAC minimizes the objective values.
 
 
-Automatically Stopping
-----------------------
+## Automatically Stopping
 
-Coming in the next version of SMAC.
\ No newline at end of file
+Coming soon. 😊
\ No newline at end of file

From 91509044e61b0a5f67baa9f1c16130b3d89e6c40 Mon Sep 17 00:00:00 2001
From: benjamc 
Date: Thu, 24 Oct 2024 11:10:21 +0200
Subject: [PATCH 29/64] docs(8_logging): rst -> md

---
 docs/advanced_usage/8_logging.md  | 66 ++++++++++++++++++++++++++++
 docs/advanced_usage/8_logging.rst | 73 -------------------------------
 2 files changed, 66 insertions(+), 73 deletions(-)
 create mode 100644 docs/advanced_usage/8_logging.md
 delete mode 100644 docs/advanced_usage/8_logging.rst

diff --git a/docs/advanced_usage/8_logging.md b/docs/advanced_usage/8_logging.md
new file mode 100644
index 000000000..7d24e84e2
--- /dev/null
+++ b/docs/advanced_usage/8_logging.md
@@ -0,0 +1,66 @@
+# Logging
+
+Logging is a crucial part of the optimization, which should be customizable by the user. This page gives you the
+overview how to customize the logging experience with SMAC.
+
+## Level
+
+The easiest way to change the logging behaviour is to change the level of the global logger. SMAC does this for you
+if you specify the ``logging_level`` in any facade.
+
+```python
+smac = Facade(
+    ...
+    logging_level=20,
+    ...
+)
+```
+
+The table shows you the specific levels:
+
+| Level | Name |
+|-----------|----------|
+| 0 | SHOW ALL |
+| 10 | DEBUG |
+| 20 | INFO |
+| 30 | WARNING |
+| 40 | ERROR |
+| 50 | CRITICAL |
+
+
+## Custom File
+
+Sometimes, the user wants to disable or highlight specific modules. You can do that by passing a custom yaml
+file to the facade instead.
+
+```python
+smac = Facade(
+    ...
+    logging_level="path/to/your/logging.yaml",
+    ...
+)
+```
+
+The following file shows you how to display only error messages from the intensifier
+but keep the level of everything else on INFO:
+
+```yaml
+version: 1
+disable_existing_loggers: false
+formatters:
+    simple:
+        format: '[%(levelname)s][%(filename)s:%(lineno)d] %(message)s'
+handlers:
+    console:
+        class: logging.StreamHandler
+        level: INFO
+        formatter: simple
+        stream: ext://sys.stdout
+loggers:
+    smac.intensifier:
+        level: ERROR
+        handlers: [console]
+root:
+    level: INFO
+    handlers: [console]
+```
diff --git a/docs/advanced_usage/8_logging.rst b/docs/advanced_usage/8_logging.rst
deleted file mode 100644
index f845de9c9..000000000
--- a/docs/advanced_usage/8_logging.rst
+++ /dev/null
@@ -1,73 +0,0 @@
-Logging
-=======
-
-Logging is a crucial part of the optimization, which should be customizable by the user. This page gives you the
-overview how to customize the logging experience with SMAC.
-
-Level
------
-
-The easiest way to change the logging behaviour is to change the level of the global logger. SMAC does this for you
-if you specify the ``logging_level`` in any facade.
-
-.. code-block:: python
-
-    smac = Facade(
-        ...
-        logging_level=20,
-        ...
-    )
-
-
-The table shows you the specific levels:
-
-.. csv-table::
-    :header: "Name", "Level"
-
-    0, SHOW ALL
-    10, DEBUG
-    20, INFO
-    30, WARNING
-    40, ERROR
-    50, CRITICAL
-
-
-Custom File
------------
-
-Sometimes, the user wants to disable or highlight specify modules. You can do that by passing a custom yaml
-file to the facade instead.
-
-.. code-block:: python
-
-    smac = Facade(
-        ...
-        logging_level="path/to/your/logging.yaml",
-        ...
-    )
-
-
-The following file shows you how to display only error messages from the intensifier
-but keep the level of everything else on INFO:
-
-.. 
code-block:: yaml - - version: 1 - disable_existing_loggers: false - formatters: - simple: - format: '[%(levelname)s][%(filename)s:%(lineno)d] %(message)s' - handlers: - console: - class: logging.StreamHandler - level: INFO - formatter: simple - stream: ext://sys.stdout - loggers: - smac.intensifier: - level: ERROR - handlers: [console] - root: - level: INFO - handlers: [console] - From 87b744cd717422454cc148a3ddb195fe36047d88 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:12:12 +0200 Subject: [PATCH 30/64] docs(9_parallelism): rst -> md --- .../{9_parallelism.rst => 9_parallelism.md} | 29 +++++++++---------- 1 file changed, 14 insertions(+), 15 deletions(-) rename docs/advanced_usage/{9_parallelism.rst => 9_parallelism.md} (71%) diff --git a/docs/advanced_usage/9_parallelism.rst b/docs/advanced_usage/9_parallelism.md similarity index 71% rename from docs/advanced_usage/9_parallelism.rst rename to docs/advanced_usage/9_parallelism.md index 9912c782f..85ef545c1 100644 --- a/docs/advanced_usage/9_parallelism.rst +++ b/docs/advanced_usage/9_parallelism.md @@ -1,42 +1,41 @@ -Parallelism -=========== +# Parallelism SMAC supports multiple workers natively via Dask. Just specify ``n_workers`` in the scenario and you are ready to go. -.. note :: +!!! note Please keep in mind that additional workers are only used to evaluate trials. The main thread still orchestrates the optimization process, including training the surrogate model. -.. warning :: +!!! warning Using high number of workers when the target function evaluation is fast might be counterproductive due to the overhead of communcation. Consider using only one worker in this case. -.. warning :: +!!! warning When using multiple workers, SMAC is not reproducible anymore. -Running on a Cluster --------------------- +## Running on a Cluster + You can also pass a custom dask client, e.g. to run on a slurm cluster. -See our :ref:`parallelism example`. +See our [parallelism example](../../examples/1_basics/7_parallelization_cluster.py). -.. warning :: +!!! warning On some clusters you cannot spawn new jobs when running a SLURMCluster inside a job instead of on the login node. No obvious errors might be raised but it can hang silently. -.. warning :: +!!! warning Sometimes you need to modify your launch command which can be done with - ``SLURMCluster.job_class.submit_command``. - -.. code-block:: python + `SLURMCluster.job_class.submit_command`. - cluster.job_cls.submit_command = submit_command - cluster.job_cls.cancel_command = cancel_command +```python +cluster.job_cls.submit_command = submit_command +cluster.job_cls.cancel_command = cancel_command +``` \ No newline at end of file From 38fa278c99bedfefcb1dca60151abc8b06e80189 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:13:32 +0200 Subject: [PATCH 31/64] docs(10_continue): rst -> md --- docs/advanced_usage/{10_continue.rst => 10_continue.md} | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) rename docs/advanced_usage/{10_continue.rst => 10_continue.md} (91%) diff --git a/docs/advanced_usage/10_continue.rst b/docs/advanced_usage/10_continue.md similarity index 91% rename from docs/advanced_usage/10_continue.rst rename to docs/advanced_usage/10_continue.md index e29c3f9c9..1972ccecd 100644 --- a/docs/advanced_usage/10_continue.rst +++ b/docs/advanced_usage/10_continue.md @@ -1,5 +1,4 @@ -Continue -======== +# Continue SMAC can automatically restore states where it left off if a run was interrupted or prematurely finished. 
To do so, it reads in old files (derived from scenario's name, output_directory and seed) and obtains the scenario information @@ -20,4 +19,4 @@ The behavior can be controlled by setting the parameter ``overwrite`` in the fac and the old run is not affected. -Please have a look at our :ref:`continue example`. \ No newline at end of file +Please have a look at our [continue example](../../examples/1_basics/5_continue). \ No newline at end of file From 3973334b4062c1f4a37e36f028d87413df38984b Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:14:08 +0200 Subject: [PATCH 32/64] docs(11_reproducibility): rst -> md --- .../{11_reproducibility.rst => 11_reproducibility.md} | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) rename docs/advanced_usage/{11_reproducibility.rst => 11_reproducibility.md} (76%) diff --git a/docs/advanced_usage/11_reproducibility.rst b/docs/advanced_usage/11_reproducibility.md similarity index 76% rename from docs/advanced_usage/11_reproducibility.rst rename to docs/advanced_usage/11_reproducibility.md index 2f9f542f3..24cd4f4be 100644 --- a/docs/advanced_usage/11_reproducibility.rst +++ b/docs/advanced_usage/11_reproducibility.md @@ -1,4 +1,3 @@ -Reproducibility -=============== +# Reproducibility Reproducibility can only be ensured if one worker is used and no time (wallclock or CPU time) is involved. \ No newline at end of file From f4c711e78929cddcd8a4a302648cca0ec2a2874e Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:15:53 +0200 Subject: [PATCH 33/64] docs(12_optimzations): rst -> md --- .../{12_optimizations.rst => 12_optimizations.md} | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) rename docs/advanced_usage/{12_optimizations.rst => 12_optimizations.md} (94%) diff --git a/docs/advanced_usage/12_optimizations.rst b/docs/advanced_usage/12_optimizations.md similarity index 94% rename from docs/advanced_usage/12_optimizations.rst rename to docs/advanced_usage/12_optimizations.md index 6e8d2e82d..9e9382d2a 100644 --- a/docs/advanced_usage/12_optimizations.rst +++ b/docs/advanced_usage/12_optimizations.md @@ -1,5 +1,4 @@ -Optimizations -============= +# Optimizations SMAC might run faster or slower depending on the user specifications. In general it applies that the more you know about the underlying target function, the better you can optimize the optimization @@ -17,4 +16,4 @@ The following list might help you to make the optimization process more efficien - High target function evaluation times: As many ``n_workers`` as cores. - Low target function evaluation times: Only one worker because the communication might take longer than evaluating - on a single thread. \ No newline at end of file + on a single thread. 
\ No newline at end of file From 72a89c8cc1ba5a85e58d329c9144bc578b396e37 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:34:00 +0200 Subject: [PATCH 34/64] docs(examples): hide turbo and boing --- .../{1_turbo_optimizer.py => _1_turbo_optimizer.py} | 0 .../{2_boing_optimizer.py => _2_boing_optimizer.py} | 0 2 files changed, 0 insertions(+), 0 deletions(-) rename examples/4_advanced_optimizer/{1_turbo_optimizer.py => _1_turbo_optimizer.py} (100%) rename examples/4_advanced_optimizer/{2_boing_optimizer.py => _2_boing_optimizer.py} (100%) diff --git a/examples/4_advanced_optimizer/1_turbo_optimizer.py b/examples/4_advanced_optimizer/_1_turbo_optimizer.py similarity index 100% rename from examples/4_advanced_optimizer/1_turbo_optimizer.py rename to examples/4_advanced_optimizer/_1_turbo_optimizer.py diff --git a/examples/4_advanced_optimizer/2_boing_optimizer.py b/examples/4_advanced_optimizer/_2_boing_optimizer.py similarity index 100% rename from examples/4_advanced_optimizer/2_boing_optimizer.py rename to examples/4_advanced_optimizer/_2_boing_optimizer.py From 0121f6bfd4a227ac520e81cd78b1cdd31632a9bd Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:35:26 +0200 Subject: [PATCH 35/64] docs(examples): rst -> md --- examples/1_basics/1_quadratic_function.py | 4 +- examples/1_basics/2_svm_cv.py | 4 +- examples/1_basics/3_ask_and_tell.py | 4 +- examples/1_basics/4_callback.py | 4 +- examples/1_basics/5_continue.py | 4 +- examples/1_basics/6_priors.py | 4 +- .../1_basics/7_parallelization_cluster.py | 4 +- examples/1_basics/8_warmstart.py | 4 +- examples/2_multi_fidelity/1_mlp_epochs.py | 6 +-- examples/2_multi_fidelity/2_sgd_datasets.py | 4 +- .../3_specify_HB_via_total_budget.py | 5 +-- examples/3_multi_objective/1_schaffer.py | 4 +- examples/3_multi_objective/2_parego.py | 4 +- .../3_metadata_callback.py | 4 +- .../4_intensify_crossvalidation.py | 4 +- .../1_call_target_function_script.py | 39 +++++++++---------- 16 files changed, 35 insertions(+), 67 deletions(-) diff --git a/examples/1_basics/1_quadratic_function.py b/examples/1_basics/1_quadratic_function.py index 4d27c7ae6..7b35d4881 100644 --- a/examples/1_basics/1_quadratic_function.py +++ b/examples/1_basics/1_quadratic_function.py @@ -1,6 +1,4 @@ -""" -Quadratic Function -^^^^^^^^^^^^^^^^^^ +"""Quadratic Function An example of applying SMAC to optimize a quadratic function. diff --git a/examples/1_basics/2_svm_cv.py b/examples/1_basics/2_svm_cv.py index 345fcffb0..dae017055 100644 --- a/examples/1_basics/2_svm_cv.py +++ b/examples/1_basics/2_svm_cv.py @@ -1,6 +1,4 @@ -""" -Support Vector Machine with Cross-Validation -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Support Vector Machine with Cross-Validation An example of optimizing a simple support vector machine on the IRIS dataset. We use the hyperparameter optimization facade, which uses a random forest as its surrogate model. It is able to diff --git a/examples/1_basics/3_ask_and_tell.py b/examples/1_basics/3_ask_and_tell.py index 6ab8b5ba8..d99a6c31d 100644 --- a/examples/1_basics/3_ask_and_tell.py +++ b/examples/1_basics/3_ask_and_tell.py @@ -1,6 +1,4 @@ -""" -Ask-and-Tell -^^^^^^^^^^^^ +"""Ask-and-Tell This examples show how to use the Ask-and-Tell interface. 
""" diff --git a/examples/1_basics/4_callback.py b/examples/1_basics/4_callback.py index 0fd9e9d9d..00f21f571 100644 --- a/examples/1_basics/4_callback.py +++ b/examples/1_basics/4_callback.py @@ -1,6 +1,4 @@ -""" -Custom Callback -^^^^^^^^^^^^^^^ +"""Custom Callback Using callbacks is the easieast way to integrate custom code inside the Bayesian optimization loop. In this example, we disable SMAC's default logging option and use the custom callback to log the evaluated trials. diff --git a/examples/1_basics/5_continue.py b/examples/1_basics/5_continue.py index 63cfb3957..79f2d1c4a 100644 --- a/examples/1_basics/5_continue.py +++ b/examples/1_basics/5_continue.py @@ -1,6 +1,4 @@ -""" -Continue an Optimization -^^^^^^^^^^^^^^^^^^^^^^^^ +"""Continue an Optimization SMAC can also be continued from a previous run. To do so, it reads in old files (derived from scenario's name, output_directory and seed) and sets the corresponding components. In this example, an optimization of a simple quadratic diff --git a/examples/1_basics/6_priors.py b/examples/1_basics/6_priors.py index 691460c0b..9f9fb2b9e 100644 --- a/examples/1_basics/6_priors.py +++ b/examples/1_basics/6_priors.py @@ -1,6 +1,4 @@ -""" -User Priors over the Optimum -^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""User Priors over the Optimum Example for optimizing a Multi-Layer Perceptron (MLP) setting priors over the optimum on the hyperparameters. These priors are derived from user knowledge (from previous runs on similar diff --git a/examples/1_basics/7_parallelization_cluster.py b/examples/1_basics/7_parallelization_cluster.py index 2467f7d22..e250b6def 100644 --- a/examples/1_basics/7_parallelization_cluster.py +++ b/examples/1_basics/7_parallelization_cluster.py @@ -1,6 +1,4 @@ -""" -Parallelization-on-Cluster -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Parallelization-on-Cluster An example of applying SMAC to optimize Branin using parallelization via Dask client on a SLURM cluster. If you do not want to use a cluster but your local machine, set dask_client diff --git a/examples/1_basics/8_warmstart.py b/examples/1_basics/8_warmstart.py index eef95d173..4b4178190 100644 --- a/examples/1_basics/8_warmstart.py +++ b/examples/1_basics/8_warmstart.py @@ -1,6 +1,4 @@ -""" -Warmstarting SMAC -====================================== +"""Warmstarting SMAC With the ask and tell interface, we can support warmstarting SMAC. We can communicate rich information about the previous trials to SMAC using `TrialInfo` and `TrialValue` instances. diff --git a/examples/2_multi_fidelity/1_mlp_epochs.py b/examples/2_multi_fidelity/1_mlp_epochs.py index 5cb0aefa0..e3b5034bd 100644 --- a/examples/2_multi_fidelity/1_mlp_epochs.py +++ b/examples/2_multi_fidelity/1_mlp_epochs.py @@ -1,6 +1,4 @@ -""" -Multi-Layer Perceptron Using Multiple Epochs -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Multi-Layer Perceptron Using Multiple Epochs Example for optimizing a Multi-Layer Perceptron (MLP) using multiple budgets. Since we want to take advantage of multi-fidelity, the ``MultiFidelityFacade`` is a good choice. By default, @@ -13,7 +11,7 @@ that ``budget`` specifies the number of epochs smac wants to allocate. The digits dataset is chosen to optimize the average accuracy on 5-fold cross validation. -.. note:: +!!! note This example uses the ``MultiFidelityFacade`` facade, which is the closest implementation to `BOHB `_. 
diff --git a/examples/2_multi_fidelity/2_sgd_datasets.py b/examples/2_multi_fidelity/2_sgd_datasets.py index 178ea21c2..964626d20 100644 --- a/examples/2_multi_fidelity/2_sgd_datasets.py +++ b/examples/2_multi_fidelity/2_sgd_datasets.py @@ -1,6 +1,4 @@ -""" -Stochastic Gradient Descent On Multiple Datasets -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Stochastic Gradient Descent On Multiple Datasets Example for optimizing a Multi-Layer Perceptron (MLP) across multiple (dataset) instances. diff --git a/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py b/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py index 7c0ebdcf0..2569c5893 100644 --- a/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py +++ b/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py @@ -1,6 +1,5 @@ -""" -Specify Number of Trials via a Total Budget in Hyperband -^^^^^^^^^^^^^^^^^^ +"""Specify Number of Trials via a Total Budget in Hyperband + This example uses a dummy function but illustrates how to setup Hyperband if you want to specify a total optimization budget in terms of fidelity units. diff --git a/examples/3_multi_objective/1_schaffer.py b/examples/3_multi_objective/1_schaffer.py index 913662508..3a31b8b18 100644 --- a/examples/3_multi_objective/1_schaffer.py +++ b/examples/3_multi_objective/1_schaffer.py @@ -1,6 +1,4 @@ -""" -2D Schaffer Function with Objective Weights -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""2D Schaffer Function with Objective Weights A simple example on how to use multi-objective optimization is shown. The 2D Schaffer function is used. In the plot you can see that all points are on the Pareto front. However, since we set the objective weights, you can notice that diff --git a/examples/3_multi_objective/2_parego.py b/examples/3_multi_objective/2_parego.py index b5294fb98..9c7386b1d 100644 --- a/examples/3_multi_objective/2_parego.py +++ b/examples/3_multi_objective/2_parego.py @@ -1,6 +1,4 @@ -""" -ParEGO -^^^^^^ +"""ParEGO An example of how to use multi-objective optimization with ParEGO. Both accuracy and run-time are going to be optimized on the digits dataset using an MLP, and the configurations are shown in a plot, highlighting the best ones in diff --git a/examples/4_advanced_optimizer/3_metadata_callback.py b/examples/4_advanced_optimizer/3_metadata_callback.py index b82670dbc..65b685ebd 100644 --- a/examples/4_advanced_optimizer/3_metadata_callback.py +++ b/examples/4_advanced_optimizer/3_metadata_callback.py @@ -1,6 +1,4 @@ -""" -Callback for logging run metadata -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Callback for logging run metadata An example for using a callback to log run metadata to a file. Any arguments passed to the callback will be logged to a json file at the beginning of the SMAC run (arguments must be json serializable). diff --git a/examples/4_advanced_optimizer/4_intensify_crossvalidation.py b/examples/4_advanced_optimizer/4_intensify_crossvalidation.py index 679253da1..da33e5170 100644 --- a/examples/4_advanced_optimizer/4_intensify_crossvalidation.py +++ b/examples/4_advanced_optimizer/4_intensify_crossvalidation.py @@ -1,6 +1,4 @@ -""" -Speeding up Cross-Validation with Intensification -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Speeding up Cross-Validation with Intensification An example of optimizing a simple support vector machine on the digits dataset. 
In contrast to the [simple example](examples/1_basics/2_svm_cv.py), in which all cross-validation folds are executed diff --git a/examples/5_commandline/1_call_target_function_script.py b/examples/5_commandline/1_call_target_function_script.py index 8c28ae0a3..33a21b6c0 100644 --- a/examples/5_commandline/1_call_target_function_script.py +++ b/examples/5_commandline/1_call_target_function_script.py @@ -1,31 +1,28 @@ -""" -Call Target Function From Script -^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ +"""Call Target Function From Script This simple example shows how to call a script with the following content: -.. code-block:: bash - - #!/bin/bash - - # Set arguments first - for argument in "$@" - do - key=$(echo $argument | cut -f1 -d=) - value=$(echo $argument | cut -f2 -d=) +```bash +#!/bin/bash - if [[ $key == *"--"* ]]; then - v="${key/--/}" - declare $v="${value}" - fi - done +# Set arguments first +for argument in "$@" +do + key=$(echo $argument | cut -f1 -d=) + value=$(echo $argument | cut -f2 -d=) - # We simply set the cost to our parameter - cost=$x0 + if [[ $key == *"--"* ]]; then + v="${key/--/}" + declare $v="${value}" + fi +done - # Return everything - echo "cost=$cost" +# We simply set the cost to our parameter +cost=$x0 +# Return everything +echo "cost=$cost" +``` """ from ConfigSpace import ConfigurationSpace From b10dd3e2ebcd9a7500e48bd43e70abddc82a5191 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 11:45:20 +0200 Subject: [PATCH 36/64] docs(examples): fix tiny --- examples/1_basics/1_quadratic_function.py | 2 +- examples/1_basics/5_continue.py | 4 ++-- .../1_basics/7_parallelization_cluster.py | 4 ++-- examples/1_basics/8_warmstart.py | 2 +- .../3_metadata_callback.py | 20 ++++++++++--------- .../4_intensify_crossvalidation.py | 2 +- 6 files changed, 18 insertions(+), 16 deletions(-) diff --git a/examples/1_basics/1_quadratic_function.py b/examples/1_basics/1_quadratic_function.py index 7b35d4881..bda62efde 100644 --- a/examples/1_basics/1_quadratic_function.py +++ b/examples/1_basics/1_quadratic_function.py @@ -3,7 +3,7 @@ An example of applying SMAC to optimize a quadratic function. We use the black-box facade because it is designed for black-box function optimization. -The black-box facade uses a :term:`Gaussian Process` as its surrogate model. +The black-box facade uses a [Gaussian Process][GP]` as its surrogate model. The facade works best on a numerical hyperparameter configuration space and should not be applied to problems with large evaluation budgets (up to 1000 evaluations). """ diff --git a/examples/1_basics/5_continue.py b/examples/1_basics/5_continue.py index 79f2d1c4a..7b0b237a1 100644 --- a/examples/1_basics/5_continue.py +++ b/examples/1_basics/5_continue.py @@ -4,11 +4,11 @@ output_directory and seed) and sets the corresponding components. In this example, an optimization of a simple quadratic function is continued. -First, after creating a scenario with 50 trials, we run SMAC with overwrite=True. This will +First, after creating a scenario with 50 trials, we run SMAC with `overwrite=True`. This will overwrite any previous runs (in case the example was called before). We use a custom callback to artificially stop this first optimization after 10 trials. -Second, we again run the SMAC optimization using the same scenario, but this time with overwrite=False. As +Second, we again run the SMAC optimization using the same scenario, but this time with `overwrite=False`. 
As there already is a previous run with the same meta data, this run will be continued until the 50 trials are reached. """ diff --git a/examples/1_basics/7_parallelization_cluster.py b/examples/1_basics/7_parallelization_cluster.py index e250b6def..607b521c0 100644 --- a/examples/1_basics/7_parallelization_cluster.py +++ b/examples/1_basics/7_parallelization_cluster.py @@ -1,4 +1,4 @@ -"""Parallelization-on-Cluster +"""Parallelization on Cluster An example of applying SMAC to optimize Branin using parallelization via Dask client on a SLURM cluster. If you do not want to use a cluster but your local machine, set dask_client @@ -20,7 +20,7 @@ Here we optimize the synthetic 2d function Branin. We use the black-box facade because it is designed for black-box function optimization. -The black-box facade uses a :term:`Gaussian Process` as its surrogate model. +The black-box facade uses a [Gaussian Process][GP] as its surrogate model. The facade works best on a numerical hyperparameter configuration space and should not be applied to problems with large evaluation budgets (up to 1000 evaluations). """ diff --git a/examples/1_basics/8_warmstart.py b/examples/1_basics/8_warmstart.py index 4b4178190..a21ded585 100644 --- a/examples/1_basics/8_warmstart.py +++ b/examples/1_basics/8_warmstart.py @@ -2,7 +2,7 @@ With the ask and tell interface, we can support warmstarting SMAC. We can communicate rich information about the previous trials to SMAC using `TrialInfo` and `TrialValue` instances. -For more details on ask and tell consult `advanced_usage/5_ask_and_tell`. +For more details on ask and tell consult the [info page ask-and-tell](../../../advanced_usage/5_ask_and_tell). """ from __future__ import annotations diff --git a/examples/4_advanced_optimizer/3_metadata_callback.py b/examples/4_advanced_optimizer/3_metadata_callback.py index 65b685ebd..681a669a7 100644 --- a/examples/4_advanced_optimizer/3_metadata_callback.py +++ b/examples/4_advanced_optimizer/3_metadata_callback.py @@ -4,16 +4,18 @@ to a json file at the beginning of the SMAC run (arguments must be json serializable). Instead of editing the Git-related information (repository, branch, commit) by hand each time they change, -this information can also be added automatically using GitPython (install via "pip install GitPython"). +this information can also be added automatically using GitPython (install via `pip install GitPython`). 
There is an example for obtaining the information via GitPython below: - from git import Repo - repo = Repo(".", search_parent_directories=True) - MetadataCallback( - repository=repo.working_tree_dir.split("/")[-1], - branch=str(repo.active_branch), - commit=str(repo.head.commit), - command=" ".join([sys.argv[0][len(repo.working_tree_dir) + 1:]] + sys.argv[1:]), - ) +```python +from git import Repo +repo = Repo(".", search_parent_directories=True) +MetadataCallback( + repository=repo.working_tree_dir.split("/")[-1], + branch=str(repo.active_branch), + commit=str(repo.head.commit), + command=" ".join([sys.argv[0][len(repo.working_tree_dir) + 1:]] + sys.argv[1:]), +) +``` """ import sys diff --git a/examples/4_advanced_optimizer/4_intensify_crossvalidation.py b/examples/4_advanced_optimizer/4_intensify_crossvalidation.py index da33e5170..513cdf43f 100644 --- a/examples/4_advanced_optimizer/4_intensify_crossvalidation.py +++ b/examples/4_advanced_optimizer/4_intensify_crossvalidation.py @@ -1,7 +1,7 @@ """Speeding up Cross-Validation with Intensification An example of optimizing a simple support vector machine on the digits dataset. In contrast to the -[simple example](examples/1_basics/2_svm_cv.py), in which all cross-validation folds are executed +[simple example](../../1_basics/2_svm_cv), in which all cross-validation folds are executed at once, we use the intensification mechanism described in the original [SMAC paper](https://link.springer.com/chapter/10.1007/978-3-642-25566-3_40) as also demonstrated by [Auto-WEKA](https://dl.acm.org/doi/10.1145/2487575.2487629). This mechanism allows us to From 8cea4103cec9da628b4b2f2b3ee6e90589abf453 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 14:12:29 +0200 Subject: [PATCH 37/64] docs(examples): fix navigation sections --- docs/advanced_usage/10_continue.md | 2 +- docs/advanced_usage/3_multi_objective.md | 2 +- docs/advanced_usage/5.1_warmstarting.md | 2 +- docs/advanced_usage/5_ask_and_tell.md | 2 +- docs/advanced_usage/9_parallelism.md | 4 ++-- docs/example_runner.py | 8 +++++++- examples/1_basics/README.rst | 2 -- examples/1_basics/heading.txt | 1 + examples/2_multi_fidelity/README.rst | 3 --- examples/2_multi_fidelity/heading.txt | 1 + examples/3_multi_objective/README.rst | 3 --- examples/3_multi_objective/heading.txt | 1 + examples/4_advanced_optimizer/heading.txt | 1 + examples/5_commandline/README.rst | 4 ---- examples/5_commandline/heading.txt | 1 + examples/{README.rst => README.md} | 0 16 files changed, 18 insertions(+), 19 deletions(-) delete mode 100644 examples/1_basics/README.rst create mode 100644 examples/1_basics/heading.txt delete mode 100644 examples/2_multi_fidelity/README.rst create mode 100644 examples/2_multi_fidelity/heading.txt delete mode 100644 examples/3_multi_objective/README.rst create mode 100644 examples/3_multi_objective/heading.txt create mode 100644 examples/4_advanced_optimizer/heading.txt delete mode 100644 examples/5_commandline/README.rst create mode 100644 examples/5_commandline/heading.txt rename examples/{README.rst => README.md} (100%) diff --git a/docs/advanced_usage/10_continue.md b/docs/advanced_usage/10_continue.md index 1972ccecd..f16848995 100644 --- a/docs/advanced_usage/10_continue.md +++ b/docs/advanced_usage/10_continue.md @@ -19,4 +19,4 @@ The behavior can be controlled by setting the parameter ``overwrite`` in the fac and the old run is not affected. -Please have a look at our [continue example](../../examples/1_basics/5_continue). 
\ No newline at end of file +Please have a look at our [continue example](../../examples/1%20Basics/5_continue). \ No newline at end of file diff --git a/docs/advanced_usage/3_multi_objective.md b/docs/advanced_usage/3_multi_objective.md index 9698cbda2..0771412b7 100644 --- a/docs/advanced_usage/3_multi_objective.md +++ b/docs/advanced_usage/3_multi_objective.md @@ -37,4 +37,4 @@ use the method ``get_incumbents`` in the intensifier. incumbents = smac.intensifier.get_incumbents() ``` -We show an example of how to use multi-objective with plots in our [examples](../../examples/3_multi_objective/1_schaffer). +We show an example of how to use multi-objective with plots in our [examples](../../examples/3%20Multi-Objective/1_schaffer). diff --git a/docs/advanced_usage/5.1_warmstarting.md b/docs/advanced_usage/5.1_warmstarting.md index 0b69abcde..38189b594 100644 --- a/docs/advanced_usage/5.1_warmstarting.md +++ b/docs/advanced_usage/5.1_warmstarting.md @@ -32,7 +32,7 @@ class TrialInfo: ``` ## Usage Example -See [`examples/1_basics/8_warmstart.py`](../../examples/1_basics/8_warmstart.py). +See [`examples/1_basics/8_warmstart.py`](../../examples/1%20Basics/8_warmstart.py). ```python diff --git a/docs/advanced_usage/5_ask_and_tell.md b/docs/advanced_usage/5_ask_and_tell.md index 3d86b40f6..dfd135b04 100644 --- a/docs/advanced_usage/5_ask_and_tell.md +++ b/docs/advanced_usage/5_ask_and_tell.md @@ -16,4 +16,4 @@ and report the results of the trial. into the intensification process. -Please have a look at our [ask-and-tell example](../../examples/1_basics/3_ask_and_tell). +Please have a look at our [ask-and-tell example](../../examples/1%20Basics/3_ask_and_tell). diff --git a/docs/advanced_usage/9_parallelism.md b/docs/advanced_usage/9_parallelism.md index 85ef545c1..8d54f3aa3 100644 --- a/docs/advanced_usage/9_parallelism.md +++ b/docs/advanced_usage/9_parallelism.md @@ -23,7 +23,7 @@ SMAC supports multiple workers natively via Dask. Just specify ``n_workers`` in ## Running on a Cluster You can also pass a custom dask client, e.g. to run on a slurm cluster. -See our [parallelism example](../../examples/1_basics/7_parallelization_cluster.py). +See our [parallelism example](../../examples/1%20Basics/7_parallelization_cluster). !!! warning @@ -33,7 +33,7 @@ See our [parallelism example](../../examples/1_basics/7_parallelization_cluster. !!! warning Sometimes you need to modify your launch command which can be done with - `SLURMCluster.job_class.submit_command`. + `SLURMCluster.job_class.submit_command`. 
```python cluster.job_cls.submit_command = submit_command diff --git a/docs/example_runner.py b/docs/example_runner.py index d0ba74e92..39fd92a70 100644 --- a/docs/example_runner.py +++ b/docs/example_runner.py @@ -287,9 +287,15 @@ def copy_section(self) -> str: if filename.startswith("_"): continue + heading_fn = Path(path.parent / "heading.txt") + if heading_fn.is_file(): + # This adjusts the navigation section heading to the content in heading.txt + heading = heading_fn.read_text().strip() + full_doc_path = Path("examples", heading, *doc_path.parts[1:]) + example = Example.from_file(path) with mkdocs_gen_files.open(full_doc_path, "w") as f: f.write(example.generate_doc()) toc_name = example.name - mkdocs_gen_files.set_edit_path(full_doc_path, path) \ No newline at end of file + mkdocs_gen_files.set_edit_path(full_doc_path, full_doc_path) \ No newline at end of file diff --git a/examples/1_basics/README.rst b/examples/1_basics/README.rst deleted file mode 100644 index ab81e73c5..000000000 --- a/examples/1_basics/README.rst +++ /dev/null @@ -1,2 +0,0 @@ -Basics ------- \ No newline at end of file diff --git a/examples/1_basics/heading.txt b/examples/1_basics/heading.txt new file mode 100644 index 000000000..223b890d1 --- /dev/null +++ b/examples/1_basics/heading.txt @@ -0,0 +1 @@ +1 Basics \ No newline at end of file diff --git a/examples/2_multi_fidelity/README.rst b/examples/2_multi_fidelity/README.rst deleted file mode 100644 index 77811397b..000000000 --- a/examples/2_multi_fidelity/README.rst +++ /dev/null @@ -1,3 +0,0 @@ -Multi-Fidelity and Multi-Instances ----------------------------------- - diff --git a/examples/2_multi_fidelity/heading.txt b/examples/2_multi_fidelity/heading.txt new file mode 100644 index 000000000..a32280f09 --- /dev/null +++ b/examples/2_multi_fidelity/heading.txt @@ -0,0 +1 @@ +2 Multi-Fidelity and Multi-Instances \ No newline at end of file diff --git a/examples/3_multi_objective/README.rst b/examples/3_multi_objective/README.rst deleted file mode 100644 index 995ad9408..000000000 --- a/examples/3_multi_objective/README.rst +++ /dev/null @@ -1,3 +0,0 @@ -Multi-Objective ---------------- - diff --git a/examples/3_multi_objective/heading.txt b/examples/3_multi_objective/heading.txt new file mode 100644 index 000000000..653c325d7 --- /dev/null +++ b/examples/3_multi_objective/heading.txt @@ -0,0 +1 @@ +3 Multi-Objective \ No newline at end of file diff --git a/examples/4_advanced_optimizer/heading.txt b/examples/4_advanced_optimizer/heading.txt new file mode 100644 index 000000000..080cf4a43 --- /dev/null +++ b/examples/4_advanced_optimizer/heading.txt @@ -0,0 +1 @@ +4 Advanced Topics \ No newline at end of file diff --git a/examples/5_commandline/README.rst b/examples/5_commandline/README.rst deleted file mode 100644 index 7614fd90d..000000000 --- a/examples/5_commandline/README.rst +++ /dev/null @@ -1,4 +0,0 @@ -Command-Line Interface ----------------------- - -SMAC can call a target function from a script. This is useful if you want to optimize non-python code. 
\ No newline at end of file diff --git a/examples/5_commandline/heading.txt b/examples/5_commandline/heading.txt new file mode 100644 index 000000000..73dc157a8 --- /dev/null +++ b/examples/5_commandline/heading.txt @@ -0,0 +1 @@ +5 Command-Line Interface \ No newline at end of file diff --git a/examples/README.rst b/examples/README.md similarity index 100% rename from examples/README.rst rename to examples/README.md From 2a4a16e9cbe047008f43333ec9e93488e398f13c Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 14:12:36 +0200 Subject: [PATCH 38/64] docs(examples): fix navigation sections --- docs/6_references.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/docs/6_references.md b/docs/6_references.md index 91e73aeac..d64e0be76 100644 --- a/docs/6_references.md +++ b/docs/6_references.md @@ -1,20 +1,20 @@ # References -* [LJDR18] L. Li, K. Jamieson, G. DeSalvo, A. Rostamizadeh, A. Talwalkar; +* [](){#LJDR18}[LJDR18] L. Li, K. Jamieson, G. DeSalvo, A. Rostamizadeh, A. Talwalkar; Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization; https://jmlr.org/papers/v18/16-558.html -* [HSSL22] Carl Hvarfner, Danny Stoll, Artur Souza, Marius Lindauer, Frank Hutter, Luigi Nardi; +* [](){#HSSL22}[HSSL22] Carl Hvarfner, Danny Stoll, Artur Souza, Marius Lindauer, Frank Hutter, Luigi Nardi; πBO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization; https://arxiv.org/pdf/2204.11051.pdf -* [Know06] J. Knowles; +* [](){#Know06}[Know06] J. Knowles; ParEGO: A Hybrid Algorithm with on-Line Landscape Approximation for Expensive Multiobjective Optimization Problems; https://www.semanticscholar.org/paper/ParEGO%3A-a-hybrid-algorithm-with-on-line-landscape-Knowles/73b5b196b35fb23e1f908d73b787c2c2942fadb5 -* [SKKS10] N. Srinivas, S. M. Kakade, A. Krause, M. Seeger; +* [](){#SKKS10}[SKKS10] N. Srinivas, S. M. Kakade, A. Krause, M. Seeger; Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design; https://arxiv.org/pdf/0912.3995.pdf \ No newline at end of file From c3adc50a0d700057cb6d95610f8c540e974c4e7e Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 14:15:33 +0200 Subject: [PATCH 39/64] fix typo --- examples/1_basics/1_quadratic_function.py | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/examples/1_basics/1_quadratic_function.py b/examples/1_basics/1_quadratic_function.py index bda62efde..8a7744768 100644 --- a/examples/1_basics/1_quadratic_function.py +++ b/examples/1_basics/1_quadratic_function.py @@ -3,7 +3,7 @@ An example of applying SMAC to optimize a quadratic function. We use the black-box facade because it is designed for black-box function optimization. -The black-box facade uses a [Gaussian Process][GP]` as its surrogate model. +The black-box facade uses a [Gaussian Process][GP] as its surrogate model. The facade works best on a numerical hyperparameter configuration space and should not be applied to problems with large evaluation budgets (up to 1000 evaluations). 
""" From 524c24d3b52acccb32e8a28d010b276dbc75567c Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 14:28:02 +0200 Subject: [PATCH 40/64] Update --- Makefile | 10 +++--- docs/3_getting_started.md | 6 ++-- docs/7_glossary.md | 54 ++++++++++++++--------------- docs/advanced_usage/1_components.md | 1 + docs/index.md | 35 +++++++++++++++++++ examples/README.md | 3 +- mkdocs.yaml | 33 ++++++++++++------ 7 files changed, 95 insertions(+), 47 deletions(-) create mode 100644 docs/index.md diff --git a/Makefile b/Makefile index 145d1bb14..854160cc6 100644 --- a/Makefile +++ b/Makefile @@ -87,10 +87,12 @@ tests: $(PYTEST) ${TESTS_DIR} docs: - $(MAKE) -C ${DOCDIR} docs - @echo - @echo "View docs at:" - @echo ${INDEX_HTML} + # $(MAKE) -C ${DOCDIR} docs + # @echo + # @echo "View docs at:" + # @echo ${INDEX_HTML} + $(PYTHON) -m webbrowser -t "http://127.0.0.1:8000/" + $(PYTHON) -m mkdocs serve --clean --watch-theme examples: $(MAKE) -C ${DOCDIR} examples diff --git a/docs/3_getting_started.md b/docs/3_getting_started.md index 854d87971..ae4fc6411 100644 --- a/docs/3_getting_started.md +++ b/docs/3_getting_started.md @@ -63,7 +63,7 @@ in an efficient way. ## Scenario -The :ref:`Scenario` is used to provide environment variables. For example, +The [Scenario][smac.scenario] is used to provide environment variables. For example, if you want to limit the optimization process by a time limit or want to specify where to save the results. ```python @@ -82,12 +82,12 @@ scenario = Scenario( ## Facade -A :ref:`facade` is the entry point to SMAC, which constructs a default optimization +A [facade][smac.facade.abstract_facade] is the entry point to SMAC, which constructs a default optimization pipeline for you. SMAC offers various facades, which satisfy many common use cases and are crucial to achieving peak performance. The idea behind the facades is to provide a simple interface to all of SMAC's components, which is easy to use and understand and without the need of deep diving into the material. However, experts are invited to change the components to their specific hyperparameter optimization needs. The following -table (horizontally scrollable) shows you what is supported and reveals the default :ref:`components`: +table (horizontally scrollable) shows you what is supported and reveals the default [components][components]: | | [Black-Box](smac.facade.blackbox_facade) | [Hyperparameter Optimization](smac.facade.hyperparameter_optimization_facade) | [Multi-Fidelity](smac.facade.multi_fidelity_facade) | [Algorithm Configuration](smac.facade.algorithm_configuration_facade) | [Random](smac.facade.random_facade) | [Hyperband](smac.facade.hyperband_facade) | | --- | --- | --- | --- | --- | --- | --- | diff --git a/docs/7_glossary.md b/docs/7_glossary.md index 912cd5a10..721428ae1 100644 --- a/docs/7_glossary.md +++ b/docs/7_glossary.md @@ -1,29 +1,29 @@ # Glossary -- **BB**: See `Black-Box`. -- **BO**: See `Bayesian Optimization`. -- **BOHB**: [Bayesian optimization and Hyperband](https://arxiv.org/abs/1807.01774). -- **CLI**: Command-Line Interface. -- **CV**: Cross-Validation. -- **GP**: Gaussian Process. -- **GP-MCMC**: Gaussian Process with Markov-Chain Monte-Carlo. -- **HB**: See `Hyperband`. -- **HP**: Hyperparameter. -- **MF**: See `Multi-Fidelity`. -- **RF**: Random Forest. -- **ROAR**: See `Random Online Adaptive Racing`. -- **SMAC**: Sequential Model-Based Algorithm Configuration. -- **SMBO**: Sequential Mode-Based Optimization. 
-- **Bayesian Optimization**: Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions. A Bayesian optimization weights exploration and exploitation to find the minimum of its objective. -- **Black-Box**: Refers to an algorithm being optimized, where only input and output are observable. -- **Budget**: Budget is another word for fidelity. Examples are the number of training epochs or the size of the data subset the algorithm is trained on. However, budget can also be used in the context of instances. For example, if you have 100 instances (let's say we optimize across datasets) and you want to run your algorithm on 10 of them, then the budget is 10. -- **Hyperband**: [Hyperband](https://arxiv.org/abs/1603.06560). A novel bandit-based algorithm for hyperparameter optimization. Hyperband is an extension of successive halving and therefore works with multi-fidelities. -- **Incumbent**: The incumbent is the current best known configuration. -- **Instances**: Often you want to optimize across different datasets, subsets, or even different transformations (e.g. augmentation). In general, each of these is called an instance. Configurations are evaluated on multiple instances so that a configuration is found which performs superior on all instances instead of only a few. -- **Intensification**: A mechanism that governs how many evaluations to perform with each configuration and when to trust a configuration enough to make it the new current best known configuration (the incumbent). -- **Multi-Fidelity**: Multi-fidelity refers to running an algorithm on multiple budgets (such as number of epochs or subsets of data) and thereby evaluating the performance prematurely. -- **Multi-Objective**: A multi-objective optimization problem is a problem with more than one objective. The goal is to find a solution that is optimal or at least a good compromise in all objectives. -- **Objective**: An objective is a metric to evaluate the quality or performance of an algorithm. -- **Random Online Adaptive Racing**: Random Online Adaptive Racing. A simple model-free instantiation of the general `SMBO` framework. It selects configurations uniformly at random and iteratively compares them against the current incumbent using the intensification mechanism. See [SMAC extended](https://ai.dmi.unibas.ch/research/reading_group/hutter-et-al-tr2010.pdf) chapter 3.2 for details. -- **Target Function**: Your model, which returns a cost based on the given config, seed, budget, and/or instance. -- **Trial**: Trial is a single run of a target function on a combination of configuration, seed, budget and/or instance. +- [](){#BB}**BB**: See `Black-Box`. +- [](){#BO}**BO**: See `Bayesian Optimization`. +- [](){#BOHB}**BOHB**: [Bayesian optimization and Hyperband](https://arxiv.org/abs/1807.01774). +- [](){#CLI}**CLI**: Command-Line Interface. +- [](){#CV}**CV**: Cross-Validation. +- [](){#GP}**GP**: Gaussian Process. +- [](){#GP-MCMC}**GP-MCMC**: Gaussian Process with Markov-Chain Monte-Carlo. +- [](){#HB}**HB**: See `Hyperband`. +- [](){#HP}**HP**: Hyperparameter. +- [](){#MF}**MF**: See `Multi-Fidelity`. +- [](){#RF}**RF**: Random Forest. +- [](){#ROAR}**ROAR**: See `Random Online Adaptive Racing`. +- [](){#SMAC}**SMAC**: Sequential Model-Based Algorithm Configuration. +- [](){#SMBO}**SMBO**: Sequential Mode-Based Optimization. 
+- [](){#Bayesian Optimization}**Bayesian Optimization**: Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions. A Bayesian optimization weights exploration and exploitation to find the minimum of its objective. +- [](){#Black-Box}**Black-Box**: Refers to an algorithm being optimized, where only input and output are observable. +- [](){#Budget}**Budget**: Budget is another word for fidelity. Examples are the number of training epochs or the size of the data subset the algorithm is trained on. However, budget can also be used in the context of instances. For example, if you have 100 instances (let's say we optimize across datasets) and you want to run your algorithm on 10 of them, then the budget is 10. +- [](){#Hyperband}**Hyperband**: [Hyperband](https://arxiv.org/abs/1603.06560). A novel bandit-based algorithm for hyperparameter optimization. Hyperband is an extension of successive halving and therefore works with multi-fidelities. +- [](){#Incumbent}**Incumbent**: The incumbent is the current best known configuration. +- [](){#Instances}**Instances**: Often you want to optimize across different datasets, subsets, or even different transformations (e.g. augmentation). In general, each of these is called an instance. Configurations are evaluated on multiple instances so that a configuration is found which performs superior on all instances instead of only a few. +- [](){#Intensification}**Intensification**: A mechanism that governs how many evaluations to perform with each configuration and when to trust a configuration enough to make it the new current best known configuration (the incumbent). +- [](){#Multi-Fidelity}**Multi-Fidelity**: Multi-fidelity refers to running an algorithm on multiple budgets (such as number of epochs or subsets of data) and thereby evaluating the performance prematurely. +- [](){#Multi-Objective}**Multi-Objective**: A multi-objective optimization problem is a problem with more than one objective. The goal is to find a solution that is optimal or at least a good compromise in all objectives. +- [](){#Objective}**Objective**: An objective is a metric to evaluate the quality or performance of an algorithm. +- [](){#Random Online Adaptive Racing}**Random Online Adaptive Racing**: Random Online Adaptive Racing. A simple model-free instantiation of the general `SMBO` framework. It selects configurations uniformly at random and iteratively compares them against the current incumbent using the intensification mechanism. See [SMAC extended](https://ai.dmi.unibas.ch/research/reading_group/hutter-et-al-tr2010.pdf) chapter 3.2 for details. +- [](){#Target Function}**Target Function**: Your model, which returns a cost based on the given config, seed, budget, and/or instance. +- [](){#Trial}**Trial**: Trial is a single run of a target function on a combination of configuration, seed, budget and/or instance. 
diff --git a/docs/advanced_usage/1_components.md b/docs/advanced_usage/1_components.md index 1a07c02eb..8b9b1ce24 100644 --- a/docs/advanced_usage/1_components.md +++ b/docs/advanced_usage/1_components.md @@ -1,3 +1,4 @@ +[](){#components} # Components In addition to the basic components mentioned in :ref:`Getting Started`, all other components are diff --git a/docs/index.md b/docs/index.md new file mode 100644 index 000000000..338979c40 --- /dev/null +++ b/docs/index.md @@ -0,0 +1,35 @@ +Home +# SMAC3 Documentation + +## Introduction + +SMAC is a tool for algorithm configuration to optimize the parameters of arbitrary algorithms, including hyperparameter optimization of Machine Learning algorithms. The main core consists of Bayesian Optimization in combination with an aggressive racing mechanism to efficiently decide which of two configurations performs better. + +SMAC3 is written in Python3 and continuously tested with Python 3.8, 3.9, and 3.10. Its Random Forest is written in C++. In the following, SMAC is representatively mentioned for SMAC3. + +If you use SMAC, please cite our [JMLR paper](https://jmlr.org/papers/v23/21-0888.html): + +```text +@article{lindauer-jmlr22a, + author = {Marius Lindauer and Katharina Eggensperger and Matthias Feurer and André Biedenkapp and Difan Deng and Carolin Benjamins and Tim Ruhkopf and René Sass and Frank Hutter}, + title = {SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization}, + journal = {Journal of Machine Learning Research}, + year = {2022}, + volume = {23}, + number = {54}, + pages = {1--9}, + url = {http://jmlr.org/papers/v23/21-0888.html} +} +``` + +For the original idea, we refer to: + +```text +Hutter, F. and Hoos, H. H. and Leyton-Brown, K. +Sequential Model-Based Optimization for General Algorithm Configuration +In: Proceedings of the conference on Learning and Intelligent Optimization (LION 5) +``` + +## Contact + +SMAC3 is developed by [AutoML.org](https://www.automl.org). If you want to contribute or found an issue, please visit our [GitHub page](https://github.com/automl/SMAC3). Our guidelines for contributing to this package can be found [here](https://github.com/automl/SMAC3/blob/main/CONTRIBUTING.md). diff --git a/examples/README.md b/examples/README.md index deaa8fbac..33bd78ca3 100644 --- a/examples/README.md +++ b/examples/README.md @@ -1,5 +1,4 @@ -Examples -======== +# Examples We provide several examples of how to use SMAC with Python. Practical use-cases were chosen to show the variety of SMAC. \ No newline at end of file diff --git a/mkdocs.yaml b/mkdocs.yaml index 5f011098c..8a1ded77f 100644 --- a/mkdocs.yaml +++ b/mkdocs.yaml @@ -16,6 +16,8 @@ site_name: "SMAC3" repo_url: https://github.com/automl/SMAC3/ repo_name: automl/SMAC3 +site_dir: docs/site + theme: name: material logo: images/logo.png @@ -142,8 +144,6 @@ plugins: - docs/example_runner.py - literate-nav: nav_file: SUMMARY.md - # - mkdocs-glossary-plugin: - # glossary_dirs: ["docs/glossary"] # This plugin considers the md files in "docs/glossary" as glossary files. 
- mkdocstrings: default_handler: python enable_inventory: true @@ -193,19 +193,30 @@ plugins: nav: - Home: "index.md" - - Installation: "installation.md" + - Installation: "1_installation.md" + - Package Overview: "2_package_overview.md" + - Getting Started: "3_getting_started.md" + - Advanced Usage: + - "advanced_usage/1_components.md" + - "advanced_usage/2_multi_fidelity.md" + - "advanced_usage/3_multi_objective.md" + - "advanced_usage/4_instances.md" + - "advanced_usage/5_ask_and_tell.md" + - "advanced_usage/6_commandline.md" + - "advanced_usage/7_stopping_criteria.md" + - "advanced_usage/8_logging.md" + - "advanced_usage/9_parallelism.md" + - "advanced_usage/10_continue.md" + - "advanced_usage/11_reproducibility.md" + - "advanced_usage/12_optimizations.md" # Auto generated with docs/examples_runner.py - Examples: "examples/" # Auto generated with docs/api_generator.py - API: "api/" - # - Guides: - # - "guides/first-steps.md" - # - "guides/hydra.md" - # - "guides/database.md" - # - "guides/containers.md" - # - "guides/large-scale-benchmarking.md" - # - "guides/using-carps.md" - # - Commands: "commands.md" + - Info & FAQ: + - "6_references.md" + - "7_glossary.md" + - "8_faq.md" # - Contributing: # - "contributing/index.md" # - "contributing/contributing-a-benchmark.md" From 4891644282c902fabe3443c292eead0daacf9b57 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 14:28:12 +0200 Subject: [PATCH 41/64] ignore more --- .gitignore | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/.gitignore b/.gitignore index 679812827..f72077d61 100644 --- a/.gitignore +++ b/.gitignore @@ -147,4 +147,7 @@ src # Pycharm .idea -.vscode \ No newline at end of file +.vscode + +projects +_api \ No newline at end of file From 360ba4b759c702a17762da255617ff53ba6ad818 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 17:46:58 +0200 Subject: [PATCH 42/64] docs(docstring): fix refs --- smac/acquisition/function/confidence_bound.py | 2 +- smac/acquisition/function/prior_acqusition_function.py | 2 +- smac/facade/hyperband_facade.py | 2 +- 3 files changed, 3 insertions(+), 3 deletions(-) diff --git a/smac/acquisition/function/confidence_bound.py b/smac/acquisition/function/confidence_bound.py index 13d5db320..a03a77eb5 100644 --- a/smac/acquisition/function/confidence_bound.py +++ b/smac/acquisition/function/confidence_bound.py @@ -18,7 +18,7 @@ class LCB(AbstractAcquisitionFunction): r"""Computes the lower confidence bound for a given x over the best so far value as acquisition value. - :math:`LCB(X) = \mu(\mathbf{X}) - \sqrt(\beta_t)\sigma(\mathbf{X})` [SKKS10]_ + :math:`LCB(X) = \mu(\mathbf{X}) - \sqrt(\beta_t)\sigma(\mathbf{X})` [[SKKS10][SKKS10]] with diff --git a/smac/acquisition/function/prior_acqusition_function.py b/smac/acquisition/function/prior_acqusition_function.py index ca180908d..9f0a7d2a3 100644 --- a/smac/acquisition/function/prior_acqusition_function.py +++ b/smac/acquisition/function/prior_acqusition_function.py @@ -28,7 +28,7 @@ class PriorAcquisitionFunction(AbstractAcquisitionFunction): r"""Weight the acquisition function with a user-defined prior over the optimum. See "piBO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization" by Carl - Hvarfner et al. [HSSL22]_ for further details. + Hvarfner et al. [[HSSL22][HSSL22]] for further details. 
Parameters ---------- diff --git a/smac/facade/hyperband_facade.py b/smac/facade/hyperband_facade.py index d8be7bf4d..92ab11c14 100644 --- a/smac/facade/hyperband_facade.py +++ b/smac/facade/hyperband_facade.py @@ -10,7 +10,7 @@ class HyperbandFacade(RandomFacade): """ - Facade to use model-free Hyperband [LJDR18]_ for algorithm configuration. + Facade to use model-free Hyperband [[LJDR18][LJDR18]] for algorithm configuration. Uses Random Aggressive Online Racing (ROAR) to compare configurations, a random initial design and the Hyperband intensifier. From 52bdc1d6dfdbf26d3b27cfda56792088e5265f22 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 17:47:10 +0200 Subject: [PATCH 43/64] docs(docstring): fix refs --- docs/2_package_overview.md | 2 +- docs/3_getting_started.md | 6 +++++- docs/advanced_usage/1_components.md | 8 ++++---- 3 files changed, 10 insertions(+), 6 deletions(-) diff --git a/docs/2_package_overview.md b/docs/2_package_overview.md index 07466da19..28d3dd512 100644 --- a/docs/2_package_overview.md +++ b/docs/2_package_overview.md @@ -19,7 +19,7 @@ Use categorical, continuous, hierarchical and/or conditional hyperparameters wit Optimization with any [objective](#) (e.g., accuracy, runtime, cross-validation, ...) is possible. #### [Multi-Objective](#) Optimization -Optimize arbitrary number of objectives using scalarized multi-objective algorithms. Both ParEGO [Know06] and mean aggregation strategies are supported. +Optimize arbitrary number of objectives using scalarized multi-objective algorithms. Both ParEGO [[Know06][Know06]] and mean aggregation strategies are supported. #### [Multi-Fidelity](#) Optimization Judge configurations on multiple [budgets](#) to discard unsuitable configurations early on. This will result in a massive speed-up, depending on the budgets. diff --git a/docs/3_getting_started.md b/docs/3_getting_started.md index ae4fc6411..108af0545 100644 --- a/docs/3_getting_started.md +++ b/docs/3_getting_started.md @@ -1,3 +1,4 @@ +[](){#getting_started} # Getting Started SMAC needs four core components (configuration space, target function, scenario and a facade) to run an @@ -5,7 +6,10 @@ optimization process, all of which are explained on this page. They interact in the following way: -Interaction of SMAC's components +
+ ![Interaction of SMAC's components](./images/smac_components_interaction.jpg){ width="300" } +
Interaction of SMAC's components
+
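 As a rough sketch of how the four components mentioned above fit together, assuming a toy search space with a single float hyperparameter and an arbitrary trial budget: ```python from ConfigSpace import ConfigurationSpace from smac import HyperparameterOptimizationFacade, Scenario def train(config, seed: int = 0) -> float: # Target function: replace with your actual training/evaluation code. SMAC minimizes the return value. return (config["x"] - 1.0) ** 2 configspace = ConfigurationSpace({"x": (-5.0, 5.0)}) # configuration space scenario = Scenario(configspace, n_trials=50) # scenario: settings of the optimization smac = HyperparameterOptimizationFacade(scenario, train) # facade: wires all components together incumbent = smac.optimize() print(incumbent) ``` 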
## Configuration Space diff --git a/docs/advanced_usage/1_components.md b/docs/advanced_usage/1_components.md index 8b9b1ce24..25aec5f25 100644 --- a/docs/advanced_usage/1_components.md +++ b/docs/advanced_usage/1_components.md @@ -1,14 +1,14 @@ [](){#components} # Components -In addition to the basic components mentioned in :ref:`Getting Started`, all other components are +In addition to the basic components mentioned in [Getting Started][getting_started], all other components are explained in the following paragraphs to give a better picture of SMAC. These components are all used to guide the optimization process and simple changes can influence the results drastically. Before diving into the components, we shortly want to explain the main Bayesian optimization loop in SMAC. -The :term:`SMBO` receives all instantiated components from the facade and the logic happens here. +The [SMBO][SMBO] receives all instantiated components from the facade and the logic happens here. In general, a while loop is used to ask for the next trial, submit it to the runner, and wait for the runner to -finish the evaluation. Since the runner and the :ref:`SMBO` +finish the evaluation. Since the runner and the [`SMBO`][smac.main.smbo] object are decoupled, the while loop continues and asks for even more trials (e.g., in case of multi-threading), which also can be submitted to the runner. If all workers are occupied, SMAC will wait until a new worker is available again. Moreover, limitations like wallclock time and remaining @@ -24,7 +24,7 @@ higher dimensional and complex spaces. The data used to train the surrogate model is collected by the runhistory encoder (receives data from the runhistory and transforms it). If budgets are -involved, the highest budget which satisfies ``min_trials`` (defaults to 1) in :ref:`smac.main.config_selector` is +involved, the highest budget which satisfies ``min_trials`` (defaults to 1) in [smac.main.config_selector][smac.main.config_selector] is used. If no budgets are used, all observations are used. If you are using instances, it is recommended to use instance features. The model is trained on each instance From a88d82f9650a9da37422257c145b5ab98b9fefd5 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 17:47:21 +0200 Subject: [PATCH 44/64] docs(home): add sections --- docs/index.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/docs/index.md b/docs/index.md index 338979c40..3d9427335 100644 --- a/docs/index.md +++ b/docs/index.md @@ -7,9 +7,11 @@ SMAC is a tool for algorithm configuration to optimize the parameters of arbitra SMAC3 is written in Python3 and continuously tested with Python 3.8, 3.9, and 3.10. Its Random Forest is written in C++. In the following, SMAC is representatively mentioned for SMAC3. 
+ +## Cite Us If you use SMAC, please cite our [JMLR paper](https://jmlr.org/papers/v23/21-0888.html): -```text +```bibtex @article{lindauer-jmlr22a, author = {Marius Lindauer and Katharina Eggensperger and Matthias Feurer and André Biedenkapp and Difan Deng and Carolin Benjamins and Tim Ruhkopf and René Sass and Frank Hutter}, title = {SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization}, From 866f949ee1d2f709397f86ee43eb1e8687f65532 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 17:48:26 +0200 Subject: [PATCH 45/64] build(setup): add requirements for mkdocs --- setup.py | 16 ++++++++++++++++ 1 file changed, 16 insertions(+) diff --git a/setup.py b/setup.py index eb01e0ec6..6ac234efe 100644 --- a/setup.py +++ b/setup.py @@ -39,6 +39,22 @@ def read_file(filepath: str) -> str: "flake8", "pre-commit", "pylint", + "mkdocs", + "mkdocs-material", + "mkdocs-autorefs", + "mkdocs-gen-files", + "mkdocs-literate-nav", + "mkdocs-glightbox", + "mkdocs-glossary-plugin", + "mkdocstrings[python]", + "markdown-exec[ansi]", + "mike", + "pillow", + "cairosvg", + "black", # This allows mkdocstrings to format signatures in the docs + "pytest", + "pytest-coverage", + "pytest-cases", ], } From 234c4ee664e23c549399aa63933c3959527461d5 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 17:49:36 +0200 Subject: [PATCH 46/64] refactor(abstract_intensifier): track param max_config_calls --- smac/intensifier/abstract_intensifier.py | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/smac/intensifier/abstract_intensifier.py b/smac/intensifier/abstract_intensifier.py index b7a5ae1ca..e46d6ccd1 100644 --- a/smac/intensifier/abstract_intensifier.py +++ b/smac/intensifier/abstract_intensifier.py @@ -100,6 +100,7 @@ def meta(self) -> dict[str, Any]: return { "name": self.__class__.__name__, "max_incumbents": self._max_incumbents, + "max_config_calls": self._max_config_calls, "seed": self._seed, } @@ -594,7 +595,7 @@ def update_incumbents(self, config: Configuration) -> None: self._remove_rejected_config(config_id) logger.info( f"Added config {config_hash} and rejected config {removed_incumbent_hash} as incumbent because " - f"it is not better than the incumbents on {len(config_isb_keys)} instances:" + f"it is not better than the incumbents on {len(config_isb_keys)} instances: " ) print_config_changes(rh.get_config(removed_incumbent_id), config, logger=logger) elif len(previous_incumbents) < len(new_incumbents): From 5bd9817c62f45eda91f339625c0f8d4d7564f053 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 17:51:42 +0200 Subject: [PATCH 47/64] refactor(docs): remove old files --- Makefile | 10 --------- docs/Makefile | 30 -------------------------- docs/index.rst | 57 -------------------------------------------------- 3 files changed, 97 deletions(-) delete mode 100644 docs/Makefile delete mode 100644 docs/index.rst diff --git a/Makefile b/Makefile index 854160cc6..6e481dd06 100644 --- a/Makefile +++ b/Makefile @@ -87,19 +87,9 @@ tests: $(PYTEST) ${TESTS_DIR} docs: - # $(MAKE) -C ${DOCDIR} docs - # @echo - # @echo "View docs at:" - # @echo ${INDEX_HTML} $(PYTHON) -m webbrowser -t "http://127.0.0.1:8000/" $(PYTHON) -m mkdocs serve --clean --watch-theme -examples: - $(MAKE) -C ${DOCDIR} examples - @echo - @echo "View docs at:" - @echo ${INDEX_HTML} - # Build a distribution in ./dist build: $(PYTHON) setup.py sdist diff --git a/docs/Makefile b/docs/Makefile deleted file mode 100644 index d0d8f6e2b..000000000 --- a/docs/Makefile +++ /dev/null @@ 
-1,30 +0,0 @@ -SPHINXBUILD = sphinx-build -BUILDDIR = build -SPHINXOPTS = -ALLSPHINXOPTS = $(SPHINXOPTS) . - -.PHONY: clean buildapi linkcheck html docs html-noexamples - -clean: - rm -rf $(BUILDDIR)/* - rm -rf api - rm -rf examples - -linkcheck: - SPHINX_GALLERY_PLOT=False $(SPHINXBUILD) -b linkcheck $(ALLSPHINXOPTS) $(BUILDDIR)/linkcheck - @echo - @echo "Link check complete; look for any errors in the above output " \ - "or in $(BUILDDIR)/linkcheck/output.txt." - -html: - $(SPHINXBUILD) -b html $(ALLSPHINXOPTS) $(BUILDDIR)/html - @echo - @echo "Build finished. The HTML pages are in $(BUILDDIR)/html." - -html-noexamples: - SPHINX_GALLERY_PLOT=False $(SPHINXBUILD) -b html $(ALLSPHINXOPTS) $(SOURCEDIR) $(BUILDDIR)/html - @echo - @echo "Build finished. The HTML pages are in $(BUILDDIR)/html." - -docs: html linkcheck - diff --git a/docs/index.rst b/docs/index.rst deleted file mode 100644 index 3d8f03520..000000000 --- a/docs/index.rst +++ /dev/null @@ -1,57 +0,0 @@ -Home -==== - -.. toctree:: - :hidden: - :maxdepth: 2 - - 1_installation - 2_package_overview - 3_getting_started - 4_minimal_example - examples/index - advanced_usage/index - 5_api - 6_references - 7_glossary - 8_faq - 9_license - - -SMAC is a tool for algorithm configuration to optimize the parameters of arbitrary algorithms, including hyperparameter -optimization of Machine Learning algorithms. The main core consists of Bayesian Optimization in combination with an -aggressive racing mechanism to efficiently decide which of two configurations performs better. - -SMAC3 is written in Python3 and continuously tested with Python 3.8, 3.9, and 3.10. Its Random -Forest is written in C++. In the following, SMAC is representatively mentioned for SMAC3. - -If you use SMAC, please cite our `JMLR paper `_: - -.. code-block:: text - - @article{lindauer-jmlr22a, - author = {Marius Lindauer and Katharina Eggensperger and Matthias Feurer and André Biedenkapp and Difan Deng and Carolin Benjamins and Tim Ruhkopf and René Sass and Frank Hutter}, - title = {SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization}, - journal = {Journal of Machine Learning Research}, - year = {2022}, - volume = {23}, - number = {54}, - pages = {1--9}, - url = {http://jmlr.org/papers/v23/21-0888.html} - } - -For the original idea, we refer to: - -.. code-block:: text - - Hutter, F. and Hoos, H. H. and Leyton-Brown, K. - Sequential Model-Based Optimization for General Algorithm Configuration - In: Proceedings of the conference on Learning and Intelligent OptimizatioN (LION 5) - - -Contact -------- - -SMAC3 is developed by ``_. -If you want to contribute or found an issue please visit our github page ``_. -Our guidelines for contributing to this package can be found `here `_. 
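 Regarding the intensifier change in patch 46 above (`max_config_calls` is now part of the intensifier's meta information), here is a minimal sketch of how that metadata could be inspected; the facade, search space, and trial count are placeholders: ```python from ConfigSpace import ConfigurationSpace from smac import HyperparameterOptimizationFacade, Scenario def quadratic(config, seed: int = 0) -> float: return (config["x"] - 1.0) ** 2 scenario = Scenario(ConfigurationSpace({"x": (-5.0, 5.0)}), n_trials=20) smac = HyperparameterOptimizationFacade(scenario, quadratic, overwrite=True) # The meta dictionary now reports "max_config_calls" alongside the intensifier name, max_incumbents and seed. print(smac.intensifier.meta) ``` 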
From fdf25d4979ecd4f8c53d6edf1483ac89aaa37e09 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 18:08:23 +0200 Subject: [PATCH 48/64] refactor(acquisition): fix 'acqusition' typo --- smac/acquisition/function/__init__.py | 4 +++- smac/acquisition/function/abstract_acquisition_function.py | 2 +- smac/acquisition/function/confidence_bound.py | 2 +- ...r_acqusition_function.py => prior_acquisition_function.py} | 0 smac/facade/abstract_facade.py | 2 +- smac/main/config_selector.py | 2 +- tests/test_acquisition/test_functions.py | 4 ++-- 7 files changed, 9 insertions(+), 7 deletions(-) rename smac/acquisition/function/{prior_acqusition_function.py => prior_acquisition_function.py} (100%) diff --git a/smac/acquisition/function/__init__.py b/smac/acquisition/function/__init__.py index 5ac88a721..9a0a4c52c 100644 --- a/smac/acquisition/function/__init__.py +++ b/smac/acquisition/function/__init__.py @@ -6,7 +6,9 @@ from smac.acquisition.function.integrated_acquisition_function import ( IntegratedAcquisitionFunction, ) -from smac.acquisition.function.prior_acqusition_function import PriorAcquisitionFunction +from smac.acquisition.function.prior_acquisition_function import ( + PriorAcquisitionFunction, +) from smac.acquisition.function.probability_improvement import PI from smac.acquisition.function.thompson import TS diff --git a/smac/acquisition/function/abstract_acquisition_function.py b/smac/acquisition/function/abstract_acquisition_function.py index 519f5b3d0..c62170299 100644 --- a/smac/acquisition/function/abstract_acquisition_function.py +++ b/smac/acquisition/function/abstract_acquisition_function.py @@ -50,7 +50,7 @@ def update(self, model: AbstractModel, **kwargs: Any) -> None: This method will be called after fitting the model, but before maximizing the acquisition function. As an examples, EI uses it to update the current fmin. The default implementation only updates the - attributes of the acqusition function which are already present. + attributes of the acquisition function which are already present. Calls `_update` to update the acquisition function attributes. diff --git a/smac/acquisition/function/confidence_bound.py b/smac/acquisition/function/confidence_bound.py index a03a77eb5..e5a264b03 100644 --- a/smac/acquisition/function/confidence_bound.py +++ b/smac/acquisition/function/confidence_bound.py @@ -93,7 +93,7 @@ def _compute(self, X: np.ndarray) -> np.ndarray: assert self._model is not None if self._num_data is None: raise ValueError( - "No current number of data points specified. Call `update` to inform the acqusition function." + "No current number of data points specified. Call `update` to inform the acquisition function." 
) if len(X.shape) == 1: diff --git a/smac/acquisition/function/prior_acqusition_function.py b/smac/acquisition/function/prior_acquisition_function.py similarity index 100% rename from smac/acquisition/function/prior_acqusition_function.py rename to smac/acquisition/function/prior_acquisition_function.py diff --git a/smac/facade/abstract_facade.py b/smac/facade/abstract_facade.py index 9a2031099..fd153cdb8 100644 --- a/smac/facade/abstract_facade.py +++ b/smac/facade/abstract_facade.py @@ -14,7 +14,7 @@ from smac.acquisition.function.abstract_acquisition_function import ( AbstractAcquisitionFunction, ) -from smac.acquisition.maximizer.abstract_acqusition_maximizer import ( +from smac.acquisition.maximizer.abstract_acquisition_maximizer import ( AbstractAcquisitionMaximizer, ) from smac.callback.callback import Callback diff --git a/smac/main/config_selector.py b/smac/main/config_selector.py index 42a72f02b..9e3fd8f3c 100644 --- a/smac/main/config_selector.py +++ b/smac/main/config_selector.py @@ -10,7 +10,7 @@ from smac.acquisition.function.abstract_acquisition_function import ( AbstractAcquisitionFunction, ) -from smac.acquisition.maximizer.abstract_acqusition_maximizer import ( +from smac.acquisition.maximizer.abstract_acquisition_maximizer import ( AbstractAcquisitionMaximizer, ) from smac.callback.callback import Callback diff --git a/tests/test_acquisition/test_functions.py b/tests/test_acquisition/test_functions.py index 53c8b5f3a..57de28285 100644 --- a/tests/test_acquisition/test_functions.py +++ b/tests/test_acquisition/test_functions.py @@ -747,14 +747,14 @@ def test_ts_NxD(model, acq_ts): def test_ts_rng(): - """Test TS acqusition function with model that only has attribute 'rng'""" + """Test TS acquisition function with model that only has attribute 'rng'""" model = MockModelRNG() ts = TS() ts.model = model def test_ts_sampler(): - "Test TS acqusition function with model that only has attribute 'sample_functions'" + "Test TS acquisition function with model that only has attribute 'sample_functions'" model = MockModelSampler() ts = TS() ts.model = model From a6b1f710ccff88720e65ab9a7048e1ca135d76f2 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 18:20:48 +0200 Subject: [PATCH 49/64] Update CHANGELOG.md --- CHANGELOG.md | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index 0c20493be..cb21d03bf 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -1,3 +1,9 @@ +# 2.3.0 + +## Documentation +- Update windows install guide (#952) +- Migrate sphinx docs to mkdocs (#1154) + # 2.2.1 ## Improvements @@ -688,7 +694,7 @@ Since many urgent features were already taken care of in 0.14.0, this release ma conditions when starting multiple runs on a cluster. * MAINT #209: adds the seed or a pseudo-seed to the output directory name for better identifiability of the output directories. -* FIX #216: replace broken call to in EIPS acqusition function. +* FIX #216: replace broken call to in EIPS acquisition function. * MAINT: use codecov.io instead of coveralls.io. * MAINT: increase minimal required version of the ConfigSpace package to 0.3.2. 
From 5fc3d4a68a81ddbdd61f5fb98a8a83421c304808 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 18:21:16 +0200 Subject: [PATCH 50/64] docs(examples): flags to run when building (does not work yet) --- examples/1_basics/1_quadratic_function.py | 3 ++- examples/1_basics/2_svm_cv.py | 1 + examples/1_basics/3_ask_and_tell.py | 1 + examples/1_basics/4_callback.py | 1 + examples/1_basics/5_continue.py | 1 + examples/1_basics/6_priors.py | 1 + examples/1_basics/8_warmstart.py | 1 + examples/2_multi_fidelity/1_mlp_epochs.py | 1 + examples/2_multi_fidelity/2_sgd_datasets.py | 1 + examples/2_multi_fidelity/3_specify_HB_via_total_budget.py | 1 + examples/3_multi_objective/1_schaffer.py | 1 + examples/3_multi_objective/2_parego.py | 1 + examples/4_advanced_optimizer/4_intensify_crossvalidation.py | 1 + examples/5_commandline/1_call_target_function_script.py | 1 + 14 files changed, 15 insertions(+), 1 deletion(-) diff --git a/examples/1_basics/1_quadratic_function.py b/examples/1_basics/1_quadratic_function.py index 8a7744768..4c059dc5f 100644 --- a/examples/1_basics/1_quadratic_function.py +++ b/examples/1_basics/1_quadratic_function.py @@ -1,4 +1,5 @@ """Quadratic Function +# Flags: doc-Runnable An example of applying SMAC to optimize a quadratic function. @@ -12,7 +13,7 @@ from ConfigSpace import Configuration, ConfigurationSpace, Float from matplotlib import pyplot as plt -from smac import HyperparameterOptimizationFacade as HPOFacade +from smac.facade.hyperparameter_optimization_facade import HyperparameterOptimizationFacade as HPOFacade from smac import RunHistory, Scenario __copyright__ = "Copyright 2021, AutoML.org Freiburg-Hannover" diff --git a/examples/1_basics/2_svm_cv.py b/examples/1_basics/2_svm_cv.py index dae017055..65d676fe4 100644 --- a/examples/1_basics/2_svm_cv.py +++ b/examples/1_basics/2_svm_cv.py @@ -1,4 +1,5 @@ """Support Vector Machine with Cross-Validation +# Flags: doc-Runnable An example of optimizing a simple support vector machine on the IRIS dataset. We use the hyperparameter optimization facade, which uses a random forest as its surrogate model. It is able to diff --git a/examples/1_basics/3_ask_and_tell.py b/examples/1_basics/3_ask_and_tell.py index d99a6c31d..b66a7e19b 100644 --- a/examples/1_basics/3_ask_and_tell.py +++ b/examples/1_basics/3_ask_and_tell.py @@ -1,4 +1,5 @@ """Ask-and-Tell +# Flags: doc-Runnable This examples show how to use the Ask-and-Tell interface. """ diff --git a/examples/1_basics/4_callback.py b/examples/1_basics/4_callback.py index 00f21f571..252baaabc 100644 --- a/examples/1_basics/4_callback.py +++ b/examples/1_basics/4_callback.py @@ -1,4 +1,5 @@ """Custom Callback +# Flags: doc-Runnable Using callbacks is the easieast way to integrate custom code inside the Bayesian optimization loop. In this example, we disable SMAC's default logging option and use the custom callback to log the evaluated trials. diff --git a/examples/1_basics/5_continue.py b/examples/1_basics/5_continue.py index 7b0b237a1..eff67b232 100644 --- a/examples/1_basics/5_continue.py +++ b/examples/1_basics/5_continue.py @@ -1,4 +1,5 @@ """Continue an Optimization +# Flags: doc-Runnable SMAC can also be continued from a previous run. To do so, it reads in old files (derived from scenario's name, output_directory and seed) and sets the corresponding components. 
In this example, an optimization of a simple quadratic diff --git a/examples/1_basics/6_priors.py b/examples/1_basics/6_priors.py index 9f9fb2b9e..4af42444b 100644 --- a/examples/1_basics/6_priors.py +++ b/examples/1_basics/6_priors.py @@ -1,4 +1,5 @@ """User Priors over the Optimum +# Flags: doc-Runnable Example for optimizing a Multi-Layer Perceptron (MLP) setting priors over the optimum on the hyperparameters. These priors are derived from user knowledge (from previous runs on similar diff --git a/examples/1_basics/8_warmstart.py b/examples/1_basics/8_warmstart.py index a21ded585..bc6f29b23 100644 --- a/examples/1_basics/8_warmstart.py +++ b/examples/1_basics/8_warmstart.py @@ -1,4 +1,5 @@ """Warmstarting SMAC +# Flags: doc-Runnable With the ask and tell interface, we can support warmstarting SMAC. We can communicate rich information about the previous trials to SMAC using `TrialInfo` and `TrialValue` instances. diff --git a/examples/2_multi_fidelity/1_mlp_epochs.py b/examples/2_multi_fidelity/1_mlp_epochs.py index e3b5034bd..3ea7f30f4 100644 --- a/examples/2_multi_fidelity/1_mlp_epochs.py +++ b/examples/2_multi_fidelity/1_mlp_epochs.py @@ -1,4 +1,5 @@ """Multi-Layer Perceptron Using Multiple Epochs +# Flags: doc-Runnable Example for optimizing a Multi-Layer Perceptron (MLP) using multiple budgets. Since we want to take advantage of multi-fidelity, the ``MultiFidelityFacade`` is a good choice. By default, diff --git a/examples/2_multi_fidelity/2_sgd_datasets.py b/examples/2_multi_fidelity/2_sgd_datasets.py index 964626d20..814f53f44 100644 --- a/examples/2_multi_fidelity/2_sgd_datasets.py +++ b/examples/2_multi_fidelity/2_sgd_datasets.py @@ -1,4 +1,5 @@ """Stochastic Gradient Descent On Multiple Datasets +# Flags: doc-Runnable Example for optimizing a Multi-Layer Perceptron (MLP) across multiple (dataset) instances. diff --git a/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py b/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py index 2569c5893..89cc81141 100644 --- a/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py +++ b/examples/2_multi_fidelity/3_specify_HB_via_total_budget.py @@ -1,4 +1,5 @@ """Specify Number of Trials via a Total Budget in Hyperband +# Flags: doc-Runnable This example uses a dummy function but illustrates how to setup Hyperband if you want to specify a total optimization budget in terms of fidelity units. diff --git a/examples/3_multi_objective/1_schaffer.py b/examples/3_multi_objective/1_schaffer.py index 3a31b8b18..9adf5f683 100644 --- a/examples/3_multi_objective/1_schaffer.py +++ b/examples/3_multi_objective/1_schaffer.py @@ -1,4 +1,5 @@ """2D Schaffer Function with Objective Weights +# Flags: doc-Runnable A simple example on how to use multi-objective optimization is shown. The 2D Schaffer function is used. In the plot you can see that all points are on the Pareto front. However, since we set the objective weights, you can notice that diff --git a/examples/3_multi_objective/2_parego.py b/examples/3_multi_objective/2_parego.py index 9c7386b1d..8a407218b 100644 --- a/examples/3_multi_objective/2_parego.py +++ b/examples/3_multi_objective/2_parego.py @@ -1,4 +1,5 @@ """ParEGO +# Flags: doc-Runnable An example of how to use multi-objective optimization with ParEGO. 
Both accuracy and run-time are going to be optimized on the digits dataset using an MLP, and the configurations are shown in a plot, highlighting the best ones in diff --git a/examples/4_advanced_optimizer/4_intensify_crossvalidation.py b/examples/4_advanced_optimizer/4_intensify_crossvalidation.py index 513cdf43f..41267fc0c 100644 --- a/examples/4_advanced_optimizer/4_intensify_crossvalidation.py +++ b/examples/4_advanced_optimizer/4_intensify_crossvalidation.py @@ -1,4 +1,5 @@ """Speeding up Cross-Validation with Intensification +# Flags: doc-Runnable An example of optimizing a simple support vector machine on the digits dataset. In contrast to the [simple example](../../1_basics/2_svm_cv), in which all cross-validation folds are executed diff --git a/examples/5_commandline/1_call_target_function_script.py b/examples/5_commandline/1_call_target_function_script.py index 33a21b6c0..1ec31e893 100644 --- a/examples/5_commandline/1_call_target_function_script.py +++ b/examples/5_commandline/1_call_target_function_script.py @@ -1,4 +1,5 @@ """Call Target Function From Script +# Flags: doc-Runnable This simple example shows how to call a script with the following content: From 6e2f242e98c3362b0cc44ef581fd24f5e0f9a05d Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 18:21:24 +0200 Subject: [PATCH 51/64] Update Makefile --- Makefile | 26 ++++++++++++++++++++++---- 1 file changed, 22 insertions(+), 4 deletions(-) diff --git a/Makefile b/Makefile index 6e481dd06..aade064b3 100644 --- a/Makefile +++ b/Makefile @@ -86,9 +86,30 @@ format: format-black format-isort tests: $(PYTEST) ${TESTS_DIR} +# Launch the docs, executing code blocks and examples +docs-full: + $(PYTHON) -m webbrowser -t "http://127.0.0.1:8000/" + SMAC_DOC_RENDER_EXAMPLES=all \ + SMAC_DOCS_OFFLINE=true \ + SMAC_EXEC_DOCS=true \ + mkdocs serve --watch-theme + +# Launch the docs and execute code blocks +docs-code: + $(PYTHON) -m webbrowser -t "http://127.0.0.1:8000/" + SMAC_DOCS_OFFLINE=true \ + SMAC_EXEC_DOCS=true \ + SMAC_DOC_RENDER_EXAMPLES=false \ + mkdocs serve --watch-theme + +# Launch the docs but dont run code examples docs: $(PYTHON) -m webbrowser -t "http://127.0.0.1:8000/" - $(PYTHON) -m mkdocs serve --clean --watch-theme + SMAC_DOCS_OFFLINE=true \ + SMAC_EXEC_DOCS=false \ + SMAC_DOC_RENDER_EXAMPLES=false \ + mkdocs serve --watch-theme + # https://github.com/pawamoy/markdown-exec/issues/19 # Build a distribution in ./dist build: @@ -96,9 +117,6 @@ build: clean: clean-build clean-docs clean-data -clean-docs: - $(MAKE) -C ${DOCDIR} clean - clean-build: $(PYTHON) setup.py clean rm -rf ${DIST} From ba7e2c34f6772c80e07789d3394c00c187f82d62 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 24 Oct 2024 18:22:23 +0200 Subject: [PATCH 52/64] Update CHANGELOG.md --- CHANGELOG.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/CHANGELOG.md b/CHANGELOG.md index cb21d03bf..0cf8db879 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -2,7 +2,7 @@ ## Documentation - Update windows install guide (#952) -- Migrate sphinx docs to mkdocs (#1154) +- Migrate sphinx docs to mkdocs (#1155) # 2.2.1 From 32d5111135f4ea43f00b5731cf76fe629726aa2e Mon Sep 17 00:00:00 2001 From: benjamc Date: Tue, 29 Oct 2024 15:16:36 +0100 Subject: [PATCH 53/64] Fix links --- docs/2_package_overview.md | 18 +++++++++--------- docs/7_glossary.md | 2 +- 2 files changed, 10 insertions(+), 10 deletions(-) diff --git a/docs/2_package_overview.md b/docs/2_package_overview.md index 28d3dd512..4562fab99 100644 --- a/docs/2_package_overview.md +++ 
b/docs/2_package_overview.md @@ -1,30 +1,30 @@ # Package Overview -SMAC supports you in determining well-performing hyperparameter configurations for your algorithms. By being a robust and flexible framework for [Bayesian Optimization](#), SMAC can improve performance within a few function evaluations. It offers several entry points and pre-sets for typical use cases, such as optimizing hyperparameters, solving low dimensional continuous (artificial) global optimization problems and configuring algorithms to perform well across multiple problem [instances](#). +SMAC supports you in determining well-performing hyperparameter configurations for your algorithms. By being a robust and flexible framework for [Bayesian Optimization][BayesianOptimization], SMAC can improve performance within a few function evaluations. It offers several entry points and pre-sets for typical use cases, such as optimizing hyperparameters, solving low dimensional continuous (artificial) global optimization problems and configuring algorithms to perform well across multiple problem [instances][Instances]. ## Features SMAC has the following characteristics and capabilities: #### Global Optimizer -[Bayesian Optimization](#) is used for sample-efficient optimization. +[Bayesian Optimization][BayesianOptimization] is used for sample-efficient optimization. -#### Optimize [Black-Box](#) Functions +#### Optimize [Black-Box][Black-Box] Functions Optimization is only aware of input and output. It is agnostic to internals of the function. #### Flexible Hyperparameters Use categorical, continuous, hierarchical and/or conditional hyperparameters with the well-integrated [ConfigurationSpace](https://automl.github.io/ConfigSpace). SMAC can optimize *up to 100 hyperparameters* efficiently. #### Any Objectives -Optimization with any [objective](#) (e.g., accuracy, runtime, cross-validation, ...) is possible. +Optimization with any [objective][Objective] (e.g., accuracy, runtime, cross-validation, ...) is possible. -#### [Multi-Objective](#) Optimization +#### [Multi-Objective][Multi-Objective] Optimization Optimize arbitrary number of objectives using scalarized multi-objective algorithms. Both ParEGO [[Know06][Know06]] and mean aggregation strategies are supported. -#### [Multi-Fidelity](#) Optimization -Judge configurations on multiple [budgets](#) to discard unsuitable configurations early on. This will result in a massive speed-up, depending on the budgets. +#### [Multi-Fidelity][Multi-Fidelity] Optimization +Judge configurations on multiple [budgets][Budget] to discard unsuitable configurations early on. This will result in a massive speed-up, depending on the budgets. -#### [Instances](#) +#### [Instances][Instances] Find well-performing hyperparameter configurations not only for one instance (e.g. dataset) of an algorithm, but for many. #### Command-Line Interface @@ -37,7 +37,7 @@ SMAC can not only be executed within a python file but also from the command lin The following table provides an overview of SMAC's capabilities in comparison with other optimization tools. 
-| Package | Complex Hyperparameter Space | [Multi-Objective](#) | [Multi-Fidelity](#) | [Instances](#) | Command-Line Interface | Parallelism | +| Package | Complex Hyperparameter Space | [Multi-Objective][Multi-Objective] | [Multi-Fidelity][Multi-Fidelity] | [Instances][Instances] | Command-Line Interface | Parallelism | |--------------|------------------------------|----------------------|---------------------|----------------|------------------------|-------------| | HyperMapper | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ | | Optuna | ✅ | ✅ | ✅ | ❌ | ✅ | ✅ | diff --git a/docs/7_glossary.md b/docs/7_glossary.md index 721428ae1..c305a5596 100644 --- a/docs/7_glossary.md +++ b/docs/7_glossary.md @@ -14,7 +14,7 @@ - [](){#ROAR}**ROAR**: See `Random Online Adaptive Racing`. - [](){#SMAC}**SMAC**: Sequential Model-Based Algorithm Configuration. - [](){#SMBO}**SMBO**: Sequential Mode-Based Optimization. -- [](){#Bayesian Optimization}**Bayesian Optimization**: Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions. A Bayesian optimization weights exploration and exploitation to find the minimum of its objective. +- [](){#BayesianOptimization}**Bayesian Optimization**: Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions. A Bayesian optimization weights exploration and exploitation to find the minimum of its objective. - [](){#Black-Box}**Black-Box**: Refers to an algorithm being optimized, where only input and output are observable. - [](){#Budget}**Budget**: Budget is another word for fidelity. Examples are the number of training epochs or the size of the data subset the algorithm is trained on. However, budget can also be used in the context of instances. For example, if you have 100 instances (let's say we optimize across datasets) and you want to run your algorithm on 10 of them, then the budget is 10. - [](){#Hyperband}**Hyperband**: [Hyperband](https://arxiv.org/abs/1603.06560). A novel bandit-based algorithm for hyperparameter optimization. Hyperband is an extension of successive halving and therefore works with multi-fidelities. From a710875c66b0246392300092cb5ff9a5bc95d234 Mon Sep 17 00:00:00 2001 From: benjamc Date: Tue, 29 Oct 2024 15:17:33 +0100 Subject: [PATCH 54/64] Update github workflow for doc building --- .github/workflows/docs.yml | 92 +++++++------------------------------- 1 file changed, 15 insertions(+), 77 deletions(-) diff --git a/.github/workflows/docs.yml b/.github/workflows/docs.yml index 5f03e854f..d109fc6bc 100644 --- a/.github/workflows/docs.yml +++ b/.github/workflows/docs.yml @@ -1,89 +1,27 @@ +# This workflow is just to test that the docs build successfully. 
name: docs - +concurrency: + group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }} + cancel-in-progress: true on: - # Manual trigger option in GitHub - # This won't push to GitHub pages where docs are hosted due - # to the guarded if statement in those steps workflow_dispatch: - - # Trigger on push to these branches push: branches: - main - - development - - # Trigger on open/push to a PR targeting one of these branches pull_request: - types: - - opened - - synchronize - - reopened - - ready_for_review branches: - main - - development - -env: - name: SMAC3 - +permissions: + contents: write jobs: - build-and-deploy: - if: ${{ !github.event.pull_request.draft }} + build: runs-on: ubuntu-latest steps: - - name: Checkout - uses: actions/checkout@v4 - - - name: Setup Python - uses: actions/setup-python@v5 - with: - python-version: "3.10" - - - name: Install dependencies - id: install - run: | - pip install ".[gpytorch,dev]" - - # Getting the version - SMAC_VERSION=$(python -c "import smac; print('v' + str(smac.version));") - - # Make it a global variable - echo "SMAC_VERSION=$SMAC_VERSION" >> $GITHUB_ENV - - - name: Make docs - run: | - make clean - make docs - - - name: Pull latest gh-pages - if: (contains(github.ref, 'develop') || contains(github.ref, 'main')) && github.event_name == 'push' - run: | - cd .. - git clone https://github.com/${{ github.repository }}.git --branch gh-pages --single-branch gh-pages - - - name: Copy new docs into gh-pages - if: (contains(github.ref, 'develop') || contains(github.ref, 'main')) && github.event_name == 'push' - run: | - branch_name=${GITHUB_REF##*/} - cd ../gh-pages - rm -rf $branch_name - cp -r ../${{ env.name }}/docs/build/html $branch_name - - # we also copy the current SMAC_VERSION - rm -rf $SMAC_VERSION - cp -r ../${{ env.name }}/docs/build/html $SMAC_VERSION - - - - name: Push to gh-pages - if: (contains(github.ref, 'develop') || contains(github.ref, 'main')) && github.event_name == 'push' - run: | - last_commit=$(git log --pretty=format:"%an: %s") - cd ../gh-pages - branch_name=${GITHUB_REF##*/} - git add $branch_name/ - git add $SMAC_VERSION/ - git config --global user.name 'Github Actions' - git config --global user.email 'not@mail.com' - git remote set-url origin https://x-access-token:${{ secrets.GITHUB_TOKEN }}@github.com/${{ github.repository }} - git commit -am "$last_commit" - git push + - uses: actions/checkout@v4 + - uses: actions/setup-python@v5 + with: + python-version: "3.10" + - name: "Install dependancies" + run: python -m pip install -e ".[dev]" + - name: "Build Docs" + run: mkdocs build --clean --strict \ No newline at end of file From 55e91eb5e0c145c92839cb4d740306a8243f957b Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 10:55:12 +0100 Subject: [PATCH 55/64] docs(3_getting_started): fix links --- docs/3_getting_started.md | 16 ++++++++-------- 1 file changed, 8 insertions(+), 8 deletions(-) diff --git a/docs/3_getting_started.md b/docs/3_getting_started.md index 108af0545..2394d3912 100644 --- a/docs/3_getting_started.md +++ b/docs/3_getting_started.md @@ -93,22 +93,22 @@ which is easy to use and understand and without the need of deep diving into the invited to change the components to their specific hyperparameter optimization needs. 
The following table (horizontally scrollable) shows you what is supported and reveals the default [components][components]: -| | [Black-Box](smac.facade.blackbox_facade) | [Hyperparameter Optimization](smac.facade.hyperparameter_optimization_facade) | [Multi-Fidelity](smac.facade.multi_fidelity_facade) | [Algorithm Configuration](smac.facade.algorithm_configuration_facade) | [Random](smac.facade.random_facade) | [Hyperband](smac.facade.hyperband_facade) | +| | [Black-Box][smac.facade.blackbox_facade] | [Hyperparameter Optimization][smac.facade.hyperparameter_optimization_facade] | [Multi-Fidelity][smac.facade.multi_fidelity_facade] | [Algorithm Configuration][smac.facade.algorithm_configuration_facade] | [Random][smac.facade.random_facade] | [Hyperband][smac.facade.hyperband_facade] | | --- | --- | --- | --- | --- | --- | --- | | #Parameters | low | low/medium/high | low/medium/high | low/medium/high | low/medium/high | low/medium/high | | Supports Instances | ❌ | ✅ | ✅ | ✅ | ❌ | ✅ | | Supports Multi-Fidelity | ❌ | ❌ | ✅ | ✅ | ❌ | ✅ | -| Initial Design | [Sobol](smac.initial_design.sobol_design) | [Sobol](smac.initial_design.sobol_design) | [Random](smac.initial_design.random_design) | [Default](smac.initial_design.default_design) | [Default](smac.initial_design.default_design) | [Default](smac.initial_design.default_design) | -| Surrogate Model | [Gaussian Process](smac.model.gaussian_process.gaussian_process) | [Random Forest](smac.model.random_forest.random_forest) | [Random Forest](smac.model.random_forest.random_forest) | [Random Forest](smac.model.random_forest.random_forest) | Not used | Not used | -| Acquisition Function | [Expected Improvement](smac.acquisition.function.expected_improvement) | [Log Expected Improvement](smac.acquisition.function.expected_improvement) | [Log Expected Improvement](smac.acquisition.function.expected_improvement) | [Expected Improvement](smac.acquisition.function.expected_improvement) | Not used | Not used | -| Acquisition Maximizer | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.local_and_random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.random_search) | [Local and Sorted Random Search](smac.acquisition.maximizer.random_search) | -| Intensifier | [Default](smac.intensifier.intensifier) | [Default](smac.intensifier.intensifier) | [Hyperband](smac.intensifier.hyperband) | [Hyperband](smac.intensifier.hyperband) | [Default](smac.intensifier.intensifier) | [Hyperband](smac.intensifier.hyperband) | -| Runhistory Encoder | [Default](smac.runhistory.encoder.encoder) | [Log](smac.runhistory.encoder.log_encoder) | [Log](smac.runhistory.encoder.log_encoder) | [Default](smac.runhistory.encoder.encoder) | [Default](smac.runhistory.encoder.encoder) | [Default](smac.runhistory.encoder.encoder) | +| Initial Design | [Sobol][smac.initial_design.sobol_design] | [Sobol][smac.initial_design.sobol_design] | [Random][smac.initial_design.random_design] | [Default][smac.initial_design.default_design] | [Default][smac.initial_design.default_design] | [Default][smac.initial_design.default_design] | +| Surrogate Model | [Gaussian Process][smac.model.gaussian_process.gaussian_process] | [Random Forest][smac.model.random_forest.random_forest] | [Random 
Forest][smac.model.random_forest.random_forest] | [Random Forest][smac.model.random_forest.random_forest] | Not used | Not used | +| Acquisition Function | [Expected Improvement][smac.acquisition.function.expected_improvement] | [Log Expected Improvement][smac.acquisition.function.expected_improvement] | [Log Expected Improvement][smac.acquisition.function.expected_improvement] | [Expected Improvement][smac.acquisition.function.expected_improvement] | Not used | Not used | +| Acquisition Maximizer | [Local and Sorted Random Search][smac.acquisition.maximizer.local_and_random_search] | [Local and Sorted Random Search][smac.acquisition.maximizer.local_and_random_search] | [Local and Sorted Random Search][smac.acquisition.maximizer.local_and_random_search] | [Local and Sorted Random Search][smac.acquisition.maximizer.local_and_random_search] | [Local and Sorted Random Search][smac.acquisition.maximizer.random_search] | [Local and Sorted Random Search][smac.acquisition.maximizer.random_search] | +| Intensifier | [Default][smac.intensifier.intensifier] | [Default][smac.intensifier.intensifier] | [Hyperband][smac.intensifier.hyperband] | [Default][smac.intensifier.intensifier] | [Default][smac.intensifier.intensifier] | [Hyperband][smac.intensifier.hyperband] | +| Runhistory Encoder | [Default][smac.runhistory.encoder.encoder] | [Log][smac.runhistory.encoder.log_encoder] | [Log][smac.runhistory.encoder.log_encoder] | [Default][smac.runhistory.encoder.encoder] | [Default][smac.runhistory.encoder.encoder] | [Default][smac.runhistory.encoder.encoder] | | Random Design Probability | 8.5% | 20% | 20% | 50% | Not used | Not used | !!! info - The multi-fidelity facade is the closest implementation to `BOHB `_. + The multi-fidelity facade is the closest implementation to [BOHB](https://github.com/automl/HpBandSter). !!! note From 0c6e0a301fa2a9192240afeb06c95422c794f813 Mon Sep 17 00:00:00 2001 From: Lukas Fehring Date: Thu, 21 Nov 2024 12:02:42 +0100 Subject: [PATCH 56/64] Minor modifications --- docs/1_installation.md | 26 +++++++++++++++----------- docs/2_package_overview.md | 2 +- docs/3_getting_started.md | 10 +++++----- 3 files changed, 21 insertions(+), 17 deletions(-) diff --git a/docs/1_installation.md b/docs/1_installation.md index 056f0f5aa..d886d4b49 100644 --- a/docs/1_installation.md +++ b/docs/1_installation.md @@ -3,19 +3,14 @@ ## Requirements SMAC is written in python3 and therefore requires an environment with python>=3.8. -Furthermore, the Random Forest used in SMAC requires SWIG as a build dependency. Install it either in your -environment or on your system directly. The command to install swig on linux machines is the following: - -```bash -apt-get install swig -``` +Furthermore, the Random Forest used in SMAC requires SWIG as a build dependency. !!! info SMAC is tested on Linux and Mac machines with python >=3.8. -## Anaconda +## SetUp We recommend using Anaconda to create and activate an environment: @@ -24,13 +19,22 @@ conda create -n SMAC python=3.10 conda activate SMAC ``` -If you haven't installed swig yet, you can install it directly inside the Anaconda environment: +Now install swig either on the system level e.g. using the following command for Linux: +```bash +apt-get install swig +``` + +Or install swig inside of an already created conda environment using: ```bash conda install gxx_linux-64 gcc_linux-64 swig ``` -Now install SMAC via PyPI: +## Install SMAC +You can install SMAC either using PyPI or Conda-forge. 
+ +### PYPI +To install SMAC with PyPI call: ```bash pip install smac @@ -40,10 +44,10 @@ Or alternatively, clone the environment from GitHub directly: ```bash git clone https://github.com/automl/SMAC3.git && cd SMAC3 -pip install -e .[dev] +pip install -e ".[dev]" ``` -## Conda-forge +### Conda-forge Installing SMAC from the `conda-forge` channel can be achieved by adding `conda-forge` to your channels with: diff --git a/docs/2_package_overview.md b/docs/2_package_overview.md index 4562fab99..dcd7c0060 100644 --- a/docs/2_package_overview.md +++ b/docs/2_package_overview.md @@ -15,7 +15,7 @@ Optimization is only aware of input and output. It is agnostic to internals of t #### Flexible Hyperparameters Use categorical, continuous, hierarchical and/or conditional hyperparameters with the well-integrated [ConfigurationSpace](https://automl.github.io/ConfigSpace). SMAC can optimize *up to 100 hyperparameters* efficiently. -#### Any Objectives +#### Any [Objectives][Objective] Optimization with any [objective][Objective] (e.g., accuracy, runtime, cross-validation, ...) is possible. #### [Multi-Objective][Multi-Objective] Optimization diff --git a/docs/3_getting_started.md b/docs/3_getting_started.md index 2394d3912..3237db587 100644 --- a/docs/3_getting_started.md +++ b/docs/3_getting_started.md @@ -27,7 +27,7 @@ cs = ConfigurationSpace({ }) ``` -Please see the documentation of `ConfigSpace `_ for more details. +Please see the documentation of [ConfigurationSpace](https://automl.github.io/ConfigSpace) for more details. ## Target Function @@ -48,10 +48,6 @@ in an efficient way. return 1 - accuracy # SMAC always minimizes (the smaller the better) ``` -!!! warning - SMAC *always* minimizes the value returned from the target function. - - !!! note In general, the arguments of the target function depend on the intensifier. However, in all cases, the first argument must be the configuration (arbitrary argument name is possible here) and a seed. @@ -61,6 +57,10 @@ in an efficient way. used. +!!! warning + SMAC *always* minimizes the value returned from the target function. + + !!! warning SMAC passes either `instance` or `budget` to the target function but never both. From f85b4d4eb6e7ac757877b63e1367316cab9bacda Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:06:13 +0100 Subject: [PATCH 57/64] docs(constants): add docstring --- smac/constants.py | 1 + 1 file changed, 1 insertion(+) diff --git a/smac/constants.py b/smac/constants.py index 7db48ebe8..590f2d116 100644 --- a/smac/constants.py +++ b/smac/constants.py @@ -1,3 +1,4 @@ +"""Constants used in SMAC, e.g. maximum number of cutoffs, very small number, etc.""" __copyright__ = "Copyright 2022, automl.org" __license__ = "3-clause BSD" From ddb3fd145310f1839364eb20f576dd407b71f294 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:08:40 +0100 Subject: [PATCH 58/64] docs(multi_objective): fix broken link --- docs/advanced_usage/3_multi_objective.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/advanced_usage/3_multi_objective.md b/docs/advanced_usage/3_multi_objective.md index 0771412b7..eea381962 100644 --- a/docs/advanced_usage/3_multi_objective.md +++ b/docs/advanced_usage/3_multi_objective.md @@ -37,4 +37,4 @@ use the method ``get_incumbents`` in the intensifier. incumbents = smac.intensifier.get_incumbents() ``` -We show an example of how to use multi-objective with plots in our [examples](../../examples/3%20Multi-Objective/1_schaffer). 
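 To make the multi-objective snippet above (from `docs/advanced_usage/3_multi_objective.md`) concrete, here is a hedged sketch of a two-objective setup; the objective names, the search space, and relying on the facade's default aggregation strategy are assumptions for illustration: ```python from ConfigSpace import ConfigurationSpace from smac import HyperparameterOptimizationFacade, Scenario def target(config, seed: int = 0): x = config["x"] # Return one cost per objective; the keys must match the objectives declared in the scenario. return {"cost1": x**2, "cost2": (x - 2.0) ** 2} configspace = ConfigurationSpace({"x": (-5.0, 5.0)}) scenario = Scenario(configspace, objectives=["cost1", "cost2"], n_trials=50) smac = HyperparameterOptimizationFacade(scenario, target) smac.optimize() # With multiple objectives, several non-dominated configurations can be kept as incumbents. incumbents = smac.intensifier.get_incumbents() print(incumbents) ``` 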
+We show an example of how to use multi-objective with plots in our [examples](../examples/3%20Multi-Objective/1_schaffer.html). From 9915c2ac5bcf830391fadd6f6a80ae1ccd766895 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:10:18 +0100 Subject: [PATCH 59/64] docs(ask_and_tell): fix broken link --- docs/advanced_usage/5_ask_and_tell.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/advanced_usage/5_ask_and_tell.md b/docs/advanced_usage/5_ask_and_tell.md index dfd135b04..498ecacef 100644 --- a/docs/advanced_usage/5_ask_and_tell.md +++ b/docs/advanced_usage/5_ask_and_tell.md @@ -16,4 +16,4 @@ and report the results of the trial. into the intensification process. -Please have a look at our [ask-and-tell example](../../examples/1%20Basics/3_ask_and_tell). +Please have a look at our [ask-and-tell example](../examples/1%20Basics/3_ask_and_tell.html). From 5ccd816d9169d9a46873077a097c4e6bb8a9082d Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:11:18 +0100 Subject: [PATCH 60/64] docs(parallelism.md): fix broken link --- docs/advanced_usage/9_parallelism.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/advanced_usage/9_parallelism.md b/docs/advanced_usage/9_parallelism.md index 8d54f3aa3..1c3633c92 100644 --- a/docs/advanced_usage/9_parallelism.md +++ b/docs/advanced_usage/9_parallelism.md @@ -23,7 +23,7 @@ SMAC supports multiple workers natively via Dask. Just specify ``n_workers`` in ## Running on a Cluster You can also pass a custom dask client, e.g. to run on a slurm cluster. -See our [parallelism example](../../examples/1%20Basics/7_parallelization_cluster). +See our [parallelism example](../examples/1%20Basics/7_parallelization_cluster.html). !!! warning From 3a1bf19d0667f63d336a7c7a2e3e36396c318df3 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:12:02 +0100 Subject: [PATCH 61/64] docs(continue): fix broken link --- docs/advanced_usage/10_continue.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/advanced_usage/10_continue.md b/docs/advanced_usage/10_continue.md index f16848995..b58b91c8f 100644 --- a/docs/advanced_usage/10_continue.md +++ b/docs/advanced_usage/10_continue.md @@ -19,4 +19,4 @@ The behavior can be controlled by setting the parameter ``overwrite`` in the fac and the old run is not affected. -Please have a look at our [continue example](../../examples/1%20Basics/5_continue). \ No newline at end of file +Please have a look at our [continue example](../examples/1%20Basics/5_continue.html). \ No newline at end of file From 418a282f7613b1eb5e639f1ddcaf9fcb5504cf22 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:15:06 +0100 Subject: [PATCH 62/64] delete .rst --- docs/advanced_usage/index.rst | 29 ----------------------------- 1 file changed, 29 deletions(-) delete mode 100644 docs/advanced_usage/index.rst diff --git a/docs/advanced_usage/index.rst b/docs/advanced_usage/index.rst deleted file mode 100644 index 87feb3b8a..000000000 --- a/docs/advanced_usage/index.rst +++ /dev/null @@ -1,29 +0,0 @@ -Advanced Usage -============== - -In this chapter, we will discuss some more advanced usage of the library. If you want to customize -SMAC to your needs, we strongly recommend reading all pages. Since SMAC holds a lot of complex dependencies, -we can not guarantee that your customization will work. However, we can give you some hints on how SMAC -reacts to certain things. - - -Navigation ----------- - -.. 
toctree:: - :maxdepth: 2 - - 1_components - 2_multi_fidelity - 3_multi_objective - 4_instances - 5_ask_and_tell - 5.1_warmstarting - 6_commandline - 7_stopping_criteria - 8_logging - 9_parallelism - 10_continue - 11_reproducibility - 12_optimizations - From acc6adc264e3d8cc7f80674aaad6fec3762da6eb Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 12:16:50 +0100 Subject: [PATCH 63/64] Fix sidebar navigatoon --- docs/advanced_usage/1_components.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/advanced_usage/1_components.md b/docs/advanced_usage/1_components.md index 25aec5f25..6b6c5aa67 100644 --- a/docs/advanced_usage/1_components.md +++ b/docs/advanced_usage/1_components.md @@ -1,5 +1,5 @@ -[](){#components} # Components +[](){#components} In addition to the basic components mentioned in [Getting Started][getting_started], all other components are explained in the following paragraphs to give a better picture of SMAC. These components are all used to guide From bf92c001d14e68c1cac050bb94fb349a73e716f4 Mon Sep 17 00:00:00 2001 From: benjamc Date: Thu, 21 Nov 2024 13:47:25 +0100 Subject: [PATCH 64/64] update CHANGELOG.md --- CHANGELOG.md | 1 + 1 file changed, 1 insertion(+) diff --git a/CHANGELOG.md b/CHANGELOG.md index 0cf8db879..db009bf97 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -2,6 +2,7 @@ ## Documentation - Update windows install guide (#952) +- Correct intensifier for Algorithm Configuration Facade (#1162, #1165) - Migrate sphinx docs to mkdocs (#1155) # 2.2.1
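 Finally, the ask-and-tell interface referenced in the patch for `docs/advanced_usage/5_ask_and_tell.md` can be driven roughly as follows; this is a sketch with a made-up search space and cost, not the actual example from the repository: ```python from ConfigSpace import ConfigurationSpace from smac import HyperparameterOptimizationFacade, Scenario from smac.runhistory.dataclasses import TrialValue def dummy(config, seed: int = 0) -> float: # Not called here: with ask-and-tell, the trials are evaluated outside of SMAC. return 0.0 scenario = Scenario(ConfigurationSpace({"x": (-5.0, 5.0)}), n_trials=20) smac = HyperparameterOptimizationFacade(scenario, dummy, overwrite=True) for _ in range(20): info = smac.ask() # next trial: configuration, seed, possibly budget/instance cost = (info.config["x"] - 1.0) ** 2 # evaluate the trial ourselves smac.tell(info, TrialValue(cost=cost)) # report the result so it enters the intensification process print(smac.intensifier.get_incumbents()) ``` 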