From c8de1c72c2137ac9190e5163098b26ca41e697b3 Mon Sep 17 00:00:00 2001 From: Mark Stephenson Date: Fri, 27 Sep 2024 02:19:54 -0600 Subject: [PATCH] Add BSK-RL environment; fix indents on bulleted lists (#1178) --- docs/environments/third_party_environments.md | 224 +++++++++--------- 1 file changed, 116 insertions(+), 108 deletions(-) diff --git a/docs/environments/third_party_environments.md b/docs/environments/third_party_environments.md index 68ea42184..0db8ebe6d 100644 --- a/docs/environments/third_party_environments.md +++ b/docs/environments/third_party_environments.md @@ -30,10 +30,10 @@ goal-RL ([Gymnasium-Robotics](https://robotics.farama.org/)). @@ -42,28 +42,28 @@ A short 2 sentence description. - [gym-electric-motor: Gym environments for electric motor simulations](https://github.com/upb-lea/gym-electric-motor) -An environment for simulating a wide variety of electric drives taking into account different types of electric motors and converters. + An environment for simulating a wide variety of electric drives taking into account different types of electric motors and converters. - [racecar_gym: Miniature racecar env using PyBullet](https://github.com/axelbr/racecar_gym/) -A gym environment for a miniature racecar using the [PyBullet](https://github.com/bulletphysics/bullet3) physics engine. + A gym environment for a miniature racecar using the [PyBullet](https://github.com/bulletphysics/bullet3) physics engine. - [sumo-rl: Reinforcement Learning using SUMO traffic simulator](https://github.com/LucasAlegre/sumo-rl) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.3-blue) -![GitHub stars](https://img.shields.io/github/stars/LucasAlegre/sumo-rl) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.3-blue) + ![GitHub stars](https://img.shields.io/github/stars/LucasAlegre/sumo-rl) -Gymnasium wrapper for various environments in the SUMO traffic simulator. 
Supports both single and multiagent settings (using [pettingzoo](https://pettingzoo.farama.org/)). + Gymnasium wrapper for various environments in the SUMO traffic simulator. Supports both single and multiagent settings (using [pettingzoo](https://pettingzoo.farama.org/)). ### Biological / Medical environments *Interacting with Biological Systems.* - [ICU-Sepsis: A Benchmark MDP Built from Real Medical Data](https://github.com/icu-sepsis/icu-sepsis) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/icu-sepsis/icu-sepsis) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/icu-sepsis/icu-sepsis) -ICU-Sepsis is a tabular reinforcement learning environment that simulates the treatment of sepsis in an intensive care unit (ICU). Introduced in the paper [ICU-Sepsis: A Benchmark MDP Built from Real Medical Data](https://arxiv.org/abs/2406.05646), the environment is lightweight and easy to use, yet challenging for most reinforcement learning algorithms. + ICU-Sepsis is a tabular reinforcement learning environment that simulates the treatment of sepsis in an intensive care unit (ICU). Introduced in the paper [ICU-Sepsis: A Benchmark MDP Built from Real Medical Data](https://arxiv.org/abs/2406.05646), the environment is lightweight and easy to use, yet challenging for most reinforcement learning algorithms. ### Economic / Financial environments @@ -71,80 +71,80 @@ ICU-Sepsis is a tabular reinforcement learning environment that simulates the tr - [gym-anytrading: Financial trading environments for FOREX and STOCKS](https://github.com/AminHP/gym-anytrading) -AnyTrading is a collection of Gym environments for reinforcement learning-based trading algorithms with a great focus on simplicity, flexibility, and comprehensiveness. 
+ AnyTrading is a collection of Gym environments for reinforcement learning-based trading algorithms with a great focus on simplicity, flexibility, and comprehensiveness. - [gym-mtsim: Financial trading for MetaTrader 5 platform](https://github.com/AminHP/gym-mtsim) -MtSim is a simulator for the [MetaTrader 5](https://www.metatrader5.com/) trading platform for reinforcement learning-based trading algorithms. + MtSim is a simulator for the [MetaTrader 5](https://www.metatrader5.com/) trading platform for reinforcement learning-based trading algorithms. - [gym-trading-env: Trading Environment](https://gym-trading-env.readthedocs.io/) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![Github stars](https://img.shields.io/github/stars/ClementPerroud/Gym-Trading-Env) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![Github stars](https://img.shields.io/github/stars/ClementPerroud/Gym-Trading-Env) -Gym Trading Env simulates stock (or crypto) market from historical data. It was designed to be fast and easily customizable. + Gym Trading Env simulates a stock (or crypto) market from historical data. It was designed to be fast and easily customizable. ### Electrical / Energy environments *Manage the flow of Electrons.* - [EV2Gym: A Realistic EV-V2G-Gym Simulator for EV Smart Charging](https://github.com/StavrosOrf/EV2Gym) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) -![GitHub stars](https://img.shields.io/github/stars/StavrosOrf/EV2Gym) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/StavrosOrf/EV2Gym) -EV2Gym is a fully customizable and easily configurable environment for Electric Vehicle (EV) smart charging simulations on a small and large scale.
Also, includes non-RL baseline implementations such as mathematical programming, model predictive control, and heuristics. + EV2Gym is a fully customizable and easily configurable environment for Electric Vehicle (EV) smart charging simulations on a small and large scale. It also includes non-RL baseline implementations such as mathematical programming, model predictive control, and heuristics. ### Game environments *Board Games, Video Games and all other interactive entertainment media.* - [Craftium: an extensible framework for creating RL environments](https://github.com/mikelma/craftium) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) -![GitHub stars](https://img.shields.io/github/stars/mikelma/craftium) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/mikelma/craftium) -Craftium wraps the [Minetest](https://www.minetest.net/) game engine into the Gymnasium API, providing a modern and easy-to-use platform for designing Minecraft-like RL environments. + Craftium wraps the [Minetest](https://www.minetest.net/) game engine into the Gymnasium API, providing a modern and easy-to-use platform for designing Minecraft-like RL environments. - [flappy-bird-env](https://github.com/robertoschiavone/flappy-bird-env) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/robertoschiavone/flappy-bird-env) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/robertoschiavone/flappy-bird-env) -Flappy Bird as a Farama Gymnasium environment. + Flappy Bird as a Farama Gymnasium environment.
- [flappy-bird-gymnasium: A Flappy Bird environment for Gymnasium](https://github.com/markub3327/flappy-bird-gymnasium) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) -![GitHub stars](https://img.shields.io/github/stars/markub3327/flappy-bird-gymnasium) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/markub3327/flappy-bird-gymnasium) -A simple environment for single-agent reinforcement learning algorithms on a clone of [Flappy Bird](https://en.wikipedia.org/wiki/Flappy_Bird), the hugely popular arcade-style mobile game. Both state and pixel observation environments are available. + A simple environment for single-agent reinforcement learning algorithms on a clone of [Flappy Bird](https://en.wikipedia.org/wiki/Flappy_Bird), the hugely popular arcade-style mobile game. Both state and pixel observation environments are available. - [pystk2-gymnasium: SuperTuxKart races gymnasium wrapper](https://github.com/bpiwowar/pystk2-gymnasium) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) -![GitHub stars](https://img.shields.io/github/stars/bpiwowar/pystk2-gymnasium) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/bpiwowar/pystk2-gymnasium) -Uses a [python wrapper](https://github.com/bpiwowar/pystk2) around [SuperTuxKart](https://supertuxkart.net/fr/Main_Page) that allows to access the world state and to control a race. + Uses a [python wrapper](https://github.com/bpiwowar/pystk2) around [SuperTuxKart](https://supertuxkart.net/fr/Main_Page) that allows access to the world state and control of a race.
- [QWOP: An environment for Bennett Foddy's game QWOP](https://github.com/smanolloff/qwop-gym) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) -![GitHub stars](https://img.shields.io/github/stars/smanolloff/qwop-gym) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/smanolloff/qwop-gym) -QWOP is a game about running extremely fast down a 100 meter track. With this Gymnasium environment you can train your own agents and try to beat the current world record (5.0 in-game seconds for humans and 4.7 for AI). + QWOP is a game about running extremely fast down a 100-meter track. With this Gymnasium environment, you can train your own agents and try to beat the current world record (5.0 in-game seconds for humans and 4.7 for AI). - [Tetris Gymnasium: A fully configurable Gymnasium compatible Tetris environment](https://github.com/Max-We/Tetris-Gymnasium) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/Max-We/Tetris-Gymnasium) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/Max-We/Tetris-Gymnasium) -Tetris Gymnasium is a clean implementation of Tetris as a Gymnasium environment. It can be extensively customized (e.g. board dimensions, gravity, ...), is well documented and includes many examples on how to use it e.g. by providing training scripts. + Tetris Gymnasium is a clean implementation of Tetris as a Gymnasium environment. It can be extensively customized (e.g. board dimensions, gravity, ...), is well documented, and includes many examples of how to use it, e.g. training scripts.
- [tmrl: TrackMania 2020 through RL](https://github.com/trackmania-rl/tmrl/) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) -![GitHub stars](https://img.shields.io/github/stars/trackmania-rl/tmrl) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/trackmania-rl/tmrl) -tmrl is a distributed framework for training Deep Reinforcement Learning AIs in real-time applications. It is demonstrated on the TrackMania 2020 video game. + tmrl is a distributed framework for training Deep Reinforcement Learning AIs in real-time applications. It is demonstrated on the TrackMania 2020 video game. ### Mathematics / Computational @@ -152,130 +152,138 @@ tmrl is a distributed framework for training Deep Reinforcement Learning AIs in - [spark-sched-sim: Environments for scheduling DAG jobs in Apache Spark](https://github.com/ArchieGertsman/spark-sched-sim) -spark-sched-sim simulates Spark clusters for RL-based job scheduling algorithms. Spark jobs are encoded as directed acyclic graphs (DAGs), providing opportunities to experiment with graph neural networks (GNN's) in the RL context. + spark-sched-sim simulates Spark clusters for RL-based job scheduling algorithms. Spark jobs are encoded as directed acyclic graphs (DAGs), providing opportunities to experiment with graph neural networks (GNNs) in the RL context.
-![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) -![GitHub stars](https://img.shields.io/github/stars/ArchieGertsman/spark-sched-sim) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/ArchieGertsman/spark-sched-sim) - [gym-saturation: Environments used to prove theorems](https://github.com/inpefess/gym-saturation) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) -![GitHub stars](https://img.shields.io/github/stars/inpefess/gym-saturation) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/inpefess/gym-saturation) -An environment for guiding automated theorem provers based on saturation algorithms (e.g. [Vampire](https://github.com/vprover/vampire)). + An environment for guiding automated theorem provers based on saturation algorithms (e.g. [Vampire](https://github.com/vprover/vampire)). ### Robotics environments *Autonomous Robots.* + +- [BSK-RL: Environments for Spacecraft Planning and Scheduling](https://avslab.github.io/bsk_rl/) + + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/AVSLab/bsk_rl) + + BSK-RL is a Python package for constructing Gymnasium environments for spacecraft tasking problems. It is built on top of [Basilisk](http://hanspeterschaub.info/basilisk/), a modular and fast spacecraft simulation framework, making the simulation environments high-fidelity and computationally efficient. 
BSK-RL also includes a collection of utilities and examples for working with these environments. + - [Connect-4-gym: An environment for practicing self-play](https://github.com/lucasBertola/Connect-4-Gym-env-Reinforcement-learning) -Connect-4-Gym is an environment designed for creating AIs that learn by playing against themselves and assigning them an Elo rating. This environment can be used to train and evaluate reinforcement learning agents on the classic board game Connect Four. + Connect-4-Gym is an environment designed for creating AIs that learn by playing against themselves and assigning them an Elo rating. This environment can be used to train and evaluate reinforcement learning agents on the classic board game Connect Four. - [FlyCraft: A Fixed-wing UAV Environment](https://github.com/GongXudong/fly-craft) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/GongXudong/fly-craft) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/GongXudong/fly-craft) -FlyCraft is a Gymnasium environment for fixed-wing UAV tasks. By default, FlyCraft provides two tasks: attitude control and velocity vector control. These tasks are characterized by their multi-goal and long-horizon nature, posing significant challenges for RL exploration. Additionally, the rewards can be configured as either Markovian or non-Markovian, making FlyCraft suitable for research on non-Markovian problems. + FlyCraft is a Gymnasium environment for fixed-wing UAV tasks. By default, FlyCraft provides two tasks: attitude control and velocity vector control. These tasks are characterized by their multi-goal and long-horizon nature, posing significant challenges for RL exploration. Additionally, the rewards can be configured as either Markovian or non-Markovian, making FlyCraft suitable for research on non-Markovian problems.
- [gymnax: Hardware Accelerated RL Environments](https://github.com/RobertTLange/gymnax/) -RL Environments in JAX which allows for highly vectorised environments with support for a number of environments, Gym, MinAtari, bsuite and more. + RL environments in JAX which allow for highly vectorised execution, with support for Gym, MinAtar, bsuite and more. - [gym-jiminy: Training Robots in Jiminy](https://github.com/duburcqa/jiminy) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.0-blue) -![GitHub stars](https://img.shields.io/github/stars/duburcqa/jiminy) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.0-blue) + ![GitHub stars](https://img.shields.io/github/stars/duburcqa/jiminy) -gym-jiminy presents an extension of the initial Gym for robotics using [Jiminy](https://github.com/duburcqa/jiminy), an extremely fast and light-weight simulator for poly-articulated systems using Pinocchio for physics evaluation and Meshcat for web-based 3D rendering. + gym-jiminy presents an extension of the initial Gym for robotics using [Jiminy](https://github.com/duburcqa/jiminy), an extremely fast and light-weight simulator for poly-articulated systems using Pinocchio for physics evaluation and Meshcat for web-based 3D rendering.
- [gym-pybullet-drones: Environments for quadcopter control](https://github.com/JacopoPan/gym-pybullet-drones) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) -![GitHub stars](https://img.shields.io/github/stars/JacopoPan/gym-pybullet-drones) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/JacopoPan/gym-pybullet-drones) -A simple environment using [PyBullet](https://github.com/bulletphysics/bullet3) to simulate the dynamics of a [Bitcraze Crazyflie 2.x](https://www.bitcraze.io/documentation/hardware/crazyflie_2_1/crazyflie_2_1-datasheet.pdf) nanoquadrotor. + A simple environment using [PyBullet](https://github.com/bulletphysics/bullet3) to simulate the dynamics of a [Bitcraze Crazyflie 2.x](https://www.bitcraze.io/documentation/hardware/crazyflie_2_1/crazyflie_2_1-datasheet.pdf) nanoquadrotor. - [OmniIsaacGymEnvs: Gym environments for NVIDIA Omniverse Isaac ](https://github.com/NVIDIA-Omniverse/OmniIsaacGymEnvs/) -Reinforcement Learning Environments for [Omniverse Isaac simulator](https://docs.omniverse.nvidia.com/app_isaacsim/app_isaacsim/overview.html). + Reinforcement Learning Environments for [Omniverse Isaac simulator](https://docs.omniverse.nvidia.com/app_isaacsim/app_isaacsim/overview.html). - [panda-gym: Robotics environments using the PyBullet physics engine](https://github.com/qgallouedec/panda-gym/) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.3-blue) -![GitHub stars](https://img.shields.io/github/stars/qgallouedec/panda-gym) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.3-blue) + ![GitHub stars](https://img.shields.io/github/stars/qgallouedec/panda-gym) -PyBullet based simulations of a robotic arm moving objects. + PyBullet based simulations of a robotic arm moving objects. 
- [PyFlyt: UAV Flight Simulator Environments for Reinforcement Learning Research](https://jjshoots.github.io/PyFlyt/index.html#) -A library for testing reinforcement learning algorithms on various UAVs. -It is built on the [Bullet](https://github.com/bulletphysics/bullet3) physics engine, offers flexible rendering options, time-discrete steppable physics, Python bindings, and support for custom drones of any configuration, be it biplanes, quadcopters, rockets, and anything you can think of. + A library for testing reinforcement learning algorithms on various UAVs. + It is built on the [Bullet](https://github.com/bulletphysics/bullet3) physics engine, offers flexible rendering options, time-discrete steppable physics, Python bindings, and support for custom drones of any configuration, be it biplanes, quadcopters, rockets, and anything you can think of. - [safe-control-gym: Evaluate safety of RL algorithms](https://github.com/utiasDSL/safe-control-gym) -Evaluate safety, robustness and generalization via PyBullet based CartPole and Quadrotor environments—with [CasADi](https://web.casadi.org) (symbolic) *a priori* dynamics and constraints. + Evaluate safety, robustness and generalization via PyBullet based CartPole and Quadrotor environments—with [CasADi](https://web.casadi.org) (symbolic) *a priori* dynamics and constraints. - [Safety-Gymnasium: Ensuring safety in real-world RL scenarios](https://github.com/PKU-MARL/safety-gymnasium) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/PKU-MARL/safety-gymnasium) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/PKU-MARL/safety-gymnasium) -Highly scalable and customizable Safe Reinforcement Learning library. + Highly scalable and customizable Safe Reinforcement Learning library. 
### Telecommunication Systems environments *Interact and/or manage wireless and/or wired telecommunication systems.* - [mobile-env: Environments for coordination of wireless mobile networks](https://github.com/stefanbschneider/mobile-env) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/stefanbschneider/mobile-env) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/stefanbschneider/mobile-env) -An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks. + An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks. ### Other - [Buffalo-Gym: Multi-Armed Bandit Gymnasium](https://github.com/foreverska/buffalo-gym) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) -![GitHub stars](https://img.shields.io/github/stars/foreverska/buffalo-gym) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.29.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/foreverska/buffalo-gym) -Buffalo-Gym is a Multi-Armed Bandit (MAB) gymnasium built primarily to assist in debugging RL implementations. MABs are often easy to reason about what the agent is learning and whether it is correct. Buffalo-gym encompasses Bandits, Contextual bandits, and contextual bandits with aliasing. + Buffalo-Gym is a Multi-Armed Bandit (MAB) gymnasium built primarily to assist in debugging RL implementations. With MABs, it is often easy to reason about what the agent is learning and whether it is correct. Buffalo-Gym encompasses bandits, contextual bandits, and contextual bandits with aliasing.
- [CARL: context adaptive RL](https://github.com/automl/CARL) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) -![GitHub stars](https://img.shields.io/github/stars/automl/carl) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.27.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/automl/carl) -Contextual extensions of popular reinforcement learning environments that enable training and test distributions for generalization, e.g. CartPole with variable pole lengths or Brax robots with different ground frictions. + Contextual extensions of popular reinforcement learning environments that enable training and test distributions for generalization, e.g. CartPole with variable pole lengths or Brax robots with different ground frictions. - [DACBench: a benchmark for Dynamic Algorithm Configuration](https://github.com/automl/DACBench) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.3-blue) -![GitHub stars](https://img.shields.io/github/stars/automl/DACBench) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.3-blue) + ![GitHub stars](https://img.shields.io/github/stars/automl/DACBench) -A benchmark library for [Dynamic Algorithm Configuration](https://www.automl.org/dynamic-algorithm-configuration/). Its focus is on reproducibility and comparability of different DAC methods as well as easy analysis of the optimization process. + A benchmark library for [Dynamic Algorithm Configuration](https://www.automl.org/dynamic-algorithm-configuration/). Its focus is on reproducibility and comparability of different DAC methods as well as easy analysis of the optimization process. 
- [gym-cellular-automata: Cellular Automata environments](https://github.com/elbecerrasoto/gym-cellular-automata) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/elbecerrasoto/gym-cellular-automata) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/elbecerrasoto/gym-cellular-automata) -Environments where the agent interacts with _Cellular Automata_ by changing its cell states. + Environments where the agent interacts with _Cellular Automata_ by changing its cell states. - [matrix-mdp: Easily create discrete MDPs](https://github.com/Paul-543NA/matrix-mdp-gym) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.2-blue) -![GitHub stars](https://img.shields.io/github/stars/Paul-543NA/matrix-mdp-gym) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.26.2-blue) + ![GitHub stars](https://img.shields.io/github/stars/Paul-543NA/matrix-mdp-gym) -An environment to easily implement discrete MDPs as gym environments. Turn a set of matrices (`P_0(s)`, `P(s'| s, a)` and `R(s', s, a)`) into a gym environment that represents the discrete MDP ruled by these dynamics. + An environment to easily implement discrete MDPs as gym environments. Turn a set of matrices (`P_0(s)`, `P(s'| s, a)` and `R(s', s, a)`) into a gym environment that represents the discrete MDP ruled by these dynamics. 
- [SimpleGrid: a simple grid environment for Gymnasium](https://github.com/damat-le/gym-simplegrid) -![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) -![GitHub stars](https://img.shields.io/github/stars/damat-le/gym-simplegrid) + ![Gymnasium version dependency](https://img.shields.io/badge/Gymnasium-v0.28.1-blue) + ![GitHub stars](https://img.shields.io/github/stars/damat-le/gym-simplegrid) -SimpleGrid is a super simple and minimal grid environment for Gymnasium. It is easy to use and customise and it is intended to offer an environment for rapidly testing and prototyping different RL algorithms. + SimpleGrid is a super simple and minimal grid environment for Gymnasium. It is easy to use and customise, and is intended to offer an environment for rapidly testing and prototyping different RL algorithms. ## Third-Party Environments using Gym @@ -286,87 +294,87 @@ Many of these can be adapted to work with gymnasium (see [Compatibility with Gym - [gym-derk: GPU accelerated MOBA environment](https://gym.derkgame.com/) -A 3v3 MOBA environment where you train creatures to fight each other. + A 3v3 MOBA environment where you train creatures to fight each other. - [SlimeVolleyGym: A simple environment for Slime Volleyball game](https://github.com/hardmaru/slimevolleygym) -A simple environment for benchmarking single and multi-agent reinforcement learning algorithms on a clone of Slime Volleyball game. + A simple environment for benchmarking single and multi-agent reinforcement learning algorithms on a clone of the Slime Volleyball game. - [Unity ML Agents: Environments for Unity game engine](https://github.com/Unity-Technologies/ml-agents) -Gym (and PettingZoo) wrappers for arbitrary and premade environments with the Unity game engine. + Gym (and PettingZoo) wrappers for arbitrary and premade environments with the Unity game engine.
- [PGE: Parallel Game Engine](https://github.com/222464/PGE) -Uses The [Open 3D Engine](https://www.o3de.org/) for AI simulations and can interoperate with the Gym. Uses [PyBullet](https://github.com/bulletphysics/bullet3) physics. + Uses the [Open 3D Engine](https://www.o3de.org/) for AI simulations and can interoperate with Gym. Uses [PyBullet](https://github.com/bulletphysics/bullet3) physics. ### Robotics environments - [MarsExplorer: Environments for controlling a robot on Mars](https://github.com/dimikout3/MarsExplorer) -Mars Explorer is a Gym compatible environment designed and developed as an initial endeavor to bridge the gap between powerful Deep Reinforcement Learning methodologies and the problem of exploration/coverage of an unknown terrain. + Mars Explorer is a Gym-compatible environment designed and developed as an initial endeavor to bridge the gap between powerful Deep Reinforcement Learning methodologies and the problem of exploration/coverage of an unknown terrain. - [robo-gym: Real-world and simulation robotics](https://github.com/jr-robotics/robo-gym) -Robo-gym provides a collection of reinforcement learning environments involving robotic tasks applicable in both simulation and real-world robotics. + Robo-gym provides a collection of reinforcement learning environments involving robotic tasks applicable in both simulation and real-world robotics. - [Offworld-gym: Control real robots remotely for free](https://github.com/offworld-projects/offworld-gym) -Gym environments that let you control real robots in a laboratory via the internet. + Gym environments that let you control real robots in a laboratory via the internet. - [gym-softrobot: Soft-robotics environments](https://github.com/skim0119/gym-softrobot/) -A large-scale benchmark for co-optimizing the design and control of soft robots. + A large-scale benchmark for co-optimizing the design and control of soft robots.
- [iGibson: Photorealistic and interactive robotics environments](https://github.com/StanfordVL/iGibson/) -A simulation environment with high-quality realistic scenes, with interactive physics using [PyBullet](https://github.com/bulletphysics/bullet3). + A simulation environment with high-quality realistic scenes, with interactive physics using [PyBullet](https://github.com/bulletphysics/bullet3). - [DexterousHands: Dual dexterous hand manipulation tasks](https://github.com/PKU-MARL/DexterousHands/) -This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym. + This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym. ### Autonomous Driving environments - [gym-duckietown: Lane-following for self-driving cars](https://github.com/duckietown/gym-duckietown) -A lane-following simulator built for the [Duckietown](http://duckietown.org/) project (small-scale self-driving car course). + A lane-following simulator built for the [Duckietown](http://duckietown.org/) project (small-scale self-driving car course). - [CommonRoad-RL: Motion planning for traffic scenarios ](https://commonroad.in.tum.de/tools/commonroad-rl) -A Gym for solving motion planning problems for various traffic scenarios compatible with [CommonRoad benchmarks](https://commonroad.in.tum.de/scenarios), which provides configurable rewards, action spaces, and observation spaces. + A Gym for solving motion planning problems for various traffic scenarios compatible with [CommonRoad benchmarks](https://commonroad.in.tum.de/scenarios), which provides configurable rewards, action spaces, and observation spaces. - [racing_dreamer: Latent imagination in autonomous racing](https://github.com/CPS-TUWien/racing_dreamer/) -Train a model-based RL agent in simulation and, without finetuning, transfer it to small-scale race cars. + Train a model-based RL agent in simulation and, without finetuning, transfer it to small-scale race cars. 
- [l2r: Multimodal control environment where agents learn how to race](https://github.com/learn-to-race/l2r/) -An open-source reinforcement learning environment for autonomous racing. + An open-source reinforcement learning environment for autonomous racing. ### Other environments - [CompilerGym: Optimise compiler tasks](https://github.com/facebookresearch/CompilerGym) -Reinforcement learning environments for compiler optimization tasks, such as LLVM phase ordering, GCC flag tuning, and CUDA loop nest code generation. + Reinforcement learning environments for compiler optimization tasks, such as LLVM phase ordering, GCC flag tuning, and CUDA loop nest code generation. - [gym-sokoban: 2D Transportation Puzzles](https://github.com/mpSchrader/gym-sokoban) -The environment consists of transportation puzzles in which the player's goal is to push all boxes to the warehouse's storage locations. + The environment consists of transportation puzzles in which the player's goal is to push all boxes to the warehouse's storage locations. - [NLPGym: A toolkit to develop RL agents to solve NLP tasks](https://github.com/rajcscw/nlp-gym) -[NLPGym](https://arxiv.org/pdf/2011.08272v1.pdf) provides interactive environments for standard NLP tasks such as sequence tagging, question answering, and sequence classification. + [NLPGym](https://arxiv.org/pdf/2011.08272v1.pdf) provides interactive environments for standard NLP tasks such as sequence tagging, question answering, and sequence classification. 
- [ShinRL: Environments for evaluating RL algorithms](https://github.com/omron-sinicx/ShinRL/) -ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021) + ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives (Deep RL Workshop 2021) - [openmodelica-microgrid-gym: Environments for controlling power electronic converters in microgrids](https://github.com/upb-lea/openmodelica-microgrid-gym) -The OpenModelica Microgrid Gym (OMG) package is a software toolbox for the simulation and control optimization of microgrids based on energy conversion by power electronic converters. + The OpenModelica Microgrid Gym (OMG) package is a software toolbox for the simulation and control optimization of microgrids based on energy conversion by power electronic converters. - [GymFC: A flight control tuning and training framework](https://github.com/wil3/gymfc/) -GymFC is a modular framework for synthesizing neuro-flight controllers. Has been used to generate policies for the world's first open-source neural network flight control firmware [Neuroflight](https://github.com/wil3/neuroflight). + GymFC is a modular framework for synthesizing neuro-flight controllers. It has been used to generate policies for the world's first open-source neural network flight control firmware [Neuroflight](https://github.com/wil3/neuroflight).