This project aims to explore and compare the different methods of solving optimal control problems.
This project will focus on apply different optimal control methods on different types of systems with the goal.
The optimal control methods implemented in this project are categorised based on how much information about the system is available as Planning Methods which require complete information on the system, Model-Based Learning which requires some information on the system, and Model-Free Learning which requires almost no information on the system.
The systems modeled in this project are also categorised based on how much information about the system is available as Deterministic systems, and Stochastic Systems.
The objective of this project is to implement an optimal control strategy for different classes of optimal control problems
- Introduction to Optimal Control Problems
- Optimal Control Methods
- Implementation of Solutions
The code for this project will be written in Python with Jupyter Notebook and will be divided into the following sections:
- Optimal Control Methods: This section will contain the implementation of the different optimal control methods.
- System Models: This section will contain the implementation of the different system models.
- Case Studies: This section will contain the implementation of the different case studies.
- Tests: This section will contain the code to test each System Model, and Optimal Control Method
- Tools: This section will contain scripts to automate tasks in the project.
The code is tested with Python 3.12.1
with the following packages:
Package Version
---------- -------
numpy 1.26.4
pip 24.0
torch 2.3.1
Model-based Reinforcement Learning: A Survey.(https://arxiv.org/pdf/2006.16712)
- Complete testing for DDPG Algorithm
- Hyperparametric optimization of DDPG solution of the particle in a field problem
- Complete implementation of the MDP algorithm
- Testing the MDP algorithm
- Use the MDP algorithm to solve the the particle in a field problem
- Benchmark the performance of these algorithms