 ____    _____  _____  ____   __   __
|  _ \  / ____||_   _||  _ \  \ \ / /
| |_) || (___    | |  | |_) |  \ V /
|  __/  \___ \   | |  |  __/    > <
| |     ____) | _| |_ | |      / /
|_|    |_____/ |_____||_|     /_/
The psipy library is a private collection of machine learning algorithms and tools developed and used by PSIORI. It is designed to be a modular and extensible framework for building and deploying software solutions that incorporate or are based on machine learning components. This public version of the library is a subset of the full private library and currently includes only large parts of the Reinforcement Learning (RL) module. It may be expanded to include additional modules in the future.
To install the psipy library, you can use pip. First, ensure you have Python 3.8 or later installed on your system.
We suggest creating a virtual environment for any project that uses psipy, as well as when working on psipy itself. You can use the following commands to create and activate one:
python3.8 -m venv .venv
source .venv/bin/activate
Then, you can install psipy directly from the GitHub repository:
python --version # make sure python 3.8 is used!
git clone git@github.com:psiori/psipy-public.git
pip install -e "./psipy-public[dev,gym]"
The option '-e' is used to install the package in editable mode, which allows you to make changes to the code and have them reflected in the installed package without having to reinstall. Skip this option if you do not plan on making changes to the code of psipy itself.
The extras '[dev,gym]' install additional dependencies for the development environment (including pytest and jupyter) as well as the Gym environments together with their own dependencies. Please be aware that we switched from OpenAI's gym to the Farama Foundation's fork, Gymnasium, when they took over maintenance of the original library.
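If you installed the gym extra, the following quick sanity check (a minimal sketch using Gymnasium's standard API; the CartPole-v1 environment ships with Gymnasium) confirms the environment dependencies work end to end:

import gymnasium as gym

# Create a bundled environment and take a few random steps to verify
# that the gym extra installed correctly.
env = gym.make("CartPole-v1")
obs, info = env.reset(seed=0)
for _ in range(10):
    action = env.action_space.sample()
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()
env.close()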
To get started, we suggest having a look at
- examples/rl/simulated_cartpole/nfqs_psiori_cartpole_minimal.py
which will learn to swing up and balance a simulated cart-pole system.
The script demonstrates roughly the minimal code needed to learn a policy on the simulated cartpole system that swings up, stabilizes and balances the pole from scratch within 80 to 140 episodes. It uses our NFQ variant that has the actions encoded in the input layer (named "NFQs"). It can be run, after activating the environment above, as follows:
cd examples/rl/simulated_cartpole/
python3 nfqs_psiori_cartpole_minimal.py
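To give an intuition for the "actions encoded in the input layer" idea (independent of psipy's actual classes, which you can inspect in the script itself): the Q-function is a single network q(s, a) whose input is the state concatenated with an action encoding, so the Q-values for a discrete action set are obtained by evaluating the network once per candidate action. A minimal numpy sketch, where q_inputs is a hypothetical helper name:

import numpy as np

def q_inputs(state, actions):
    # Hypothetical helper, for illustration only: build one network
    # input row per candidate action by appending the action to the state.
    return np.stack([np.concatenate([state, np.atleast_1d(a)]) for a in actions])

state = np.array([0.1, -0.3, 0.02, 0.0])  # a cart-pole-like observation
actions = [-1.0, 0.0, 1.0]                # discrete force choices
print(q_inputs(state, actions).shape)     # (3, 5): feed all rows to the Q-network, pick the best action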
A slightly longer version, which can be run using
cd examples/rl/simulated_cartpole/
python3 nfqs_psiori_cartpole.py
also demonstrates saving and loading models, running evaluations separately from the training runs, and automatically creating different plots while the experiment is running.
Please be aware that not all tutorials have been ported from the internal to the public version of psipy yet. In particular, import paths are likely to be wrong.
You can also explore the provided tutorials. One of the best ways to familiarize yourself with the library is by running the batch tutorial Jupyter notebook. Here's how you can do that:
- Navigate to the tutorials directory in your terminal:
  cd psipy-public/examples/rl/tutorials
- Launch Jupyter Notebook:
  jupyter notebook
- In the Jupyter interface that opens in your web browser, locate and click on the "batch_learning_tutorials.ipynb" file to open it.
- You can now run the cells in the notebook to see how psipy works with batch reinforcement learning tasks.
This tutorial will guide you through the basics of using psipy for reinforcement learning tasks, specifically focusing on batch learning scenarios.
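For context before diving in: batch reinforcement learning learns from a fixed set of recorded transitions instead of ongoing interaction, and the classic scheme behind NFQ is fitted Q iteration, which repeatedly regresses Q-targets computed from the stored batch. The following self-contained sketch illustrates that scheme on a toy problem; it assumes numpy and scikit-learn are available (scikit-learn merely stands in for the neural Q-function here and is not necessarily a psipy dependency):

import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)

# Toy chain MDP: states 0..9, actions -1/+1, reward 1.0 for reaching
# the absorbing goal state 9. It stands in for any recorded plant data.
def step(s, a):
    s2 = int(np.clip(s + a, 0, 9))
    return s2, float(s2 == 9)

# A fixed set of random transitions -- the "batch" in batch RL.
data = []
for _ in range(2000):
    s = int(rng.integers(0, 9))  # do not start in the goal state
    a = int(rng.choice([-1, 1]))
    s2, r = step(s, a)
    data.append((s, a, r, s2))

X = np.array([(s, a) for s, a, r, s2 in data], dtype=float)  # (state, action) inputs
R = np.array([r for s, a, r, s2 in data])
S2 = np.array([s2 for s, a, r, s2 in data], dtype=float)
done = (S2 == 9).astype(float)  # the goal state is absorbing

gamma = 0.9
q = MLPRegressor(hidden_layer_sizes=(32,), max_iter=1000, random_state=0)
q.fit(X, R)  # initial fit on immediate rewards only

for _ in range(20):  # fitted Q iteration: re-fit against bootstrapped targets
    q_next = np.maximum(
        q.predict(np.column_stack([S2, np.full_like(S2, -1.0)])),
        q.predict(np.column_stack([S2, np.full_like(S2, 1.0)])),
    )
    q.fit(X, R + gamma * (1.0 - done) * q_next)

# The greedy policy should now (approximately) walk right towards the goal:
for s in range(9):
    best = max((-1, 1), key=lambda a: q.predict(np.array([[s, a]], dtype=float))[0])
    print(s, "->", best)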
Further examples, including plain Python scripts outside of Jupyter notebooks, can be found in the examples folder.
The documentation can be built locally using make doc.
With psipy being the core library for most internal Python projects, it can sometimes be hard to keep track of the latest changes. To avoid that, we aim for short-lived branches, small pull requests and frequent merges to develop.
- No project-specific release branches.
- Feature branches are kept small and are frequently merged to develop.
- Releases (specific versions published to GitHub and used in projects or concrete use cases) are always a tagged version off of main.
Sticking to those principles will make changes to the shared codebase frequent, but small. Projects under active development should frequently, and therefore easily, update to the latest psipy develop state without the need for major refactors. Projects need to plan for the time needed for such merges. The idea behind those principles is to avoid long-lived feature branches which become hard to merge at some point, or never get merged at all.
If a project requires new psipy features after some time of inactivity, it needs to update to the latest head. While this might produce some overhead in that project, it also keeps the shared codebase easier to maintain and faster to advance.
Code documentation should live close to the code to keep it maintained. Usage examples are, in the best case, doctests and therefore both runnable and executed by pytest.
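For example, a docstring like the following (a generic illustration, not code taken from psipy) is both documentation and a test; pytest collects it when run with --doctest-modules or with doctests enabled in the project configuration:

def scale(values, factor):
    """Scale all values by a constant factor.

    >>> scale([1.0, 2.0, 3.0], 2.0)
    [2.0, 4.0, 6.0]
    """
    return [v * factor for v in values]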
psipy provides pre-commit hooks for developers. After having installed psipy, run pre-commit install to set up the git hook scripts. From then on, flake8, mypy and black (checks only) will be run on all staged files before every commit, as well as checks for trailing whitespace, newlines at the end of files and large file sizes. Hooks are configured in .pre-commit-config.yaml.
In case you actively want to ignore all hooks, use git commit --no-verify. For ignoring only specific hooks, you can use the SKIP environment variable, e.g. SKIP=flake8 git commit -m "foo".
The code is licensed under the BSD 3-Clause License. See the LICENSE file for more details.