Citation Request

EvoRBF	Evolving Radial Basis Function Network
Free software	GNU General Public License (GPL) V3 license
Traditional RBF models	`RbfRegressor`, `RbfClassifier`
Advanced RBF models	`AdvancedRbfRegressor`, `AdvancedRbfClassifier`
Nature-inspired RBF models	`NiaRbfRegressor`, `NiaRbfClassifier`
Tuner for traditional RBF models	`NiaRbfTuner`
Provided total ML models	> 400 Models
Supported total metrics	>= 67 (47 regressions and 20 classifications)
Supported loss functions	>= 61 (45 regressions and 16 classifications)
Documentation	https://evorbf.readthedocs.io
Python versions	>= 3.8.x
Dependencies	numpy, scipy, scikit-learn, pandas, mealpy, permetrics

Citation Request

@software{thieu_2024_11136008,
  author       = {Nguyen Van Thieu},
  title        = {EvoRBF: A Nature-inspired Algorithmic Framework for Evolving Radial Basis Function Networks},
  month        = may,
  year         = 2024,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.11136007},
  url          = {https://doi.org/10.5281/zenodo.11136007}
}

@article{van2023mealpy,
  title={MEALPY: An open-source library for latest meta-heuristic algorithms in Python},
  author={Van Thieu, Nguyen and Mirjalili, Seyedali},
  journal={Journal of Systems Architecture},
  year={2023},
  publisher={Elsevier},
  doi={10.1016/j.sysarc.2023.102871}
}

Theory

EvoRBF is mind-blowing framework for Radial Basis Function (RBF) networks. We explain several keys components and provide several types of RBF networks that you will never see in other places.

You can read several papers by using Google Scholar search. There are many ways we can use Nature-inspired Algorithms to optimize Radial Basis Function network, for example, you can read this paper. Here we will walk through some basic concepts and parameters that matter to this network.

Structure

RBF has input, single-hidden, and output layer.
RBF is considered has only hidden-output weights. The output layer is linear combination.
The output of hidden layer is calculate by radial function (e.g, Gaussian function, Thin spline,...)

Training algorithm

Traditional RBF models

In case of traditional RBF model. There are a few parameters need to identify to get the best model.

1. The number of hidden nodes in hidden layer
2. The centers and widths (sigmas) of Gaussian function
3. The output weights
4. The regularization factor (lambda) L2

To train their parameters,

1. Using hyper-parameter tuning model such as GridSearchCV or RandomizedSearchCV to get the best hidden nodes
2. The centers can be calculated by Random or KMeans or unsupervised learning algorithms
3. The widths (sigmas) can be computed by hyper-parameter tuning process.
   + Width can be a single value that represent all hidden nodes has the same curve of Gaussian function
   + Width can be multiple values that each hidden node has a different value.
4. The output weights can be calculated by Moore-Penrose inverse (Matrix multiplication). Do not use Gradient Descent.
5. When setting regularization L2. lambda can be computed by hyper-parameter tuning process.

Example,

from evorbf import RbfRegressor, RbfClassifier

model = RbfClassifier(size_hidden=10, center_finder="kmeans", sigmas=2.0, reg_lambda=0.1, seed=None)
model = RbfRegressor(size_hidden=4, center_finder="random", sigmas=(1.5, 2, 2, 2.5), reg_lambda=0, seed=42)

model.fit(X=X_train, y=y_train)
y_pred = model.predict(X_test)
y_pred_prob = model.predict_proba(X_test)

Advanced RBF models

In case of advanced RBF model. User can have so many different options.

1. Choice different RBF kernel function such as Multiquadric (MQ), Inverse Multiquadric (IMQ), Thin Plate Spline (TPS), Exponential, Power,...
2. Choice different unsupervised learning algorithms to calculate the centers, and may be the number of hidden nodes.
   + For example, KMeans, or random algorithms, you need to set up the number of hidden nodes.
   + But, for MeanShift or DBSCAN algorithms, you don't need to set that value. They can automatically identify the number of cluters (number of hidden nodes).
3. This version may have the bias in output layer.

Examples,

from evorbf import AdvancedRbfClassifier, AdvancedRbfRegressor

model = AdvancedRbfClassifier(center_finder="random", finder_params={"n_centers": 15},
                 rbf_kernel="gaussian", kernel_params={"sigma": 1.5},
                 reg_lambda=0.1, has_bias=True, seed=42)

model = AdvancedRbfClassifier(center_finder="random", finder_params=None,        # Default n_centers = 10
                 rbf_kernel="gaussian", kernel_params=None,                     # Default sigma = 1.0
                 reg_lambda=0.1, has_bias=False, seed=42)

model = AdvancedRbfClassifier(center_finder="kmeans", finder_params={"n_centers": 20},
                 rbf_kernel="multiquadric", kernel_params=None,
                 reg_lambda=0.1, has_bias=False, seed=42)

model = AdvancedRbfClassifier(center_finder="meanshift", finder_params={"bandwidth": 0.6},      # Give us 28 hidden nodes
                 rbf_kernel="inverse_multiquadric", kernel_params={"sigma": 1.5},
                 reg_lambda=0.5, has_bias=True, seed=42)

model = AdvancedRbfClassifier(center_finder="dbscan", finder_params={"eps": 0.2},      # Give us 42 hidden nodes
                 rbf_kernel="multiquadric", kernel_params={"sigma": 1.5},
                 reg_lambda=0.5, has_bias=True, seed=42)

model = AdvancedRbfClassifier(center_finder="dbscan", finder_params={"eps": 0.175},      # Give us 16 hidden nodes
                 rbf_kernel="multiquadric", kernel_params={"sigma": 1.5},
                 reg_lambda=None, has_bias=False, seed=42)

model.fit(X=X_train, y=y_train)
y_pred = model.predict(X_test)
y_pred_prob = model.predict_proba(X_test)

Nature-inspired Algorithm-based RBF models

This is the main purpose of this library. In this type of models,

1. We use Nature-inspired Algorithm (NIA) to train widths (sigmas) value for each hidden node.
2. If you set up the Regularization technique, then NIA is automatically calculated the lambda factor

Examples,

from evorbf import NiaRbfRegressor, NiaRbfClassifier

model = NiaRbfClassifier(size_hidden=25, center_finder="kmeans", 
                         regularization=False, obj_name="F1S",
                         optim="OriginalWOA", 
                         optim_paras={"epoch": 50, "pop_size": 20}, 
                         verbose=True, seed=42)

model = NiaRbfRegressor(size_hidden=10, center_finder="random", 
                         regularization=True, obj_name="AS",
                         optim="BaseGA", 
                         optim_paras={"epoch": 50, "pop_size": 20}, 
                         verbose=True, seed=42)

model.fit(X=X_train, y=y_train)
y_pred = model.predict(X_test)
y_pred_prob = model.predict_proba(X_test)

Nature-inspired Algorithm-based hyperparameter RBF tuning model

In this case, user can use NIA to tune hyper-parameters of traditional RBF models.

from evorbf import NiaRbfTuner, IntegerVar, StringVar, FloatVar

# Design the boundary (for hyper-parameters)
my_bounds = [
    IntegerVar(lb=5, ub=21, name="size_hidden"),
    StringVar(valid_sets=("kmeans", "random"), name="center_finder"),
    FloatVar(lb=(0.01,), ub=(3.0,), name="sigmas"),
    FloatVar(lb=(0, ), ub=(1.0, ), name="reg_lambda"),
]

model = NiaRbfTuner(problem_type="classification", bounds=my_bounds, cv=3, scoring="AS",
                    optim="OriginalWOA", optim_paras={"epoch": 10, "pop_size": 20}, 
                    verbose=True, seed=42)

My notes

RBF needs to train the centers, and widths of Gaussian activation function. (This is 1st phase)
RBF usually use KMeans to find centers ==> Increase the complexity and time.
- In that case, user need to define widths ==> Can use 1 single width or each hidden with different width.
- Or RBF use random to find centers ==> Not good to split samples to different clusters.
RBF needs to train the output weights. (This is 2nd phase)
RBF do not use Gradient descent to calculate output weights, it used Moore–Penrose inverse (matrix multiplication, least square method) ==> so it is faster than MLP network.
Moore-Penrose inverse can find the exact solution ==> So we don't have to use Gradient Descent or Approximation algorithm here.
In case of overfitting, what can we do with this network ==> We add L2 regularization method.
If you have large-scale dataset ==> Set more hidden nodes ==> Then increase the L2 regularization parameter.

1. RbfRegressor, RbfClassifier: You need to set up 4 types of hyper-parameters.
2. AdvancedRbfRegressor, AdvancedRbfClassifier: You need to set up 6 types of hyper-parameters. 
   But you have many option to choice, and you can design your own RBF models, a new one that nobody has used it before.
   For example, RBF that has bias in output layer or RBF that use DBSCAN and Exponential kernel function. 
3. NiaRbfRegressor, NiaRbfClassifier: You need to set up the hidden size. However, these are best classes in this library.
   + The sigmas are automatically calculated for each hidden nodes.
   + The reguarlization factor is also automatically tuned to fine the best one.
4. NiaRbfTuner. This class also extremely useful for traditional RBF models, it can tune hidden size, however, 
   there is only 1 sigma value will be presented all hidden nodes.

Usage

Install the current PyPI release:

$ pip install evorbf

After installation, you can check EvoRBF version:

$ python
>>> import evorbf
>>> evorbf.__version__

We have provided above several ways to import and call the proposed classes. If you need more details how to use each of them, please check out the folder examples. In this short demonstration, we will use Whale Optimization Algorithm to optimize the sigmas (in non-linear Gaussian kernel) and reg_lambda of L2 regularization in RBF network (WOA-RBF model) for Diabetes prediction problem.

import numpy as np
from evorbf import Data, NiaRbfRegressor
from sklearn.datasets import load_diabetes

## Load data object
# total samples = 442, total features = 10
X, y = load_diabetes(return_X_y=True)
data = Data(X, y)

## Split train and test
data.split_train_test(test_size=0.2, random_state=2)
print(data.X_train.shape, data.X_test.shape)

## Scaling dataset
data.X_train, scaler_X = data.scale(data.X_train, scaling_methods=("standard"))
data.X_test = scaler_X.transform(data.X_test)

data.y_train, scaler_y = data.scale(data.y_train, scaling_methods=("standard", ))
data.y_test = scaler_y.transform(np.reshape(data.y_test, (-1, 1)))

## Create model
opt_paras = {"name": "WOA", "epoch": 50, "pop_size": 20}
model = NiaRbfRegressor(size_hidden=25,             # Set up big enough hidden size 
                        center_finder="kmeans",     # Use KMeans to find the centers
                        regularization=True,        # Use L2 regularization 
                        obj_name="MSE",             # Mean squared error as fitness function for NIAs
                        optim="OriginalWOA",        # Use Whale Optimization
                        optim_paras={"epoch": 50, "pop_size": 20},  # Set up parameter for Whale Optimization
                        verbose=True, seed=42)

## Train the model
model.fit(data.X_train, data.y_train)

## Test the model
y_pred = model.predict(data.X_test)

print(model.optimizer.g_best.solution)
## Calculate some metrics
print(model.score(X=data.X_test, y=data.y_test))
print(model.scores(X=data.X_test, y=data.y_test, list_metrics=["R2", "R", "KGE", "MAPE"]))
print(model.evaluate(y_true=data.y_test, y_pred=y_pred, list_metrics=["MSE", "RMSE", "R2S", "NSE", "KGE", "MAPE"]))

Please go check out the examples folder. You'll be surprised by what this library can do for your problem. You can also read the documentation for more detailed installation instructions, explanations, and examples.

Official Links (Get support for questions and answers)

Developed by: Thieu @ 2024

Name	Name	Last commit message	Last commit date
Latest commit thieu1995 Update ChangeLog Dec 1, 2024 421a845 · Dec 1, 2024 History 92 Commits
.github/workflows	.github/workflows	Update workflows	Nov 30, 2024
docs	docs	Update docs	Dec 1, 2024
evorbf	evorbf	Update docstring	Dec 1, 2024
examples	examples	Update examples for rbf_tuner module	Nov 30, 2024
tests	tests	Fix bug tests with higher python version	Nov 30, 2024
.flake8	.flake8	Update Manifest	May 7, 2024
.gitignore	.gitignore	Add ChangeLog	May 11, 2024
.readthedocs.yaml	.readthedocs.yaml	Add config for docs	May 8, 2024
CITATION.cff	CITATION.cff	Update id new version	Dec 1, 2024
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Add Code of Conduct and License	Aug 8, 2023
ChangeLog.md	ChangeLog.md	Update ChangeLog	Dec 1, 2024
LICENSE	LICENSE	Add Code of Conduct and License	Aug 8, 2023
MANIFEST.in	MANIFEST.in	Update Manifest	May 7, 2024
README.md	README.md	Update README	Dec 1, 2024
requirements.txt	requirements.txt	Add requirements	May 7, 2024
setup.py	setup.py	Update id new version	Dec 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Citation Request

Theory

Structure

Training algorithm

Traditional RBF models

Advanced RBF models

Nature-inspired Algorithm-based RBF models

Nature-inspired Algorithm-based hyperparameter RBF tuning model

My notes

Usage

Official Links (Get support for questions and answers)

About

Releases 4

Packages

Contributors 3

Languages

License

thieu1995/EvoRBF

Folders and files

Latest commit

History

Repository files navigation

Citation Request

Theory

Structure

Training algorithm

Traditional RBF models

Advanced RBF models

Nature-inspired Algorithm-based RBF models

Nature-inspired Algorithm-based hyperparameter RBF tuning model

My notes

Usage

Official Links (Get support for questions and answers)

About

Topics

Resources

License

Code of conduct

Citation

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 3

Languages

Packages