GitHub - ActurialCapital/blocks: Extra blocks for scikit-learn features.

Overview
Open Source
Code
CI/CD
Downloads

Table of Contents

About The Project
- Introduction
- Built With
Installation
Testing
Getting Started
Contributing
License

About The Project

Introduction

blocks is a package designed to extend the functionality of scikit-learn by providing additional blocks for creating custom pipelines, easy-to-use base transformers, and useful decorators. This package aims to simplify the process of building and managing machine learning workflows in Python.

The current version of the package offers:

Custom Pipelines: Easily create and manage custom pipelines
Base Transformers and Samplers: A collection of base transformers and samplers to streamline feature transformation
Decorators: Handy decorators to simplify repetitive tasks

(back to top)

Built With

scikit-learn = "^1.5.0"
imbalanced-learn = "^0.12.3"
pandas = "^2.2.2"
numpy = "^1.26.4"

(back to top)

Installation

The easiest way to install blocks is via PyPI:

pip install python-blocks

Or via poetry:

poetry add python-blocks

Testing

To run the test suite after installation, follow these steps from the source directory. First, install pytest version 8.2.2:

pip install pytest==8.2.2

Then run pytest as follow:

pytest tests

Alternatively, if you are using poetry, execute:

poetry run pytest

For more information, visit our Codecov page.

(back to top)

Getting Started

Pipeline

Callback function that logs information in between each intermediate step
Access particular named step data
Inherites from imblearn pipeline, which works with both transformers and samplers

Dataset

>>> from sklearn.datasets import make_regression
>>> X, y = make_regression(n_samples=1000, n_features=10, random_state=42)

Model with both recorded and logged callbacks

>>> from sklearn.preprocessing import StandardScaler
>>> from sklearn.linear_model import LinearRegression
>>> from sklego.meta import EstimatorTransformer
>>> from blocks import BlockPipeline, custom_log_callback
>>> 
>>> pipe = BlockPipeline([
...   ("scaler", StandardScaler()),
...   ("regression", EstimatorTransformer(LinearRegression()))
... ],
...   record="scaler",
...   log_callback=custom_log_callback
... )

Logs

>>> pipe.fit(df, y)
# [custom_log_callback:78] - [scaler][StandardScaler()] shape=(1000, 10) time=0s

Records

>>> predicted = pipe.transform(df)
>>> pipe.name_record
# 'scaler'
>>> pipe.record
# array([[ ...

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement". Don't forget to give the project a star! Thanks again!

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

we also recommend to have a look at project-template.

project-template is a template project for scikit-learn compatible extensions. It aids development of estimators that can be used in scikit-learn pipelines and (hyper)parameter search, while facilitating testing (including some API compliance), documentation, open source development, packaging, and continuous integration.

Refer to the Official Documentation to modify the template for your own scikit-learn contribution.

(back to top)

License

Distributed under the BSD-3 License. See LICENSE.txt for more information.

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
blocks		blocks
docs		docs
examples		examples
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About The Project

Introduction

Built With

Installation

Testing

Getting Started

Pipeline

Dataset

Model with both recorded and logged callbacks

Logs

Records

Contributing

License

About

Releases 10

Packages

Languages

License

ActurialCapital/blocks

Folders and files

Latest commit

History

Repository files navigation

About The Project

Introduction

Built With

Installation

Testing

Getting Started

Pipeline

Dataset

Model with both recorded and logged callbacks

Logs

Records

Contributing

License

About

Resources

License

Stars

Watchers

Forks

Releases 10

Packages 0

Languages

Packages