Slope is a small automatic differentiation (AD) engine focused on machine learning (ML), supporting forward-mode, reverse-mode, and higher-order AD.

This project is designed to be a small, hackable, and educational AD engine focused on ML, yet able to go end-to-end from training to deployment, rather than just running simple toy examples.

Tensor semantics are similar to PyTorch, the functional API is similar to JAX, and the tensor operator code is heavily derived from tinygrad.
# Example

```python
import slope

def f(x):
    y = x * 2.0
    return y.sum()

x = slope.tensor([1., 2., 3.])
gf_x = slope.grad(f)(x)
print(f"{gf_x=}")
```

```
gf_x=<Tensor: val=
[2. 2. 2.]
shape=(3,), dtype=float32, device='cpu:0'>
```
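To illustrate what `grad` is doing conceptually, here is a minimal, self-contained reverse-mode AD sketch in plain Python. This is not slope's actual implementation, just the core idea: record each operation on a tape, then replay the tape backwards to accumulate gradients.

```python
# Minimal reverse-mode AD sketch: a tape of backward closures records each
# op; walking the tape in reverse accumulates d(output)/d(input).
class Var:
    def __init__(self, val, tape=None):
        self.val, self.grad = val, 0.0
        self.tape = tape if tape is not None else []

    def __mul__(self, c):  # scalar-constant multiply, enough for f(x) = x * 2
        out = Var(self.val * c, self.tape)
        # d(out)/d(self) = c, so propagate c * out.grad back to self
        self.tape.append(lambda: setattr(self, "grad", self.grad + c * out.grad))
        return out

def grad_sum_f(f, xs):
    """Gradient of sum(f(x_i)) w.r.t. each x_i (mimics f(x).sum() then grad)."""
    tape = []
    vars_ = [Var(x, tape) for x in xs]
    outs = [f(v) for v in vars_]
    for o in outs:              # d(sum)/d(out_i) = 1
        o.grad = 1.0
    for back in reversed(tape): # replay the tape backwards
        back()
    return [v.grad for v in vars_]

print(grad_sum_f(lambda x: x * 2.0, [1.0, 2.0, 3.0]))  # [2.0, 2.0, 2.0]
```

The result matches the slope example above: the gradient of `(x * 2.0).sum()` is 2 for every element of `x`.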
# Install

```sh
pip install slope-ad
```

or latest from main branch:

```sh
git clone https://github.com/radenmuaz/slope-ad
cd slope-ad
pip install -e .
```

or you can just copy `src/slope` into your project.
# Features

- Small (?): <3000 lines of core code in `slope/core.py`, after formatting with `black src --line-length 140`
- Functional API for forward-mode, reverse-mode, and higher-order AD, like in JAX:
  - `grad`, `vjp`, `jvp`, `jit`, `vmap`
  - `register_node`, `tree_flatten`, `tree_unflatten`
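The `tree_flatten` / `tree_unflatten` pair follows JAX's pytree idea: nested containers of tensors are flattened into a list of leaves plus a structure description, so transformations can work on flat argument lists and then rebuild the original nesting. A rough self-contained sketch of the concept (handling only lists and dicts; not slope's actual code):

```python
def tree_flatten(tree):
    """Flatten nested lists/dicts into (leaves, structure)."""
    if isinstance(tree, list):
        flat, structs = [], []
        for item in tree:
            leaves, s = tree_flatten(item)
            flat += leaves
            structs.append(s)
        return flat, ("list", structs)
    if isinstance(tree, dict):
        flat, structs = [], []
        for k in sorted(tree):  # sort keys for a deterministic leaf order
            leaves, s = tree_flatten(tree[k])
            flat += leaves
            structs.append((k, s))
        return flat, ("dict", structs)
    return [tree], "leaf"       # anything else is a leaf

def tree_unflatten(structure, leaves):
    """Inverse of tree_flatten: rebuild the nesting, consuming leaves in order."""
    it = iter(leaves)
    def build(s):
        if s == "leaf":
            return next(it)
        kind, children = s
        if kind == "list":
            return [build(c) for c in children]
        return {k: build(c) for k, c in children}
    return build(structure)

params = {"w": [1.0, 2.0], "b": 3.0}
leaves, struct = tree_flatten(params)
print(leaves)  # [3.0, 1.0, 2.0] -- "b" sorts before "w"
assert tree_unflatten(struct, leaves) == params
```

`register_node` plays the analogous role of teaching the flattener about new container types, as in JAX's pytree registry.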
- Just-in-time compilation, where code is compiled for these supported backends, running on CPU, CUDA, or Metal:
  - ONNX Runtime (ONNX graph)
  - OpenXLA IREE (StableHLO MLIR)
  - NumPy (Python code)
- Training and inference examples
- Operators and procedures system
  - 33 core operators defined in `slope/operators.py`:
    - Unary: `exp`, `log`, `sin`, `sqrt`, `invert`, `cast`, `stop_gradient`
    - Binary: `add`, `mul`, `sub`, `div`, `pow`, `equal`, `less`, `greater`, `maximum`
    - Reduce: `sum`, `max`
    - Shape: `reshape`, `expand`, `permute`, `slice`, `pad`, `flip`, `cat`
    - Init: `full`, `arange`, `random_normal`, `random_uniform`
    - GeneralReduce: `matmul`, `conv`, `gather_nd`, `scatter_nd`
  - Composite operators system with "procedures" in `slope/procedures.py`
    - For defining Tensor functions composed of core operators, e.g. `x.cos()`, where `def cos(x): return (math.pi/2 - x).sin()`, or `x.conv_transpose(w)`, where `def conv_transpose(x, w, ...): ...` is a very long function.
    - Procedures are exposed with `Tensor.procedure_name(*args)` syntax.
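The `cos` procedure above leans on the identity cos(x) = sin(π/2 − x), which lets a composite operator be expressed purely through the core `sin` operator. A quick check of that identity using the Python standard library (plain floats here, not slope tensors):

```python
import math

def cos_via_sin(x):
    # Same composition the cos procedure uses: cos(x) = sin(pi/2 - x)
    return math.sin(math.pi / 2 - x)

# The composite agrees with the direct cosine across sample points.
for x in [-3.0, -1.0, 0.0, 0.5, 2.0]:
    assert abs(cos_via_sin(x) - math.cos(x)) < 1e-12
```

Keeping the core operator set small and building everything else as procedures is what keeps `slope/core.py` compact: only the 33 core operators need gradients and backend translations.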
- Extensible
  - Add a new backend by defining implementation translations in `slope/backends`
  - Define new modules with the NN module `slope/nn.py`
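A backend is essentially a table translating each core operator into target code. Here is a toy, self-contained sketch of the idea as an interpreter-style "NumPy-like backend" over plain Python lists; slope's real backends instead emit compiled artifacts (ONNX graphs, StableHLO MLIR, or NumPy source), and the op names here are just illustrative:

```python
import math

# Toy backend: a translation table from core op names to implementations.
IMPL = {
    "mul": lambda a, b: [x * y for x, y in zip(a, b)],
    "exp": lambda a: [math.exp(x) for x in a],
    "sum": lambda a: [sum(a)],
}

def run(program, inputs):
    """Interpret a list of (op, arg_names, out_name) steps against IMPL."""
    env = dict(inputs)
    out = None
    for op, args, out in program:
        env[out] = IMPL[op](*(env[a] for a in args))
    return env[out]

prog = [("mul", ("x", "x"), "y"),   # y = x * x
        ("sum", ("y",), "z")]       # z = sum(y)
print(run(prog, {"x": [1.0, 2.0, 3.0]}))  # [14.0]
```

Supporting a new target then amounts to supplying a new translation table (or code emitter) for the same small set of core operators.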
# Docs

Docs are available online at radenmuaz.github.io/slope-ad. API reference: radenmuaz.github.io/slope-ad/api

- Quickstart: how Tensors work, how to write and jit-compile functions, and how to train something.
- NN Training: train an MLP on MNIST with the `slope.nn` module.
- Internals Walkthrough: understand the core of SlopeAD (hint: it is like JAX). Useful if you want to start contributing to SlopeAD.
- Extending SlopeAD: add a new backend, operators, and procedures; modify the core functions.
# Contributing

Open a PR; the items on the roadmap below need to be done.

- Docs
- Symbolic shape inference
- Dynamic-shape jit
- Make the optimizer filter out frozen params
- vmap of vjp and jvp to compute Jacobians and Hessians
- The IREE backend currently has fixed-seed random; implement Threefry and JAX-like random
- Make things fast
- LLaMA (GPT) training
- Whisper inference
- Core tests, operator tests on all Trace types