# mlx-spectro

High-performance STFT/iSTFT for Apple MLX with fused Metal kernels.
- Fused overlap-add with autotuned Metal kernels
- PyTorch-compatible STFT/iSTFT semantics
- Cached transforms for zero-overhead repeated calls
- Optional torch fallback for strict numerical parity
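The fused overlap-add runs as a Metal kernel, but the underlying math is ordinary weighted overlap-add with window-sum normalization. A pure-Python reference sketch (illustrative only; the real kernel fuses these loops on the GPU, and FFT is skipped here so frames are just windowed slices):

```python
import math

def hann_periodic(n_fft):
    # Periodic Hann window, the common STFT analysis/synthesis choice.
    return [0.5 - 0.5 * math.cos(2 * math.pi * n / n_fft) for n in range(n_fft)]

def overlap_add(frames, window, hop_length, length):
    # Weighted overlap-add: accumulate window * frame plus the squared-window
    # envelope, then normalize — the step a fused OLA+norm kernel performs.
    out = [0.0] * length
    norm = [0.0] * length
    for t, frame in enumerate(frames):
        for n, (w, v) in enumerate(zip(window, frame)):
            i = t * hop_length + n
            if i < length:
                out[i] += w * v
                norm[i] += w * w
    return [o / z if z > 1e-12 else 0.0 for o, z in zip(out, norm)]

# Round-trip check on a toy signal.
n_fft, hop = 8, 4
w = hann_periodic(n_fft)
x = [math.sin(0.3 * i) for i in range(32)]
frames = [[x[t * hop + n] * w[n] for n in range(n_fft)]
          for t in range((len(x) - n_fft) // hop + 1)]
y = overlap_add(frames, w, hop, len(x))
# Every sample with nonzero window coverage reconstructs exactly;
# sample 0 sits on the window's zero and gets no coverage.
```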
## Installation

```bash
pip install mlx-spectro
```

With optional torch fallback support:

```bash
pip install "mlx-spectro[torch]"
```

## Quick start

```python
import mlx.core as mx
from mlx_spectro import SpectralTransform

# Create a transform
transform = SpectralTransform(
    n_fft=2048,
    hop_length=512,
    window_fn="hann",
)

# Forward STFT
audio = mx.random.normal((1, 44100))
spec = transform.stft(audio, output_layout="bnf")

# Inverse STFT
reconstructed = transform.istft(spec, length=44100, input_layout="bnf")
```

## API

### `SpectralTransform`

Main class for STFT/iSTFT operations.
```python
SpectralTransform(
    n_fft: int,
    hop_length: int,
    win_length: int | None = None,
    window_fn: str = "hann",                  # "hann", "hamming", "rect"
    window: mx.array | None = None,           # custom window array
    periodic: bool = True,
    center: bool = True,
    normalized: bool = False,
    istft_backend_policy: str | None = None,  # "auto", "mlx_fft", "metal", "torch_fallback"
)
```

Methods:

- `stft(x, output_layout="bfn")` — Forward STFT. Input: `[T]` or `[B, T]`.
- `istft(z, length=None, ...)` — Inverse STFT. Returns `[B, T]`.
- `compiled_pair(length, layout="bnf", warmup_batch=None)` — Return compiled `(stft_fn, istft_fn)` for steady-state loops (10–20% faster).
- `warmup(batch=1, length=4096)` — Force kernel compilation.
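The layout strings encode output axis order. Assuming `"bfn"` means `(batch, freq, frames)` and `"bnf"` means `(batch, frames, freq)` — my reading of the examples, not something the docs state — the expected shapes can be computed without running a transform:

```python
def stft_shape(batch, n_samples, n_fft, hop_length, layout="bfn", center=True):
    # Sketch of expected STFT output shapes under the layout assumption above,
    # with PyTorch-style framing: center=True pads n_fft // 2 on both sides.
    n_freq = n_fft // 2 + 1                        # one-sided spectrum bins
    if center:
        n_frames = 1 + n_samples // hop_length
    else:
        n_frames = 1 + (n_samples - n_fft) // hop_length
    return (batch, n_freq, n_frames) if layout == "bfn" else (batch, n_frames, n_freq)

print(stft_shape(1, 44100, 2048, 512, layout="bnf"))  # (1, 87, 1025)
```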
Additional helper utilities:

- Factory that returns cached `SpectralTransform` instances for repeated use.
- Create or validate a 1D analysis window.
- Resolve effective FFT parameters with PyTorch-compatible defaults.
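The helper names aren't shown above, so the sketch below uses hypothetical ones. It illustrates what "PyTorch-compatible defaults" implies: `win_length` defaults to `n_fft`, `hop_length` to `n_fft // 4`, and a periodic window is the symmetric window of length `win_length + 1` with the final endpoint dropped (the `torch.hann_window` convention):

```python
import math

def make_window(window_fn="hann", win_length=2048, periodic=True):
    # Hypothetical sketch of 1D analysis-window creation.
    denom = win_length if periodic else win_length - 1
    if window_fn == "hann":
        return [0.5 - 0.5 * math.cos(2 * math.pi * i / denom) for i in range(win_length)]
    if window_fn == "hamming":
        return [0.54 - 0.46 * math.cos(2 * math.pi * i / denom) for i in range(win_length)]
    if window_fn == "rect":
        return [1.0] * win_length
    raise ValueError(f"unknown window_fn: {window_fn!r}")

def resolve_fft_params(n_fft, hop_length=None, win_length=None):
    # Hypothetical sketch of parameter resolution with PyTorch-style defaults.
    win_length = n_fft if win_length is None else win_length
    hop_length = n_fft // 4 if hop_length is None else hop_length
    if win_length > n_fft:
        raise ValueError("win_length must not exceed n_fft")
    return n_fft, hop_length, win_length

print(resolve_fft_params(2048))  # -> (2048, 512, 2048)
```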
## Benchmarks

Apple M4 Max, macOS 26.3, MLX 0.30.6, PyTorch 2.10.0, 20 iterations (5 warmup).
**STFT:**

| Config | mlx-spectro | torch MPS | mlx-stft | vs torch | vs mlx-stft |
|---|---|---|---|---|---|
| B=1 T=16k nfft=512 | 0.16 ms | 0.21 ms | 0.31 ms | 1.4x | 1.9x |
| B=4 T=160k nfft=1024 | 0.37 ms | 0.78 ms | 1.09 ms | 2.1x | 3.0x |
| B=8 T=160k nfft=1024 | 0.28 ms | 0.68 ms | 1.53 ms | 2.5x | 5.6x |
| B=4 T=1.3M nfft=1024 | 0.79 ms | 1.71 ms | 5.03 ms | 2.2x | 6.3x |
| B=8 T=480k nfft=1024 | 0.58 ms | 1.30 ms | 3.73 ms | 2.2x | 6.4x |
**iSTFT:**

| Config | mlx-spectro | torch MPS | mlx-stft | vs torch | vs mlx-stft |
|---|---|---|---|---|---|
| B=1 T=16k nfft=512 | 0.17 ms | 0.49 ms | 0.25 ms | 3.0x | 1.5x |
| B=4 T=160k nfft=1024 | 0.21 ms | 0.99 ms | 0.98 ms | 4.8x | 4.7x |
| B=8 T=160k nfft=1024 | 0.29 ms | 1.58 ms | 1.62 ms | 5.4x | 5.6x |
| B=4 T=1.3M nfft=1024 | 0.77 ms | 5.74 ms | 6.68 ms | 7.5x | 8.7x |
| B=8 T=480k nfft=1024 | 0.60 ms | 4.10 ms | 4.55 ms | 6.8x | 7.6x |
**Round trip (STFT → iSTFT):**

| Config | mlx-spectro | torch MPS | vs torch |
|---|---|---|---|
| B=1 T=16k nfft=512 | 0.32 ms | 0.97 ms | 3.0x |
| B=4 T=160k nfft=1024 | 0.61 ms | 2.28 ms | 3.7x |
| B=8 T=160k nfft=1024 | 1.05 ms | 4.33 ms | 4.1x |
| B=4 T=1.3M nfft=1024 | 4.30 ms | 17.44 ms | 4.1x |
| B=8 T=480k nfft=1024 | 3.01 ms | 12.53 ms | 4.2x |
**Reconstruction error:**

| Config | mlx-spectro | torch MPS |
|---|---|---|
| B=1 T=16k nfft=512 | 1.67e-06 | 2.38e-06 |
| B=4 T=160k nfft=2048 | 2.86e-06 | 5.25e-06 |
| B=8 T=480k nfft=1024 | 3.81e-06 | 4.77e-06 |
## Compiled transforms

For tight inference loops with fixed input shapes, `compiled_pair` eliminates
per-call Python dispatch overhead (10–20% faster for small workloads):

```python
t = SpectralTransform(n_fft=1024, hop_length=256, window_fn="hann")
stft, istft = t.compiled_pair(length=44100, warmup_batch=2)

for chunk in audio_stream:
    z = stft(chunk)
    z = process(z)
    y = istft(z)
    mx.eval(y)
```

Use the eager `t.stft()` / `t.istft()` methods when input shapes vary.
## Environment variables

| Variable | Default | Description |
|---|---|---|
| `SPEC_MLX_AUTOTUNE` | `1` | Enable Metal kernel autotuning |
| `SPEC_MLX_TGX` | — | Force threadgroup size (e.g. `256` or `kernel:256`) |
| `SPEC_MLX_AUTOTUNE_PERSIST` | `1` | Persist autotune results to disk |
| `SPEC_MLX_AUTOTUNE_CACHE_PATH` | — | Override autotune cache file path |
| `MLX_OLA_FUSE_NORM` | `1` | Enable fused OLA+normalization kernel |
| `SPEC_MLX_CACHE_STATS` | `0` | Enable cache debug counters |
## License

MIT