The first high-precision CoreML port of Meta's HTDemucs music source separation model.
Separate any song into 6 stems—drums, bass, vocals, other, piano, guitar—running natively on Apple Silicon via CoreML. No Python runtime, no cloud API, just fast on-device inference.
HTDemucs is notoriously difficult to port. The model uses complex-valued STFT/iSTFT operations that CoreML doesn't support natively. Previous attempts either failed or required keeping PyTorch in the loop.
This project solves that by:
- **Model surgery**: extract the "inner model" that operates on spectrograms, bypassing the problematic STFT layers
- **Native signal processing**: implement STFT/iSTFT using Apple's vDSP (Accelerate framework), matching HTDemucs exactly (see the sketch below)
- **Mixed precision**: FP32 for normalization and attention (precision-sensitive), FP16 elsewhere (performance)
The result: CoreML inference that matches PyTorch output within perceptual tolerance.
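To make the native signal processing point concrete, here is a minimal vDSP-based STFT sketch in Swift. This is not the HTDemucsKit implementation: the function name and the frame/hop defaults are illustrative assumptions (HTDemucs uses Hann-windowed, overlapping frames; check the source for the exact parameters).

```swift
import Accelerate

/// Minimal STFT sketch using vDSP. Hypothetical helper, not HTDemucsKit API;
/// the frameSize/hopSize defaults are assumptions for illustration.
func stft(signal: [Float], frameSize: Int = 4096, hopSize: Int = 1024)
    -> [(real: [Float], imag: [Float])] {
    // Hann window, as HTDemucs applies before its FFT.
    let window = vDSP.window(ofType: Float.self,
                             usingSequence: .hanningDenormalized,
                             count: frameSize,
                             isHalfWindow: false)
    guard let dft = vDSP.DFT(count: frameSize,
                             direction: .forward,
                             transformType: .complexComplex,
                             ofType: Float.self) else { return [] }

    var frames: [(real: [Float], imag: [Float])] = []
    var start = 0
    while start + frameSize <= signal.count {
        // Window the frame, then run a DFT with zero imaginary input.
        let frame = vDSP.multiply(Array(signal[start..<start + frameSize]), window)
        let zeros = [Float](repeating: 0, count: frameSize)
        var outReal = [Float](repeating: 0, count: frameSize)
        var outImag = [Float](repeating: 0, count: frameSize)
        dft.transform(inputReal: frame, inputImaginary: zeros,
                      outputReal: &outReal, outputImaginary: &outImag)
        frames.append((real: outReal, imag: outImag))
        start += hopSize
    }
    return frames
}
```

The inverse direction works the same way: an inverse DFT per frame followed by windowed overlap-add resynthesizes the waveform from the spectrograms the inner model produces.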
```bash
# Separate a song into stems
htdemucs-cli separate song.mp3 --output-dir stems/
```

Output:

```
stems/
├── drums.wav
├── bass.wav
├── vocals.wav
├── other.wav
├── piano.wav
└── guitar.wav
```
Add to your `Package.swift`:

```swift
dependencies: [
    .package(url: "https://github.com/youruser/HTDemucsCoreML.git", from: "1.0.0")
]
```

Or build the CLI from source:

```bash
git clone https://github.com/youruser/HTDemucsCoreML.git
cd HTDemucsCoreML
swift build -c release
```

The CLI tool will be at `.build/release/htdemucs-cli`.
```bash
# Basic separation
htdemucs-cli separate input.mp3 --output-dir output/

# Specify output format
htdemucs-cli separate input.wav --output-dir output/ --format flac

# Process multiple files
htdemucs-cli separate *.mp3 --output-dir stems/
```

Or use the Swift API directly:

```swift
import HTDemucsKit

let pipeline = try SeparationPipeline()
let stems = try await pipeline.separate(url: audioURL)

// Access individual stems
try stems.drums.write(to: drumsURL)
try stems.vocals.write(to: vocalsURL)
```

See the Swift API Guide for progress tracking, configuration, and advanced usage.
| Stem | Description |
|---|---|
| drums | Kick, snare, hi-hats, cymbals, percussion |
| bass | Bass guitar, synth bass, sub-bass |
| vocals | Lead vocals, backing vocals, spoken word |
| other | Everything else—synths, pads, FX, strings |
| piano | Acoustic and electric piano, keys |
| guitar | Acoustic and electric guitar |
- macOS 13+ or iOS 18+
- Apple Silicon recommended (Intel Macs work, but are slower)
- ~500MB RAM per separation
- Architecture Overview — How the pipeline works
- Swift API Guide — Using HTDemucsKit in your projects
- Technical Decisions — Why things are built this way
CoreML output matches the PyTorch reference within 1-2 dB across SDR/SIR/SAR metrics; for audio applications, this is perceptually identical.
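For context on those metrics, SDR is a log ratio of reference-signal power to error power. Below is a minimal per-stem sketch; it is a simplification, since the full BSS Eval metrics additionally decompose the error into interference and artifact components to produce SIR and SAR.

```swift
import Accelerate
import Foundation

/// Simplified signal-to-distortion ratio in dB between a reference stem and
/// an estimate. Illustrative only: full BSS Eval SDR also allows a distortion
/// filter and splits the error into interference/artifact terms.
func sdr(reference: [Float], estimate: [Float]) -> Float {
    let error = vDSP.subtract(reference, estimate)     // e = s - ŝ
    let signalPower = vDSP.sumOfSquares(reference)     // ||s||²
    let errorPower = max(vDSP.sumOfSquares(error), .leastNormalMagnitude)
    return 10 * log10(signalPower / errorPower)        // 10·log10(||s||²/||e||²)
}
```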
This project builds on Meta's Demucs model. See the original repository for model licensing.