Unofficial PyTorch implementation of Higgs Audio V2 Tokenizer with HuBERT semantic features. Complete training pipeline for semantic-acoustic audio tokenization with 960x downsampling and 8-layer RVQ.
pytorch audio-synthesis speech-processing audio-processing vector-quantization dac semantic-features hubert audio-generation neural-audio-codec rvq audio-tokenizer neural-codec higgs-audio speech-tokenization
-
Updated
Oct 8, 2025 - Python