Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. It chains multiple open-source models while keeping dependencies to a minimum.
speech mos vad youtube-downloader data-cleaning source-separation cross-talk spoken-language-identification
Updated Feb 27, 2025 - Python