******* THIS REP IS NOT FULLY UPLOADED. COME BACK LATER. _JAN 28 2025 *******
Project Title: Bark Revival
Description: An adaptation (creative hack) of Bark TTS (GitHub)
This research explores the application of language models and machine-learning techniques to generate musical voices with personality. By utilizing autoregressive transformers, specifically the Bark model, tokens are generated from input text to produce a unique invented and undefinable language. Composition is being made between text and sound using various control techniques, including tokens repetition and windowing, Lempel-Ziv-Welch compression, and token clustering from acoustic feature extraction, to regulate the output voice's granularity, intelligibility, and meaning. A recursive generation system is also introduced, allowing for the creation of a large series of interrelated voices. The research is used in various artistic applications, including music remixing and theater production. It explores other forms of expressive voices and storytelling seamlessly lying right in the middle between text and sound.
Paper to come...
Features:
- Experiments with code and audio examples for ICMC 2025 paper.
- Explores application of language models and machine-learning techniques to generate musical voices with personality using autoregressive transformers (Bark model).
- Generates unique invented language from input text tokens.
- Composition techniques:
- Text-to-sound control: token repetition, windowing, Lempel-Ziv-Welch compression, token clustering from acoustic feature extraction.
- Recursive generation system for creating interrelated voices.
Applications:
- Music remixing.
- Theater-Music production.
- Other expressive voices and storytelling forms seamlessly blending text and sound.
Compatibility:
Operating System | Installer Availability |
---|---|
Windows | ✅ |
MacOS | ✅ |