Skip to content

Musical Voice Synthesis at the Midpoint: Where Text Meets Sound

License

Notifications You must be signed in to change notification settings

opasquetdotfr/Bark_revival

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 

Repository files navigation

bark_revival png

******* THIS REP IS NOT FULLY UPLOADED. COME BACK LATER. _JAN 28 2025 *******

Project Title: Bark Revival

Description: An adaptation (creative hack) of Bark TTS (GitHub)

This research explores the application of language models and machine-learning techniques to generate musical voices with personality. By utilizing autoregressive transformers, specifically the Bark model, tokens are generated from input text to produce a unique invented and undefinable language. Composition is being made between text and sound using various control techniques, including tokens repetition and windowing, Lempel-Ziv-Welch compression, and token clustering from acoustic feature extraction, to regulate the output voice's granularity, intelligibility, and meaning. A recursive generation system is also introduced, allowing for the creation of a large series of interrelated voices. The research is used in various artistic applications, including music remixing and theater production. It explores other forms of expressive voices and storytelling seamlessly lying right in the middle between text and sound.

Paper to come...

Features:

  • Experiments with code and audio examples for ICMC 2025 paper.
  • Explores application of language models and machine-learning techniques to generate musical voices with personality using autoregressive transformers (Bark model).
  • Generates unique invented language from input text tokens.
  • Composition techniques:
    • Text-to-sound control: token repetition, windowing, Lempel-Ziv-Welch compression, token clustering from acoustic feature extraction.
    • Recursive generation system for creating interrelated voices.

Applications:

  • Music remixing.
  • Theater-Music production.
  • Other expressive voices and storytelling forms seamlessly blending text and sound.

Compatibility:

Operating System Installer Availability
Windows
MacOS

About

Musical Voice Synthesis at the Midpoint: Where Text Meets Sound

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published