Exo Stack

Exo Stack is an audiovisual way to interface with an exo AI, and includes speech-to-text as an input to enable 'conversations' with the AI.

Installation

Clone the repository
Add a (short) video or a picture to the folder
Add a short audio clip of a voice to the folder
Edit the config, and set the video and vocal references
Download the Video Engine checkpoint from here and put it in the folder 'videoEngine/checkpoints'
Download the Vocal Engine checkpoints from here, and extract them here:

encoder\saved_models\pretrained.pt
synthesizer\saved_models\pretrained\pretrained.pt
vocoder\saved_models\pretrained\pretrained.pt

python stack.py

Contributing is greatly appreciated, please contact one of the team members to get started working on the codebase.