This project is dedicated to developing advanced voice conversion technology using deep learning techniques. Our aim is to transform diverse voice inputs into a single, consistent target voice while preserving the original speech's nuances.
- Google Colab account
- Access to Google Drive
-
Google Colab/Drive Setup: This project is implemented and executed using Google Colab and Google Drive. Ensure you have access to these services.
-
Accessing the Notebooks:
main.ipynb
is the primary notebook for the project, featuring the successful implementation of our voice conversion model.failed_example.ipynb
provides insights into our initial approach, which was later revised due to its limitations.
-
Data and Pre-trained Models:
- Links to download necessary data and our pre-trained models are included within
main.ipynb
. Follow the instructions in the notebook for setup. - You also have the option to download data directly from YouTube for training. Instructions for this process are provided in the notebook.
- Links to download necessary data and our pre-trained models are included within
main.ipynb
: The main Jupyter notebook with the implementation of the voice conversion model.failed_example.ipynb
: A Jupyter notebook documenting our initial, unsuccessful approach.
To use the voice conversion process, follow the step-by-step instructions in main.ipynb
. The notebook guides you through data loading, model execution, and optionally downloading and using data from YouTube.