EaseVoice Trainer

EaseVoice Trainer is a backend project designed to streamline and enhance the training process for voice synthesis and transformation. It is built upon the foundation of GPT-SoVITS, inheriting its core concepts while introducing various improvements to make the system more accessible, elegant, and user-friendly.

Although EaseVoice Trainer takes inspiration from GPT-SoVITS, we chose not to directly fork the original repository due to significant design differences and unique objectives. Our focus is on creating a refined, modular system tailored to specific use cases and improving maintainability.

Key Features

User-Friendly Design: Simplified workflows and intuitive configurations make it easier for users to deploy and manage.
Stability: ensuring consistent and reliable performance in the process of cloing and training.
Training Observability: offering comprehensive monitoring tools, providing clear insights into progress of cloning and training with performance metrics.
Clean Architecture: We have split the project into separate frontend and backend repositories for better modularity and maintainability. The portal repository is at EaseVoice Trainer Frontend.
RESTful API: The backend provides a RESTful API for easy integration with other services and applications.
Scalability: Built with scalability in mind, making it suitable for both small-scale experiments and large-scale production.
Integrate Tensorboard: We have integrated Tensorboard for real-time monitoring and visualization of training progress.

Screenshorts

Getting Started

Pretrained Models

As in GPT-SoVITS, you can download Pretrained Models, and then put them into the models directory.

Local Run

Before using EaseVoice Trainer, ensure you have the following installed: • Python 3.9 or higher • uv.

We leverage uv for robust and efficient project management. To start the server using uv:

uv run src/main.py

Run with Docker

Build the Docker image

cd scripts/Dockerfile
docker build -t megaease/easevoice-trainer .

Run the Docker Container

docker run -p 8000:8000 megaease/easevoice-trainer

This command ensures that the application running inside the container on port 8000 is accessible locally via http://localhost:8000.

Community

We welcome contributions from the community! Whether it’s fixing bugs, adding new features, or improving documentation, feel free to submit a pull request or open an issue.

License

EaseVoice Trainer is under the Apache 2.0 license. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 267 Commits
.github/workflows		.github/workflows
configs		configs
docs/images		docs/images
examples		examples
models/pretrained		models/pretrained
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
MAINTAINERS		MAINTAINERS
README.md		README.md
README.zh-CN.md		README.zh-CN.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EaseVoice Trainer

Key Features

Screenshorts

Getting Started

Pretrained Models

Local Run

Run with Docker

Community

License

About

Releases

Packages

Contributors 3

Languages

License

megaease/easevoice-trainer

Folders and files

Latest commit

History

Repository files navigation

EaseVoice Trainer

Key Features

Screenshorts

Getting Started

Pretrained Models

Local Run

Run with Docker

Community

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages