Releases: matatonic/openedai-speech
Releases · matatonic/openedai-speech
0.14.0
Recent Changes
Version 0.14.0, 2024-06-26
- Added
response_format
:wav
andpcm
support - Output streaming (while generating) for
tts-1
andtts-1-hd
- Enhanced generation parameters for xtts models (temperature, top_p, etc.)
- Idle unload timer (optional) - doesn't work perfectly yet
- Improved error handling
0.13.0
Recent Changes
Version 0.13.0, 2024-06-25
- Added Custom fine-tuned XTTS model support
- Initial prebuilt arm64 image support (Apple M-series, Raspberry Pi - MPS is not supported in XTTS/torch), thanks @JakeStevenson, @hchasens
- Initial attempt at AMD GPU (ROCm 5.7) support
- Parler-tts support removed
- Move the *.default.yaml to the root folder
- Run the docker as a service by default (
restart: unless-stopped
) - Added
audio_reader.py
for streaming text input and reading long texts
0.12.3
0.12.2
0.12.1
0.12.0
0.11.0
Recent Changes
Version 0.11.0, 2024-05-29
- 🌐 Multilingual support (16 languages) with XTTS
- Remove high Unicode filtering from the default
config/pre_process_map.yaml
- Update Docker build & app startup. thanks @justinh-rahb
- Fix: "Plan failed with a cudnnException"
- Remove piper cuda support
0.10.1
0.10.0
Recent Changes
Version: 0.10.0
- Pre-built & tested docker images, smaller docker images (8GB or 860MB)
- Better upgrades: reorganize config files under
config/
, voice models undervoices/
- Compatibility! If you customized your
voice_to_speaker.yaml
orpre_process_map.yaml
you need to move them to theconfig/
folder.