Releases · matatonic/openedai-speech · GitHub

27 Jun 00:56

matatonic

0.14.0

Recent Changes

Version 0.14.0, 2024-06-26

Added response_format: wav and pcm support
Output streaming (while generating) for tts-1 and tts-1-hd
Enhanced generation parameters for xtts models (temperature, top_p, etc.)
Idle unload timer (optional) - doesn't work perfectly yet
Improved error handling

Assets 2

25 Jun 21:28

matatonic

0.13.0

Recent Changes

Version 0.13.0, 2024-06-25

Added Custom fine-tuned XTTS model support
Initial prebuilt arm64 image support (Apple M-series, Raspberry Pi - MPS is not supported in XTTS/torch), thanks @JakeStevenson, @hchasens
Initial attempt at AMD GPU (ROCm 5.7) support
Parler-tts support removed
Move the *.default.yaml to the root folder
Run the docker as a service by default (restart: unless-stopped)
Added audio_reader.py for streaming text input and reading long texts

Contributors

JakeStevenson and hchasens

Assets 2

17 Jun 16:33

matatonic

0.12.3

Additional logging details for BadRequests (400)

Assets 2

17 Jun 04:27

matatonic

0.12.2

Recent Changes

Version 0.12.2, 2024-06-16

Fix :min image requirements (numpy<2?)

Assets 2

17 Jun 03:54

matatonic

0.12.1

Recent Changes

Version 0.12.1, 2024-06-16

no change, just poking github packages.

Assets 2

17 Jun 03:39

matatonic

0.12.0

Version 0.12.0, 2024-06-16

Improved error handling and logging
Restore the original alloy tts-1-hd voice by default, use alloy-alt for the old voice.

Assets 2

29 May 21:03

matatonic

0.11.0

Recent Changes

Version 0.11.0, 2024-05-29

🌐 Multilingual support (16 languages) with XTTS
Remove high Unicode filtering from the default config/pre_process_map.yaml
Update Docker build & app startup. thanks @justinh-rahb
Fix: "Plan failed with a cudnnException"
Remove piper cuda support

Contributors

justinh-rahb

Assets 2

05 May 17:38

matatonic

0.10.1

Recent Changes

Version: 0.10.1, 2024-05-05

Remove runtime: nvidia from docker-compose.yml, this assumes nvidia/cuda compatible runtime is available by default. thanks @jmtatsch

Contributors

jmtatsch

Assets 2

27 Apr 15:40

matatonic

0.10.0

Recent Changes

Version: 0.10.0

Pre-built & tested docker images, smaller docker images (8GB or 860MB)
Better upgrades: reorganize config files under config/, voice models under voices/
Compatibility! If you customized your voice_to_speaker.yaml or pre_process_map.yaml you need to move them to the config/ folder.

Assets 2