A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Updated Sep 7, 2025 - Python
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.
EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education field, dedicated to achieving seamless interconnection and interoperability among different AI models, educational applications, smart hardware, and teaching AGENTs.
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
✨ NovelAI API Python SDK: easy to use, modern, and user-friendly.
AI generates a vivid conversational podcast for ANY research paper!
OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline
Voice Alignment and Conversion with Neural Networks and the WORLD codec.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
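EasyTTS is a convenient tool for calling OpenAI's text-to-speech (TTS) functionality through third-party API services. EasyTTS lets users enter text and choose among different models, voices, and formats to generate audio files.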
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
Project page for StableForm-TTS: Improving Robustness of Diffusion-Based Zero-Shot Speech Synthesis via Stable Formant Generation
Voice Generator Project FEAT. Hatsune Miku
A Raspberry Pi magic mirror based on facial recognition
Personal surveillance system that uses screenshots, a webcam, and OpenAI GPT-4o to check whether the user is focused on their tasks. If not, the user is roasted in GLaDOS's voice and character until they regain focus.
Tool for scraping posts and their comments from Reddit, adding music and voiceovers, creating shorts, and automatically uploading them to YouTube