Gemini TTS Automator WebApp

Description

A web-based application that fully automates text-to-speech (TTS) generation using Google Agents in conjunction with Gemini TTS models. Supports modern Persian (Farsi, fa-IR, in preview) and English language capabilities. Leverages Google Cloud's Vertex AI or AI Studio for agent orchestration, ensuring automation through agent-driven workflows.

Features

User input handling for Persian and English text.
Language and voice selection with auto-detection.
Automated TTS generation with style controls (tone, pace, accent).
Output delivery in formats like MP3, with playback and download options.
Multilingual UI (Persian/English).
User-managed Google API tokens for quotas.

Getting Started

Clone the repo: git clone https://github.com/your-username/gemini-tts-automator-webapp.git
Install dependencies: npm install (for Node.js backend) or follow frontend setup.
Configure Google API credentials.

Links

Live Demo: index.html (placeholder)
History: history.json

Contributing

See CONTRIBUTING.md for guidelines.

License

MIT License - see LICENSE for details.

Notes about Persian (fa-IR) preview support

Persian (fa-IR) support in Gemini TTS may be in preview and could not be available in all accounts/regions yet. Recommended fallbacks:

Provide a configuration option to select a fallback TTS engine (e.g., another Google voice region or a third-party TTS provider) for Persian if the preview model is unavailable.
Document in the README and configuration the required Vertex AI/AI Studio quotas and the steps to request access to preview models.
Implement graceful error handling that informs users and offers a fallback voice/locale automatically.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.github		.github
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
history.json		history.json
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gemini TTS Automator WebApp

Description

Features

Getting Started

Links

Contributing

License

Notes about Persian (fa-IR) preview support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Gemini TTS Automator WebApp

Description

Features

Getting Started

Links

Contributing

License

Notes about Persian (fa-IR) preview support

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages