A modern, modular Text-to-Speech (TTS) system built with Flask, featuring local inference with Kokoro TTS models.
## Features
- Modular Architecture: Easily extensible with new TTS models
- Web Interface: Clean, responsive Flask-based UI
- Local Inference: Run completely offline with local models
- Performance Monitoring: Real-time GPU/CPU usage tracking
- Multiple Voices: Support for various voices and speakers
- Audio Generation: High-quality WAV output
- Logging: Comprehensive generation logs with performance metrics
## Supported Models
- Kokoro: Fast, lightweight TTS optimized for local macOS deployment
## Requirements
- macOS (Intel or Apple Silicon)
- Python 3.10+
- Homebrew
## Quick Setup

Run the setup script to configure your environment automatically:

```bash
./setup-neo-tts.sh
```

This will:
- Install Homebrew dependencies (ffmpeg, pkg-config, Python 3.10)
- Create a virtual environment
- Install all required Python packages
- Download TTS models
- Set up project directories
## Manual Installation

If you prefer manual installation:

1. Install system dependencies:

   ```bash
   brew install ffmpeg pkg-config python@3.10
   ```

2. Create and activate a virtual environment:

   ```bash
   python3.10 -m venv venv
   source venv/bin/activate
   ```

3. Install Python dependencies:

   ```bash
   pip install -r requirements-neo.txt
   ```

4. Download the models:

   ```bash
   python -c "
   from huggingface_hub import snapshot_download
   snapshot_download(repo_id='hexgrad/Kokoro-82M', local_dir='models/kokoro_cache')
   "
   ```
## Usage

1. Activate the virtual environment:

   ```bash
   source venv/bin/activate
   ```

2. Start the server:

   ```bash
   python app/app.py
   ```

3. Open your browser and visit http://localhost:5000

4. Generate speech:
   - Select a TTS model
   - Choose a voice
   - Enter your text
   - Click generate
## Project Structure

```
local-tts-devlopment/
├── app/                    # Flask application
│   ├── static/             # Static assets (CSS, JS, output files)
│   ├── templates/          # HTML templates
│   ├── app.py              # Main Flask application
│   ├── device_utils.py     # Device monitoring utilities
│   └── __init__.py
├── models/                 # TTS model implementations
│   └── kokoro.py           # Kokoro TTS wrapper
├── logs/                   # Generation logs and metrics
├── venv/                   # Virtual environment (created by setup)
├── setup-neo-tts.sh        # Automated setup script
├── requirements-neo.txt    # Python dependencies
└── README.md               # This file
```
## Hardware Support

The application automatically detects your hardware and optimizes accordingly:
- Apple Silicon (M1/M2/M3): Uses Metal acceleration when available
- Intel Macs: Falls back to CPU inference
- GPU Monitoring: Tracks VRAM usage and performance metrics
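The fallback described above can be sketched in a few lines. This is an illustrative helper, not the project's actual code: the name `pick_device` is made up here, and using PyTorch's MPS backend for Metal is an assumption based on PyTorch being listed as the ML framework:

```python
import platform


def pick_device() -> str:
    """Pick the best available inference device (illustrative sketch)."""
    try:
        import torch  # optional: only consulted when PyTorch is installed

        # Apple Silicon exposes Metal acceleration through PyTorch's MPS backend.
        if torch.backends.mps.is_available():
            return "mps"
    except (ImportError, AttributeError):
        pass
    # Intel Macs (or environments without PyTorch) fall back to CPU inference.
    return "cpu"


print(f"Running on {platform.machine()} -> device: {pick_device()}")
```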
## Performance Monitoring

The application provides real-time monitoring of:
- CPU/GPU usage during generation
- Generation time and audio duration
- Model performance metrics
- Device information
Access monitoring data via the `/api/device-info` endpoint.
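For example, a client could poll that endpoint while the server is running. This is only a sketch: the JSON field names (`device`, etc.) are guesses, not a documented response schema:

```python
import json
from urllib.error import URLError
from urllib.request import urlopen


def fetch_device_info(base_url: str = "http://localhost:5000") -> dict:
    """Fetch device-info JSON; return {} if the server is unreachable."""
    try:
        with urlopen(f"{base_url}/api/device-info", timeout=5) as resp:
            return json.load(resp)
    except URLError:
        return {}


info = fetch_device_info()
# 'device' is a hypothetical key used for illustration only.
print(info.get("device", "server not reachable"))
```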
## Testing

Run the included verification test:

```bash
# Test the Kokoro model
python -c "
from models.kokoro import list_voices, generate_audio
voices = list_voices()
if voices:
    generate_audio('Hello from Kokoro', voices[0])
    print('✅ Kokoro OK')
"
```

## Contributing

1. Fork the repository
2. Create a feature branch:

   ```bash
   git checkout -b feature-name
   ```

3. Make your changes
4. Test thoroughly
5. Submit a pull request
## Adding New Models

To add support for new TTS models:

1. Create a new module in `models/`
2. Implement the required interface:
   - `list_voices()`: Return available voices
   - `generate_audio(text, voice, output_path)`: Generate an audio file
3. Register the model in the `MODELS` dictionary in `app/app.py`
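A minimal backend module satisfying that interface might look like the following hypothetical `models/dummy.py`. It writes a placeholder tone instead of real speech, and the voice names are invented for illustration:

```python
# models/dummy.py -- hypothetical example backend (not a real TTS model)
import math
import struct
import wave

VOICES = ["dummy_female", "dummy_male"]  # illustrative voice ids


def list_voices():
    """Return the voices this backend supports."""
    return list(VOICES)


def generate_audio(text, voice, output_path="output.wav"):
    """Write a one-second 440 Hz tone as a stand-in for synthesized speech."""
    sample_rate, seconds = 16000, 1
    frames = b"".join(
        struct.pack("<h", int(32767 * 0.3 * math.sin(2 * math.pi * 440 * i / sample_rate)))
        for i in range(sample_rate * seconds)
    )
    with wave.open(output_path, "wb") as wav:
        wav.setnchannels(1)          # mono
        wav.setsampwidth(2)          # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(frames)
    return output_path
```

Registration would then be something like `MODELS["dummy"] = dummy` in `app/app.py`, though the exact shape depends on how the existing dictionary is defined.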
## License

This project is licensed under the MIT License - see the LICENSE file for details.
## Acknowledgments

- Kokoro TTS - Fast local TTS
- Flask - Web framework
- PyTorch - Machine learning framework
## Support

If you encounter any issues:

- Check the logs in `app/logs/results.csv`
- Ensure your virtual environment is activated
- Verify all dependencies are installed
- Check device compatibility
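Assuming the log is a standard CSV file, it can be inspected with a few lines of Python. The column names are not documented here, so this sketch only reports whatever headers it finds:

```python
import csv
from pathlib import Path


def load_results(path="app/logs/results.csv"):
    """Read generation-log rows as dicts; return [] if the file is missing."""
    log = Path(path)
    if not log.exists():
        return []
    with log.open(newline="") as f:
        return list(csv.DictReader(f))


rows = load_results()
if rows:
    print(f"{len(rows)} generations logged; columns: {list(rows[0])}")
```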
For bugs or feature requests, please open an issue on GitHub.