An AI-powered voice assistant that detects emotional states from speech and responds with appropriate empathy using open-source tools.
- Real-time Speech Recognition - Powered by OpenAI Whisper
- Emotion Detection - Analyzes voice tone, pitch, and prosodic features
- Empathic Response Generation - Context-aware responses using local LLM
- Adaptive Text-to-Speech - Voice output that matches emotional context
- Conversation Memory - Tracks emotional context across interactions
- Privacy-First - All processing happens locally, no data sent to external services
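The conversation-memory feature can be illustrated with a minimal sketch. All names here (`Turn`, `ConversationMemory`, `dominant_emotion`) are hypothetical, not the project's actual API:

```python
from collections import deque
from dataclasses import dataclass

@dataclass
class Turn:
    """One exchange: what the user said and the emotion detected in it."""
    text: str
    emotion: str

class ConversationMemory:
    """Keeps a sliding window of recent turns so responses can reference
    the user's emotional trajectory, not just the last utterance."""

    def __init__(self, max_turns: int = 10):
        self.turns = deque(maxlen=max_turns)

    def add(self, text: str, emotion: str) -> None:
        self.turns.append(Turn(text, emotion))

    def dominant_emotion(self) -> str:
        """Most frequent emotion across the remembered window."""
        if not self.turns:
            return "neutral"
        counts = {}
        for turn in self.turns:
            counts[turn.emotion] = counts.get(turn.emotion, 0) + 1
        return max(counts, key=counts.get)

memory = ConversationMemory(max_turns=5)
memory.add("I lost my keys again", "anxious")
memory.add("And now I'm late for work", "anxious")
memory.add("At least the weather is nice", "calm")
print(memory.dominant_emotion())  # anxious
```

A bounded `deque` keeps memory usage constant no matter how long the conversation runs.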
```
Audio Input → Speech Recognition → Emotion Detection → Response Generation → Text-to-Speech → Audio Output
     ↓               ↓                     ↓                      ↓                  ↓
 Microphone     Whisper STT       Librosa + ML Classifier  Local LLM (Ollama/HF)  Piper TTS
```
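The flow above can be sketched as a simple chain of stages. The stub functions below stand in for the real Whisper, Librosa, LLM, and Piper calls; every name and return value here is illustrative, not the project's actual interface:

```python
# Each stage is a function from the previous stage's output to the next input.
# Real implementations would wrap Whisper, Librosa, an Ollama/HF model, and Piper.

def recognize_speech(audio: bytes) -> str:
    return "I had a really rough day"                 # stub for Whisper STT

def detect_emotion(audio: bytes) -> str:
    return "sad"                                      # stub for Librosa + ML classifier

def generate_response(text: str, emotion: str) -> str:
    return f"That sounds hard. ({emotion} detected)"  # stub for local LLM

def synthesize_speech(text: str) -> bytes:
    return text.encode()                              # stub for Piper TTS

def run_pipeline(audio: bytes) -> bytes:
    text = recognize_speech(audio)
    emotion = detect_emotion(audio)   # emotion comes from the audio, not the transcript
    reply = generate_response(text, emotion)
    return synthesize_speech(reply)

print(run_pipeline(b"\x00\x01"))
```

Note that emotion detection runs on the raw audio in parallel with transcription, since prosodic cues are lost once speech is reduced to text.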
- Happy - Joyful, excited, positive
- Sad - Melancholic, disappointed, down
- Angry - Frustrated, irritated, upset
- Anxious - Worried, stressed, nervous
- Calm - Peaceful, relaxed, content
- Neutral - Balanced, matter-of-fact
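As a simplified illustration of how prosodic features might map to these categories, the sketch below computes RMS energy and zero-crossing rate with NumPy and applies hand-tuned thresholds. The thresholds are invented for demonstration; the project's actual ML classifier would be trained on labeled speech:

```python
import numpy as np

def prosodic_features(signal):
    """RMS energy (loudness proxy) and zero-crossing rate (pitch/noisiness proxy)."""
    rms = float(np.sqrt(np.mean(signal ** 2)))
    zcr = float(np.mean(np.abs(np.diff(np.sign(signal)))) / 2)
    return rms, zcr

def rough_emotion(signal) -> str:
    """Toy threshold rules -- stands in for a trained classifier."""
    rms, zcr = prosodic_features(signal)
    if rms > 0.5:
        return "angry" if zcr > 0.3 else "happy"
    if rms < 0.1:
        return "sad" if zcr < 0.1 else "anxious"
    return "calm" if zcr < 0.2 else "neutral"

rng = np.random.default_rng(0)
loud_noisy = rng.standard_normal(16000)
print(rough_emotion(loud_noisy))  # loud + noisy -> "angry" under these toy rules
```

In practice, pitch contours, speaking rate, and spectral features (e.g. MFCCs) carry far more signal than these two statistics.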
- Python 3.8+ (3.9-3.11 recommended)
- FFmpeg (for audio processing)
- At least 4GB RAM (for local LLM)
- Clone or download the project:

  ```bash
  cd empathic-voice-companion
  ```

- Create a virtual environment:

  ```bash
  python -m venv venv
  source venv/bin/activate  # On Windows: venv\Scripts\activate
  ```

- Run the safe installer:

  ```bash
  python install.py
  ```

  This installer handles dependency compatibility issues automatically.
If you prefer manual installation:

```bash
# Install minimal dependencies
pip install -r requirements-minimal.txt

# Then run setup for models
python setup_models.py
```

To use the assistant, pick one of three entry points:

- Run the CLI application:

  ```bash
  python main.py
  ```

- Run the web interface:

  ```bash
  python app.py
  ```

  Then open http://localhost:8000 in your browser.

- Run the REST API server:

  ```bash
  python api_server.py
  ```

  The API is available at http://localhost:8001.

Edit config.yaml to customize:
- Emotion detection sensitivity
- Response personality styles
- Voice models and settings
- Audio input/output devices
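A sketch of what such a config.yaml might look like. The keys below are illustrative only; check the file shipped with the project for the actual schema:

```yaml
# Illustrative example -- actual keys may differ
emotion:
  sensitivity: 0.7           # 0.0 (coarse) .. 1.0 (fine-grained)
response:
  personality: "warm"        # e.g. warm, professional, playful
speech:
  stt_model: "whisper-base"
  tts_voice: "en_US-lessac-medium"
audio:
  input_device: default
  output_device: default
```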
```
empathic-voice-companion/
├── src/
│   ├── speech/
│   │   ├── recognition.py    # Whisper STT integration
│   │   └── synthesis.py      # Piper TTS integration
│   ├── emotion/
│   │   ├── detector.py       # Emotion detection engine
│   │   └── features.py       # Audio feature extraction
│   ├── response/
│   │   ├── generator.py      # LLM response generation
│   │   └── empathy.py        # Empathic response patterns
│   ├── memory/
│   │   └── conversation.py   # Conversation history
│   └── utils/
│       ├── audio.py          # Audio processing utilities
│       └── config.py         # Configuration management
├── models/                   # Downloaded AI models
├── data/                     # Training data and samples
├── tests/                    # Unit tests
├── web/                      # Web interface files
├── main.py                   # Main CLI application
├── app.py                    # Web application
├── api_server.py             # REST API server
├── requirements.txt          # Python dependencies
├── config.yaml               # Configuration file
└── setup_models.py           # Model download script
```
MIT License - See LICENSE file for details
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests
- Submit a pull request
- OpenAI Whisper for speech recognition
- Librosa for audio analysis
- Piper TTS for speech synthesis
- Hugging Face for ML models