🌐 Real-Time Voice Translator

A professional multilingual real-time voice translation application with separate backend and frontend architecture.

🏗️ Project Structure

Real_time_Voice_translater/
│
├── backend/                    # FastAPI Backend
│   ├── app.py                  # Main API application
│   ├── translation_graph.py    # Translation engine
│   ├── audio_handler.py        # Audio processing
│   ├── requirements.txt        # Python dependencies
│   ├── runtime.txt            # Python 3.11.0
│   ├── render.yaml            # Render deployment config
│   ├── .env                   # Environment variables
│   └── README.md              # Backend documentation
│
├── frontend/                   # Web Frontend
│   ├── index.html             # User interface
│   ├── app.js                 # Client-side logic
│   ├── styles.css             # Styling
│   └── README.md              # Frontend documentation
│
├── docs/                       # Documentation
│   ├── RENDER_DEPLOYMENT_STEPS.md
│   ├── QUICK_DEPLOY.md
│   ├── PROJECT_STRUCTURE.md
│   └── CLEAN_STRUCTURE.md
│
├── .env                        # Root environment variables
├── .gitignore                  # Git ignore rules
├── serve_frontend.py           # Development server
└── README.md                   # This file

✨ Features

🎤 Real-time Voice Translation - Speak and hear translations instantly
💬 Text Translation - Type and translate between languages
🌍 Multi-Language Support - English, Hindi, Telugu, Tamil, Kannada
🔄 WebSocket Communication - Real-time bidirectional translation
🎨 Modern UI - Beautiful, responsive interface
🔒 Secure - Environment-based API key management
🚀 Free Deployment - Deploy on Render and Vercel for free

🚀 Quick Start

Prerequisites

Python 3.11+
pip
Git
Web browser (Chrome/Edge recommended)

Local Development

1. Clone the Repository

git clone https://github.com/YOUR-USERNAME/voice-translator.git
cd voice-translator

2. Set Up Backend

# Create virtual environment
python -m venv venv

# Activate virtual environment
# Windows:
.\venv\Scripts\Activate
# Mac/Linux:
source venv/bin/activate

# Install dependencies
cd backend
pip install -r requirements.txt

# Create .env file
echo "GROQ_API_KEY=your_api_key_here" > .env

# Run backend
uvicorn app:app --host 0.0.0.0 --port 8000 --reload

Backend will be available at: http://localhost:8000

3. Set Up Frontend

Open a new terminal:

# From project root
python serve_frontend.py

Frontend will be available at: http://localhost:3000

Testing

Open http://localhost:3000 in two browser windows
Enter the same room code (e.g., room123)
Select different languages in each window
Start chatting!

📚 Documentation

Backend README - API documentation, endpoints, deployment
Frontend README - UI guide, customization, deployment
Deployment Guide - Step-by-step deployment
Quick Deploy - Quick reference commands

🌐 Deployment

Backend Deployment (Render)

Push code to GitHub
Create account on Render
Create new Web Service
Connect GitHub repository
Set root directory: backend
Add environment variable: GROQ_API_KEY
Deploy!

Detailed guide: RENDER_DEPLOYMENT_STEPS.md

Frontend Deployment (Vercel)

Create account on Vercel
Import GitHub repository
Set root directory: frontend
Update API URL in frontend/app.js
Deploy!

Detailed guide: Frontend README

🛠️ Tech Stack

Backend

FastAPI - Modern Python web framework
LangChain - LLM orchestration
Groq API - Fast AI inference
WebSockets - Real-time communication
Uvicorn - ASGI server

Frontend

HTML5/CSS3/JavaScript - Modern web technologies
WebSocket API - Real-time updates
Web Speech API - Voice recognition and synthesis
Glassmorphism UI - Modern design

📊 API Endpoints

REST API

GET / - Health check
POST /translate - Translate text
POST /audio/translate - Translate audio

WebSocket

WS /ws/{room_id} - Real-time translation

Full API documentation: http://localhost:8000/docs

🔑 Environment Variables

Create .env file in backend/:

GROQ_API_KEY=your_groq_api_key_here
PORT=8000

Get your Groq API key from: https://console.groq.com

🎯 Use Cases

Language Learning - Practice conversations in different languages
International Teams - Communicate across language barriers
Customer Support - Assist customers in their native language
Travel - Communicate while traveling abroad
Education - Teach and learn multiple languages

🤝 Contributing

Contributions are welcome! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📝 License

This project is open source and available under the MIT License.

🙏 Acknowledgments

Groq - For fast AI inference
LangChain - For LLM orchestration
FastAPI - For the amazing web framework
Render - For free hosting
Vercel - For frontend hosting

📧 Contact

Your Name - your.email@example.com

Project Link: https://github.com/YOUR-USERNAME/voice-translator

🐛 Known Issues

First request after inactivity may take 30-60 seconds (Render free tier)
Speech recognition requires HTTPS in production
Some browsers may not support all features

🔮 Roadmap

Add more languages (Spanish, French, German, etc.)
Implement conversation history
Add user authentication
Support for group conversations (3+ people)
Mobile app version (React Native)
Offline mode with local models
Voice cloning for personalized TTS
Real-time video translation

📈 Performance

Translation Speed: ~500ms average
WebSocket Latency: < 100ms
Concurrent Users: Unlimited
Uptime: 99.9% (on paid tier)

🔐 Security

API keys stored in environment variables
CORS configured for security
WebSocket connections validated
No sensitive data logged

💰 Cost

Free Tier:

Render: 750 hours/month
Vercel: Unlimited
Groq API: Free tier available

Total Cost: $0/month for moderate usage

🌟 Star History

If you find this project useful, please consider giving it a star! ⭐

Made with ❤️ by [Your Name]

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
backend		backend
docs		docs
frontend		frontend
GITHUB_PUSH_INSTRUCTIONS.md		GITHUB_PUSH_INSTRUCTIONS.md
LICENSE		LICENSE
README.md		README.md
RENDER_DEPLOYMENT_GUIDE.md		RENDER_DEPLOYMENT_GUIDE.md
RESTRUCTURE_COMPLETE.txt		RESTRUCTURE_COMPLETE.txt
USER_GUIDE.md		USER_GUIDE.md
serve_frontend.py		serve_frontend.py

License

karansrabcd01/voice-translator

Folders and files

Latest commit

History

Repository files navigation