JARVIS AI Assistant

A modular, voice-controlled AI assistant featuring a futuristic HUD interface, advanced automation skills, and a "living" responsiveness. Built with Python, PyQt6, and Groq's LLM engine.

🌟 Core Features

Futuristic HUD: A "Starark-style" interface featuring:
- Arc Reactor: Central animated core that pulses with voice activity.
- Hexagon Panel: Dynamic background visualizations.
- Telemetry: Real-time visual feedback bars.
Dual Modes:
- Voice Mode: Full hands-free interaction using speech recognition and TTS.
- Text Mode: Silent command-line interface for distinct environments.
Modular Skill System: Ease of extensibility. New capabilities can be added as drop-in modules in the skills/ directory.

🧠 Skills & Capabilities

JARVIS is equipped with a diverse set of skills:

🌐 Web & Communication

Web Operations: Performs Google searches and opens websites (web_ops).
WhatsApp: Automates messaging via selenium-driven web interface (whatsapp_skill).
Email: Capable of managing email operations (email_ops).

👁️ Vision & Sensing

Computer Vision: Real-time object detection using YOLO (detection_skill).
Camera Access: Captures photos and processes visual input (camera_skill).
Screen Awareness: Takes and analyzes screenshots (screenshot_ops).
Gemini Live: Advanced multimodal interaction capabilities (gemini_live_skill).

🛠️ System Control

System Operations: Controls volume, screen brightness, and application management (system_ops).
File Management: Create, read, and organize files (file_ops).
Context Awareness: Tracks date, time, and maintains long-term memory (memory_ops, datetime_ops).

🚀 Setup & Installation

Prerequisites

Python 3.10+
A Groq API Key for the LLM brain.

Installation

Clone the Repository
```
git clone <YOUR_REPO_URL>
cd JARVIC
```
Install Dependencies
```
pip install -r requirements.txt
```
Note: This project relies on PyQt6 for the GUI and ultralytics for vision.

Configure Environment Create a .env file in the root directory:

GROQ_API_KEY=your_key_here
# Add other keys as needed by specific skills

💻 Usage

Standard Voice Mode (with GUI)

python main.py

The HUD will launch.
Speak naturally to interact.
Click the center reactor to Pause/Resume listening.

Text-Only Mode

python main.py --text

Runs in the terminal without voice I/O. Ideal for debugging or quiet environments.

📂 Project Structure

core/: The brain (Engine), voice processing, and skill registry.
gui/: PyQt6 application logic and rendering.
skills/: Individual capability modules.
assets/: Images and resources.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Windows		Windows
assets		assets
core		core
gui		gui
skills		skills
.env.template		.env.template
.gitignore		.gitignore
PyWhatKit_DB.txt		PyWhatKit_DB.txt
README.md		README.md
check_safari.py		check_safari.py
gemini_client.py		gemini_client.py
inccomplit.txt		inccomplit.txt
learn.txt		learn.txt
list_models.py		list_models.py
list_voices.py		list_voices.py
main.py		main.py
psudocode.py		psudocode.py
requirements.txt		requirements.txt
test_whatsapp.py		test_whatsapp.py
verify_changes.py		verify_changes.py
verify_new_skills.py		verify_new_skills.py
verify_refactor.py		verify_refactor.py
verify_regex.py		verify_regex.py
video_system.py		video_system.py
yolov8n.pt		yolov8n.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JARVIS AI Assistant

🌟 Core Features

🧠 Skills & Capabilities

🌐 Web & Communication

👁️ Vision & Sensing

🛠️ System Control

🚀 Setup & Installation

Prerequisites

Installation

💻 Usage

📂 Project Structure

About

Uh oh!

Releases

Packages

Languages

Skandamrao/Project_JARVIS

Folders and files

Latest commit

History

Repository files navigation

JARVIS AI Assistant

🌟 Core Features

🧠 Skills & Capabilities

🌐 Web & Communication

👁️ Vision & Sensing

🛠️ System Control

🚀 Setup & Installation

Prerequisites

Installation

💻 Usage

📂 Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages