Aria

Traditional airport kiosks rely on touchscreens that are unhygienic, awkward to use while handling luggage, and impersonal. We wanted a voice-first, multimodal AI assistant that feels natural and helpful, like talking to a knowledgeable airport employee. By combining conversational AI with a 3D visual interface, Aria makes airport navigation more intuitive and engaging.

Tech Stack

Frontend

React with TanStack Router and Query
Three.js for 3D visualization
Tailwind CSS
Web Audio API for audio analysis

Backend

FastAPI with WebSocket support
LangGraph for agent orchestration
ElevenLabs TTS
MongoDB Atlas for RAG

Quick Start

Frontend

cd frontend
pnpm install
pnpm dev          # http://localhost:3000

Backend

cd backend
uv run main.py    # http://localhost:8000

The backend requires a .env file with:

vultr_api_key, vultr_model
elevenlabs_api_key, elevenlabs_voice_id
mongodb_uri, mongodb_database, mongodb_collection

Project Structure

frontend/          React + TanStack + Three.js
  src/
    components/Assistant3D/    3D particle sphere with audio reactivity
    routes/                    File-based routing
backend/           FastAPI + LangGraph
  core/            Abstractions, interfaces, events
  transport/       WebSocket handler
  session/         Session management
  agent/           LangGraph orchestration
  tools/           Tool registry and execution
  rag/             RAG service (stub)
  tts/             TTS service
  observability/   Structured logging

Frontend Architecture

Commands

cd frontend
pnpm dev          # Dev server on port 3000
pnpm build        # Build and type check
pnpm test         # Vitest tests
pnpm lint         # ESLint
pnpm format       # Prettier
pnpm check        # Format and fix

Routing

TanStack Router - File-based routing in src/routes/:

Routes auto-generate from files
Layout in __root.tsx wraps all routes with <Outlet />
Router context includes QueryClient for data fetching integration

TanStack Query - Data fetching:

Provider in src/integrations/tanstack-query/root-provider.tsx
QueryClient passed to router context in main.tsx
Use loaders in routes or useQuery in components

Path Alias: @/ maps to src/

3D Assistant Component

The component lives in src/components/Assistant3D/. It keeps animation state in refs rather than React state, so the 60fps animation loop runs without triggering re-renders.

Component Structure:

index.tsx - Entry point, WebGL detection, WebSocket integration
AssistantInterface.tsx - UI layer: mic toggle, prompt input, error toast
AssistantCanvas.tsx - Three.js scene, mouse interaction
entities/ParticleSphereEntity.ts - Particle system with audio-reactive deformation
hooks/useAudioAnalyzer.ts - Web Audio API with FFT analysis (bass, mid, high bands)
hooks/useAssistantAnimation.ts - Animation loop: dragging → momentum → auto-rotation
hooks/useThreeScene.ts - Three.js scene initialization
entities/PostProcessing.ts - Bloom effects via EffectComposer
constants/animationConstants.ts - Tunable parameters

Animation State Machine:

Dragging: Mouse rotation with velocity tracking (100ms window, 5-sample averaging)
Momentum: Post-drag inertia with exponential damping (0.95)
Auto-rotation: Idle rotation, entered once velocity falls below the 0.0001 threshold
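
The three states above can be sketched as a small per-frame update. This is a simplified illustration, not the code in hooks/useAssistantAnimation.ts (which is TypeScript): the names `RotationState`, `step`, and `AUTO_ROTATION_SPEED` are hypothetical, damping is applied once per frame rather than scaled by delta time, and the 5-sample velocity averaging is left out. Only the 0.95 damping factor and the 0.0001 threshold come from the list above.

```python
from dataclasses import dataclass

DAMPING = 0.95               # exponential damping factor (from the list above)
IDLE_THRESHOLD = 0.0001      # velocity below which momentum ends (from the list above)
AUTO_ROTATION_SPEED = 0.002  # hypothetical idle speed, radians per frame


@dataclass
class RotationState:
    mode: str = "auto"       # "dragging", "momentum", or "auto"
    velocity: float = 0.0    # radians per frame
    angle: float = 0.0


def step(state: RotationState, dragging: bool, drag_velocity: float = 0.0) -> RotationState:
    """Advance the rotation by one frame."""
    if dragging:
        # The pointer drives the rotation directly.
        state.mode = "dragging"
        state.velocity = drag_velocity
    elif state.mode == "dragging":
        # Pointer released: keep the last drag velocity as inertia.
        state.mode = "momentum"

    if state.mode == "momentum":
        state.velocity *= DAMPING
        if abs(state.velocity) < IDLE_THRESHOLD:
            state.mode = "auto"

    if state.mode == "auto":
        state.velocity = AUTO_ROTATION_SPEED

    state.angle += state.velocity
    return state
```

Starting from a drag velocity of 0.05, the state passes through momentum and settles into auto-rotation after roughly 120 frames of damping.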

Audio-Reactive Features:

FFT size 128, yielding 64 frequency bins
3 bands by bin index: bass (0-10), mid/vocal (10-20), high (30-64)
20 spike centers (Fibonacci sphere distribution)
Smoothing factor of 0.30 to prevent jitter
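
To make the numbers above concrete, here is a minimal sketch of the band averaging, the smoothing step, and the Fibonacci sphere placement. It is written in Python for illustration only; the real logic lives in the TypeScript hook and entity files. The function names are hypothetical, the input is assumed to be 64 byte magnitudes (0-255) as the Web Audio analyser returns them, and the smoothing factor is assumed to be the weight given to the newest sample.

```python
import math

# Bin ranges and smoothing factor taken from the list above.
BANDS = {"bass": (0, 10), "mid": (10, 20), "high": (30, 64)}
SMOOTHING = 0.30


def band_levels(bins: list[int]) -> dict[str, float]:
    """Average the byte magnitudes in each band and normalise to 0..1."""
    levels = {}
    for name, (lo, hi) in BANDS.items():
        window = bins[lo:hi]
        levels[name] = sum(window) / (len(window) * 255)
    return levels


def smooth(previous: dict[str, float], current: dict[str, float],
           factor: float = SMOOTHING) -> dict[str, float]:
    """Move each level a fraction `factor` of the way toward its new value."""
    return {k: previous[k] + (current[k] - previous[k]) * factor for k in current}


def fibonacci_sphere(count: int = 20) -> list[tuple[float, float, float]]:
    """Spread `count` points roughly evenly over the unit sphere."""
    golden_angle = math.pi * (3 - math.sqrt(5))
    points = []
    for i in range(count):
        y = 1 - 2 * (i + 0.5) / count      # height from near +1 down to near -1
        radius = math.sqrt(1 - y * y)      # radius of the horizontal circle at y
        theta = golden_angle * i
        points.append((radius * math.cos(theta), y, radius * math.sin(theta)))
    return points
```

Each animation frame would call `band_levels` on the latest FFT data, pass the result through `smooth` together with the previous frame's levels, and use the smoothed values to displace particles near the 20 spike centers.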

Performance Optimizations:

Refs for animation state (not React state)
Pixel ratio capped at 2x
No shadows
Delta-time aware
requestAnimationFrame pattern

Backend Architecture

Commands

cd backend
uv run main.py              # FastAPI server on port 8000 with reload
uv run python verify_refactor.py  # Verify refactoring
uv run pytest tests/        # Run tests

Setup Requirements

Python >=3.13
Package manager: uv
Dependencies: fastapi, uvicorn, websockets, langchain, langgraph, elevenlabs, motor, pymongo
Configuration: Uses config.py (loads from .env)

Layered Architecture

Transport (WebSocket) → Session Manager → Agent Orchestrator
                                            ↓
                        LLM Client | Tool Registry | RAG Service | TTS Service

Key Directories:

core/ - Core abstractions and contracts

interfaces.py - Protocols: Agent, Tool, RAGService, SessionStore
events.py - Event types for streaming (ReasoningEvent, ToolCallEvent, AudioEvent, etc.)
exceptions.py - Exception hierarchy with natural language translation
schemas.py - Shared Pydantic models

transport/ - WebSocket layer

websocket_handler.py - /ws endpoint, maintains WebSocket contract
serializers.py - Event → JSON conversion

session/ - Session management

session_manager.py - Lifecycle, interruption handling
session_store.py - In-memory store (future: DB persistence)
models.py - Session, ConversationTurn schemas

agent/ - Orchestration

orchestrator.py - Self-correcting loop, retry logic, event emission
graph.py - LangGraph implementation
state.py - AgentState schema
strategies.py - RetryStrategy
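
As a rough picture of the retry behaviour described for orchestrator.py, the sketch below runs an attempt function and emits a retry event before each new attempt. It is a synchronous, plain-Python simplification under stated assumptions: the actual orchestrator is built on LangGraph, `run_with_retries`, `attempt_fn`, and `emit` are hypothetical names, and `max_retries` is assumed to mean retries after the first attempt. The retry event shape follows the WebSocket API section.

```python
from typing import Callable, Optional


def run_with_retries(
    attempt_fn: Callable[[int], str],
    emit: Callable[[dict], None],
    max_retries: int = 3,
) -> Optional[str]:
    """Call attempt_fn until it succeeds or the retry budget is used up."""
    last_error: Optional[Exception] = None
    for attempt in range(max_retries + 1):   # one initial attempt plus retries
        if attempt > 0:
            # Tell the client why another attempt is being made.
            emit({"type": "retry", "attempt": attempt, "reason": str(last_error)})
        try:
            return attempt_fn(attempt)
        except Exception as exc:             # every failure counts against the budget
            last_error = exc
    emit({"type": "error", "message": "Sorry, I couldn't complete that request."})
    return None
```

Passing `events.append` as `emit` collects the events in a list, which is a convenient way to inspect the sequence in a test.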

tools/ - Tool layer

registry.py - Tool discovery, registration
executor.py - Resilient wrapper: timeout, error translation
base.py - BaseTool with is_read_only flag
implementations/ - time_tool.py, arithmetic_tools.py, rag_tool.py (stub)
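
The tool layer could look roughly like the sketch below. Only the names `BaseTool` and `is_read_only` come from the list above; `ToolRegistry`, its methods, the `run` signature, and the two example tools are hypothetical stand-ins for whatever base.py, registry.py, and implementations/ actually define.

```python
from abc import ABC, abstractmethod
from datetime import datetime, timezone


class BaseTool(ABC):
    """Common shape for tools (attribute names other than is_read_only are assumed)."""

    name: str
    is_read_only: bool = True   # read-only tools have no side effects

    @abstractmethod
    def run(self, **kwargs):
        """Execute the tool and return its result."""


class TimeTool(BaseTool):
    name = "current_time"

    def run(self, **kwargs):
        return datetime.now(timezone.utc).isoformat()


class AddTool(BaseTool):
    name = "add"

    def run(self, a: float, b: float, **kwargs):
        return a + b


class ToolRegistry:
    """Maps tool names to instances so the agent can look them up by name."""

    def __init__(self) -> None:
        self._tools: dict[str, BaseTool] = {}

    def register(self, tool: BaseTool) -> None:
        if tool.name in self._tools:
            raise ValueError(f"tool already registered: {tool.name}")
        self._tools[tool.name] = tool

    def get(self, name: str) -> BaseTool:
        return self._tools[name]

    def names(self) -> list[str]:
        return sorted(self._tools)
```

One plausible use of `is_read_only`, offered here as an assumption, is deciding whether a failed call is safe to repeat automatically.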

rag/ - RAG service (stub)

service.py - Retrieval orchestration (stub)

tts/ - Text-to-speech

service.py - TTS abstraction
elevenlabs_client.py - ElevenLabs implementation

observability/ - Structured logging

logger.py

WebSocket API

Client Sends:

{"message": "user text"}

Server Streams:

{"type": "audio", "chunk": "base64..."}
{"type": "alignment", "characters": [...], "character_start_times_seconds": [...], ...}
{"type": "done"}
{"type": "error", "message": "..."}
{"type": "retry", "attempt": N, "reason": "..."}
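
A client can consume this stream with a small dispatcher keyed on the type field. The sketch below uses only the standard library and leaves the socket itself out; `build_request` and `StreamCollector` are hypothetical names. It assumes that both done and error end the current turn, and it ignores any event type not listed above.

```python
import base64
import json


def build_request(text: str) -> str:
    """Serialise a user message in the shape the server expects."""
    return json.dumps({"message": text})


class StreamCollector:
    """Accumulates the events of one assistant turn."""

    def __init__(self) -> None:
        self.audio = bytearray()
        self.alignments: list[dict] = []
        self.retries: list[tuple[int, str]] = []
        self.error: str | None = None
        self.done = False

    def feed(self, raw: str) -> bool:
        """Apply one server frame; return True while the turn is still open."""
        event = json.loads(raw)
        kind = event.get("type")
        if kind == "audio":
            # Audio chunks arrive base64-encoded.
            self.audio.extend(base64.b64decode(event["chunk"]))
        elif kind == "alignment":
            self.alignments.append(event)
        elif kind == "retry":
            self.retries.append((event["attempt"], event["reason"]))
        elif kind == "error":
            self.error = event["message"]
            return False   # assumption: an error closes the turn
        elif kind == "done":
            self.done = True
            return False
        return True        # unknown event types are ignored
```

In a real client, each text frame received from the /ws endpoint would be passed to `feed` until it returns False, at which point the collected audio can be played back.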

Error Handling

AgentException base class with to_natural_language() method
Tool execution errors auto-translate to user-friendly messages
Resilient tool execution: timeout enforcement (30s default), schema validation
Errors never crash the app: every exception is caught and translated into a user-facing message
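
The pattern above can be sketched with asyncio.wait_for for the timeout and an exception hierarchy for the translation. `AgentException` and `to_natural_language()` are named in this section; the subclasses, `execute_tool`, `to_error_event`, and all message wording are hypothetical, and schema validation is omitted.

```python
import asyncio


class AgentException(Exception):
    """Base class: every agent error can describe itself to the user."""

    def to_natural_language(self) -> str:
        return "Sorry, something went wrong. Please try again."


class ToolTimeoutError(AgentException):
    def __init__(self, tool_name: str, timeout: float) -> None:
        super().__init__(f"{tool_name} timed out after {timeout}s")
        self.tool_name = tool_name

    def to_natural_language(self) -> str:
        return f"The {self.tool_name} tool took too long to respond."


class ToolExecutionError(AgentException):
    def __init__(self, tool_name: str, cause: Exception) -> None:
        super().__init__(f"{tool_name} failed: {cause!r}")
        self.tool_name = tool_name

    def to_natural_language(self) -> str:
        return f"I couldn't complete the {self.tool_name} step."


async def execute_tool(name, func, *, timeout: float = 30.0, **kwargs):
    """Run an async tool with a timeout, translating failures."""
    try:
        return await asyncio.wait_for(func(**kwargs), timeout=timeout)
    except asyncio.TimeoutError as exc:
        raise ToolTimeoutError(name, timeout) from exc
    except AgentException:
        raise                               # already translated
    except Exception as exc:
        raise ToolExecutionError(name, exc) from exc


def to_error_event(exc: Exception) -> dict:
    """Build the error event sent to the client for any exception."""
    if isinstance(exc, AgentException):
        message = exc.to_natural_language()
    else:
        message = AgentException().to_natural_language()
    return {"type": "error", "message": message}
```

Wrapping unexpected exceptions at the tool boundary keeps internal details such as stack traces and raw error strings in the logs and out of the spoken response.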

Configuration

Environment variables in .env (default values in parentheses):

vultr_api_key, vultr_model - LLM client
elevenlabs_api_key, elevenlabs_voice_id - Text-to-speech
mongodb_uri, mongodb_database, mongodb_collection - RAG database
tool_execution_timeout_seconds (30) - Tool timeout
llm_call_timeout_seconds (60) - LLM timeout
agent_max_execution_time_seconds (300) - Overall agent timeout
agent_max_retries (3) - Agent retry limit
tool_max_retries (1) - Tool retry limit
rag_top_k (5) - RAG results count
embedding_model (all-MiniLM-L6-v2) - Embedding model name
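
The sketch below shows one way these settings and their defaults could be resolved. It is not the project's config.py: `load_settings`, `REQUIRED`, and `DEFAULTS` are hypothetical names, the function takes any mapping (for example os.environ after the .env file has been loaded), and it assumes the variable names are used in lowercase exactly as listed.

```python
from typing import Mapping

# Settings with no default; assumed to be mandatory.
REQUIRED = (
    "vultr_api_key", "vultr_model",
    "elevenlabs_api_key", "elevenlabs_voice_id",
    "mongodb_uri", "mongodb_database", "mongodb_collection",
)

# Defaults as documented in the list above.
DEFAULTS = {
    "tool_execution_timeout_seconds": 30,
    "llm_call_timeout_seconds": 60,
    "agent_max_execution_time_seconds": 300,
    "agent_max_retries": 3,
    "tool_max_retries": 1,
    "rag_top_k": 5,
    "embedding_model": "all-MiniLM-L6-v2",
}


def load_settings(env: Mapping[str, str]) -> dict:
    """Validate required keys and apply typed defaults to the rest."""
    missing = [key for key in REQUIRED if not env.get(key)]
    if missing:
        raise KeyError(f"missing required settings: {', '.join(missing)}")

    settings: dict = {key: env[key] for key in REQUIRED}
    for key, default in DEFAULTS.items():
        raw = env.get(key)
        # Cast overrides to the type of the default (int or str).
        settings[key] = default if raw is None else type(default)(raw)
    return settings
```

Failing at startup when a required key is missing surfaces configuration problems before the first request reaches the LLM or TTS client.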

Design Decisions

Session Management: In-memory store, conversation history persists across turns
Event Streaming: All agent events stream to frontend for observability
Self-Correction: Orchestrator supports retry logic for error recovery
Tool Resilience: Timeout enforcement, validation, error translation
RAG: Stubbed for MongoDB Atlas Vector Search integration
