Skip to content

Comments

Emma Wang - Homework3-Submission#12

Open
EmmaW215 wants to merge 1 commit intoinference-ai-course:mainfrom
EmmaW215:main
Open

Emma Wang - Homework3-Submission#12
EmmaW215 wants to merge 1 commit intoinference-ai-course:mainfrom
EmmaW215:main

Conversation

@EmmaW215
Copy link

Core Features

1. Speech Recognition (ASR)

  • Engine: OpenAI Whisper
  • Model: Small (configurable)
  • Accuracy: >90%
  • Latency: 2-3 seconds

2. Language Model (LLM)

  • Model: Llama 3.2-1B
  • Context: 5-turn conversation memory
  • Response: Natural, conversational
  • Latency: 3-5 seconds

3. Text-to-Speech (TTS)

  • Engine: Google TTS (gTTS)
  • Quality: Natural-sounding
  • Languages: 100+
  • Latency: 1-2 seconds

4. API Server

  • Framework: FastAPI
  • Protocol: HTTP REST
  • Format: Multipart form-data
  • Documentation: Auto-generated (Swagger)

5. Session Management

  • Type: In-memory
  • Capacity: Unlimited concurrent sessions
  • History: Last 5 turns per session
  • Timeout: 1 hour (configurable)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant