Pipecat Gemini Live Demo

End-to-end starter kit for building a realtime Pipecat experience that pairs a FastAPI backend with an Expo/React Native client. The backend provisions Daily rooms and meeting tokens, runs a Pipecat pipeline that talks to Gemini Live, and exposes RTVI-friendly endpoints. The mobile app is an Expo TypeScript project that hosts the Pipecat RN SDK integration.

TODO (next repo visit): Before assuming Gemini sees video, upgrade Pipecat to the latest release, reinstall deps (pip install -e "./server[dev]"), and inspect pipecat.services.google.gemini_live.GeminiModalities / GeminiLiveLLMService to confirm a VIDEO enum and frame forwarding exist. If VIDEO appears, re-enable GOOGLE_MODALITIES=AUDIO_AND_VIDEO and rerun the end-to-end test flow.

Repository Layout

```
.
├── server/          # FastAPI app, services, and API tests
├── mobile/          # Expo/React Native project (TypeScript)
├── docs/            # Architecture notes and design references
└── .github/         # Copilot instructions + automation hooks
```

Requirements

macOS or Linux with Homebrew (recommended)
Python 3.11 (backend)
Node.js 18+ and npm (Expo CLI)
Xcode Simulator or Android emulator / physical device for testing the app

Backend Setup

Create / activate the virtual environment

```
python3.11 -m venv .venv
source .venv/bin/activate
```

Install dependencies
```
pip install -e "./server[dev]"
```

Copy the sample environment and fill in secrets

```
cp .env.example .env
# edit .env with your Daily, Google, and Expo base URL settings
```

The backend reads configuration via pydantic-settings, so any variable defined in .env automatically flows into Settings.

Running the API locally

```
source .venv/bin/activate
uvicorn server.app.main:app --factory --reload
```

Default port: http://127.0.0.1:8000
Health check: GET /health
Session endpoints: POST /api/rtvi/start, POST /api/rtvi/{sessionId}/stop, GET /api/rtvi/{sessionId}
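To exercise these endpoints without the mobile app, a small standard-library client along the following lines may help. It is only a sketch: the request body field (`displayName`) is a hypothetical name, and the code assumes the endpoints return JSON, neither of which this README specifies.

```python
import json
from urllib import request


def build_start_request(base_url: str, payload: dict) -> request.Request:
    """Build the POST /api/rtvi/start request (body fields are hypothetical)."""
    return request.Request(
        url=f"{base_url.rstrip('/')}/api/rtvi/start",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_stop_request(base_url: str, session_id: str) -> request.Request:
    """Build the POST /api/rtvi/{sessionId}/stop request."""
    return request.Request(
        url=f"{base_url.rstrip('/')}/api/rtvi/{session_id}/stop",
        data=b"",
        method="POST",
    )


def send(req: request.Request) -> dict:
    """Send a request and decode the response, assuming a JSON body."""
    with request.urlopen(req, timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the server running on the default port, `send(build_start_request("http://127.0.0.1:8000", {"displayName": "Tester"}))` would start a session, and `send(request.Request("http://127.0.0.1:8000/health"))` would hit the health check.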

Running the backend tests

```
source .venv/bin/activate
pytest server/tests -q
```

Key environment variables

| Variable | Purpose |
| --- | --- |
| `EXPO_PUBLIC_API_BASE_URL` | Base URL that the Expo app will call (e.g., `https://<tunnel>.ms` so devices can reach FastAPI). |
| `API_PREFIX` | Route prefix for backend APIs (`/api` by default). |
| `ALLOW_ORIGINS` | JSON array of dev/prod origins to allow through CORS (include Expo dev server ports). |
| `LOG_LEVEL` | Controls FastAPI logging verbosity (`INFO`, `DEBUG`, etc.). |
| `DAILY_API_KEY` | Lets the backend create Daily rooms and issue meeting tokens. |
| `DAILY_API_URL` | Daily REST endpoint (usually `https://api.daily.co/v1`). |
| `DAILY_SAMPLE_ROOM_URL` | Optional fixed room URL for local testing when you don't auto-create rooms. |
| `DAILY_ROOM_EXP_MINUTES`, `DAILY_TOKEN_EXP_MINUTES` | Lifetimes (minutes) for ad-hoc rooms and Daily access tokens. |
| `MOCK_DAILY` | Set `true` to bypass Daily REST calls and generate mock tokens. |
| `GOOGLE_API_KEY` | Gemini Live API key for the Pipecat pipeline. |
| `GOOGLE_MODEL`, `GOOGLE_VOICE_ID`, `GOOGLE_LANGUAGE`, `GOOGLE_REGION` | Voice + locale tuning plus routing hints for Gemini Live. |
| `GOOGLE_API_VERSION`, `GOOGLE_MODALITIES` | Optional overrides for advanced Gemini Live features (`AUDIO_AND_VIDEO` is intended to enable video input; see Troubleshooting for the current limitation). |
| `SYSTEM_INSTRUCTION` | Default instructions given to Gemini for each session. |
| `BOT_NAME` | Friendly display name for your Pipecat assistant. |
| `BOT_RUNNER_ENABLED` | Toggle to disable the Pipecat runner during tests. |
| `ENABLE_VIDEO_PIPELINE` | When `true` (and `GOOGLE_MODALITIES=AUDIO_AND_VIDEO`), Pipecat is meant to forward Daily camera frames to Gemini Live. With Pipecat 0.0.94 the frames do not reach Gemini yet (see Troubleshooting). |
| `SESSION_TTL_SECONDS`, `CLEANUP_INTERVAL_SECONDS` | Session lifecycle timing knobs used by the cleanup service. |
| `DUMMY_TOKENS_ENABLED` | Generate placeholder auth tokens for development flows. |

See .env.example for the complete list with defaults.
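The two `*_EXP_MINUTES` values are lifetimes, whereas Daily's REST API expects an absolute Unix timestamp in the `exp` property of a room or meeting token. The sketch below shows one way that conversion could be done; the helper names are hypothetical and this is not taken from the project's server code.

```python
import time
from typing import Optional


def expiry_timestamp(minutes: int, now: Optional[float] = None) -> int:
    """Convert a lifetime in minutes into a Unix timestamp in seconds."""
    if minutes <= 0:
        raise ValueError("lifetime must be a positive number of minutes")
    base = time.time() if now is None else now
    return int(base) + minutes * 60


def room_payload(room_exp_minutes: int, now: Optional[float] = None) -> dict:
    """Request body for creating a Daily room that expires after the lifetime."""
    return {"properties": {"exp": expiry_timestamp(room_exp_minutes, now)}}


if __name__ == "__main__":
    # Example: a room that expires 30 minutes from now.
    print(room_payload(30))
```

Passing `now` explicitly keeps the helpers deterministic in tests; in normal use it can be omitted.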

Mobile (Expo) Setup

The Expo client now includes:

A Pre-Join experience where you can set the FastAPI base URL, choose your display name, and provide a short system prompt.
A Session screen with split Daily-powered video panes (local preview + Gemini remote feed), live transcripts, audio meters, and controls to send text prompts, restart the transport, or hang up.
A reusable VoiceSessionProvider that wraps the Pipecat RN SDK + Daily transport, handles camera/mic permissions, primes devices via transport.initDevices(), and streams transcripts/audio levels through Zustand state.

Because Pipecat relies on native Daily modules, this project uses the @daily-co/config-plugin-rn-daily-js plugin. You need to generate development builds (Expo Go will not load the native modules).

Configure environment variables

```
cd mobile
cp .env.example .env
# update EXPO_PUBLIC_API_BASE_URL so devices can reach your FastAPI server
```

Install project dependencies
```
npm install
```
Prebuild native projects and install pods (first run or after native dependency changes):
```
npx expo prebuild --clean
```
Run on a simulator / device
```
npm run ios   # or: npm run android
```
Tip: use npx expo start --dev-client --tunnel if running on a physical device that needs to reach your local backend.

Inside the app, tap Start Conversation on the Pre-Join screen. The provider will request camera/mic permissions up front, call POST /api/rtvi/start, join the Daily room via RNDailyTransport, and render the conversation with Gemini Live. Use the new Restart Session button on the session screen if you need to cycle the transport without leaving the call.

Verification script

Before shipping changes to Gemini Live configuration, run the helper script from the repo root:

```
./scripts/verify-video.sh
```

It activates the virtualenv, runs pytest server/tests -q, type-checks the Expo app with npx tsc --noEmit, and then prints the manual steps for launching FastAPI + Expo (npx expo start --clear).
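For readers who want the same automated checks from Python (for example in CI), the sequence could be approximated as below. This is a sketch, not the contents of scripts/verify-video.sh: it skips virtualenv activation, and running `tsc` from the mobile/ directory is an assumption.

```python
import subprocess
from typing import Callable, List, Sequence, Tuple

# (command, working directory) pairs; the mobile/ cwd for tsc is an assumption.
STEPS: List[Tuple[Sequence[str], str]] = [
    (["pytest", "server/tests", "-q"], "."),
    (["npx", "tsc", "--noEmit"], "mobile"),
]


def run_steps(run: Callable[..., subprocess.CompletedProcess] = subprocess.run) -> bool:
    """Run each step in order and stop at the first non-zero exit code."""
    for command, cwd in STEPS:
        result = run(list(command), cwd=cwd)
        if result.returncode != 0:
            print(f"FAILED: {' '.join(command)}")
            return False
    print("Automated checks passed; launch FastAPI and Expo manually.")
    return True
```

The `run` parameter exists so the sequence can be tested with a stand-in for `subprocess.run`; calling `run_steps()` with no arguments executes the real commands.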

Additional Documentation

docs/architecture.md – high-level system overview, runtime flow, and component responsibilities.
docs/video-integration-plan.md – detailed Gemini Live rollout guide plus the full 26-gemini-multimodal-live.py reference snippet now mirrored locally.
.github/copilot-instructions.md – workspace-specific automation guardrails.

Troubleshooting

Missing Python packages: Ensure the virtual environment is active (which python should point inside .venv).
Module import errors in tests: Confirm pip install -e "./server[dev]" ran successfully and you're using Python 3.11+.
Daily API failures: Toggle MOCK_DAILY=true locally or set DAILY_SAMPLE_ROOM_URL to reuse a test room until you have your API key ready.
Gemini "can't see" video: The backend currently installs Pipecat 0.0.94 (confirm with pip show pipecat-ai), whose GeminiModalities enum only exposes TEXT and AUDIO. InputParams.modalities therefore rejects strings like AUDIO_AND_VIDEO, and GeminiLiveLLMService.handle_user_image_frame is a no-op, so Daily camera frames never reach Gemini. To enable true video grounding you'll need a Pipecat release that adds a VIDEO modality plus frame forwarding; track the Pipecat release notes and upgrade when that lands.
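
After upgrading Pipecat, a quick check along these lines can show whether a VIDEO modality has appeared. It is a minimal sketch: the module path and the GeminiModalities name are the ones mentioned in this README, the helper names are hypothetical, and the code assumes GeminiModalities is a standard Python Enum.

```python
import enum
import importlib
from typing import Optional, Type


def has_member(enum_cls: Type[enum.Enum], name: str = "VIDEO") -> bool:
    """Return True if the enum defines a member with the given name."""
    return name in enum_cls.__members__


def load_gemini_modalities() -> Optional[Type[enum.Enum]]:
    """Import GeminiModalities from the module path named in this README.

    Returns None when Pipecat is missing or its module layout differs.
    """
    try:
        module = importlib.import_module("pipecat.services.google.gemini_live")
    except Exception:
        # Covers a missing package as well as errors raised during import.
        return None
    return getattr(module, "GeminiModalities", None)


if __name__ == "__main__":
    modalities = load_gemini_modalities()
    if modalities is None:
        print("GeminiModalities not found; check the installed Pipecat version.")
    else:
        print("members:", list(modalities.__members__))
        print("VIDEO supported:", has_member(modalities))
```

A `True` result only shows that the enum member exists; frame forwarding in GeminiLiveLLMService still needs to be confirmed separately.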
