Integration with pipecat framework #3014

Open
wizd opened this issue Jan 21, 2025 · 3 comments

Comments

wizd commented Jan 21, 2025

Summary

Pipecat is an open-source framework for voice and multimodal conversational AI. (link)

Motivation

Pipecat supports multiple WebRTC services in a modular way; however, there are currently no open-source providers among them. Pipecat is therefore an excellent option for anyone looking to build their own WebRTC solution.

Describe alternatives you've considered

Pipecat + Pion would make a full-stack open-source WebRTC solution.

Additional context

Real-Time Voice and Video Inference (RTVI)

dsa commented Jan 21, 2025

Check out:
https://github.com/livekit/livekit
https://github.com/livekit/agents

It’s built on Pion, and fully open source including the transport.

kwindla commented Jan 23, 2025

In case it's useful, there's a Pipecat LiveKit transport:

https://github.com/pipecat-ai/pipecat/blob/main/src/pipecat/transports/services/livekit.py

Example:

https://github.com/pipecat-ai/pipecat/blob/main/examples/foundational/29-livekit-audio-chat.py

Separately, though, it would be really fun to build a little hackable WebRTC server designed from the ground up for realtime AI.

I've been thinking about this a little bit and would love to work on it if other people are interested. Pion and Pipecat could fit together really nicely for this. I've written chunks of three or four SFUs (depending on how you count) over the years, and I think that the requirements/architecture for voice+vision AI are different from the things that shaped SFU design up until now.

wizd commented Jan 24, 2025

> In case it's useful, there's a Pipecat LiveKit transport:
>
> https://github.com/pipecat-ai/pipecat/blob/main/src/pipecat/transports/services/livekit.py
>
> Example:
>
> https://github.com/pipecat-ai/pipecat/blob/main/examples/foundational/29-livekit-audio-chat.py
>
> Separately, though, it would be really fun to build a little hackable WebRTC server designed from the ground up for realtime AI.
>
> I've been thinking about this a little bit and would love to work on it if other people are interested. Pion and Pipecat could fit together really nicely for this. I've written chunks of three or four SFUs (depending on how you count) over the years, and I think that the requirements/architecture for voice+vision AI are different from the things that shaped SFU design up until now.

I couldn't agree more. Next-gen agent ecosystems are set to run on WebRTC networks, and I believe we can work together to develop an improved server architecture for these systems.
