๐ฅ YouTube Transcriber (LangExtract + Streamlit)
This project is an interactive YouTube Q&A Transcriber built with Streamlit and powered by LangExtract for transcript extraction and semantic search. Users can paste a YouTube link, ask natural language questions about the video, and instantly receive AI-generated answers.
๐ Features
๐ Input any YouTube video URL.
๐ค Ask questions about the video content.
๐ง Uses LangExtract for transcript retrieval + embeddings.
๐จ Simple, clean Streamlit interface.
๐ณ Containerized with Docker for easy deployment.
๐ฆ Installation
Clone the repo:
git clone https://github.com/your-username/youtube-assistant.git cd youtube-assistant
Install dependencies:
pip install -r requirements.txt
Run locally:
streamlit run main.py
๐ Environment Setup
Create a .env file (or pass via Streamlit sidebar):
OPENAI_API_KEY=your_api_key_here
Get your API key: OpenAI API Keys
๐ณ Run with Docker
Build the image:
docker build -t youtube-transcriber .
Run the container:
docker run -p 8501:8501 youtube-transcriber
Access the app at: http://localhost:8501
๐ Example
Paste a YouTube URL in the sidebar.
Enter your OpenAI API key.
Ask: โSummarize the main ideas in this videoโ.
Get instant AI answers based on transcript + embeddings.
๐ Tech Stack
Python 3.9+
Streamlit โ UI
LangExtract โ transcript + retrieval
OpenAI โ LLM responses
Docker โ deployment
๐ Roadmap
Support multiple videos at once.
Add caching for transcripts.
Option to export Q&A results as PDF.
๐ License
MIT License โ free to use and modify.