Open Sourced NoteBookLM

Podcast Creator

Overview

The Podcast Creator script (an open-source take on NotebookLM) automates the creation of a podcast from a PDF document. It extracts text from the PDF, generates a detailed podcast script using OpenAI's GPT-4 model, converts the script to audio, and combines the audio with images of the PDF pages to create a video. The final output is an audio file and a video file with synchronized audio.

Examples

Mistral 7B

Watch the video

Llama 2

Watch the video

Attention is all you need

Watch the video

open sourced NoteBookLM Features

  1. PDF Text Extraction: Extracts text content from a PDF document.
  2. Script Generation: Uses OpenAI's GPT-4 model to generate a detailed podcast script based on the extracted text.
  3. Text-to-Speech Conversion: Converts the generated script into audio using OpenAI's text-to-speech capabilities.
  4. Audio Processing: Processes the audio to ensure it meets the desired specifications (e.g., stereo, sample rate).
  5. Video Creation: Converts PDF pages to images and combines them with the audio to create a video.
  6. Environment Configuration: Loads environment variables from a .env file for secure API key management.
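Feature 5 pairs page images with the audio track. The README does not say how long each page stays on screen, so the following is only a minimal sketch, assuming the audio duration is divided evenly across the pages. `page_windows` is a hypothetical helper, not a function from the project.

```python
def page_windows(audio_seconds: float, page_count: int) -> list[tuple[float, float]]:
    """Return (start, end) times in seconds for each page image.

    Assumption: every page gets an equal share of the audio. The actual
    script may time pages differently.
    """
    if page_count <= 0:
        raise ValueError("page_count must be positive")
    if audio_seconds < 0:
        raise ValueError("audio_seconds must not be negative")
    per_page = audio_seconds / page_count
    return [(i * per_page, (i + 1) * per_page) for i in range(page_count)]


if __name__ == "__main__":
    # Example: a 90-second episode made from a 3-page PDF.
    for number, (start, end) in enumerate(page_windows(90.0, 3), start=1):
        print(f"page {number}: {start:.1f}s to {end:.1f}s")
```

An even split keeps the video length equal to the audio length by construction, which is the property "synchronized audio" depends on.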

Workflow

  1. Extract Text from PDF: The script starts by extracting text from the provided PDF file.
  2. Generate Podcast Script: The extracted text is used to generate a podcast script featuring two hosts, Alice and John, who engage in a detailed conversation about the content.
  3. Convert Script to Audio: The script is converted to audio, with different voices assigned to Alice and John.
  4. Process Audio: The audio is processed to ensure it is in the correct format and quality.
  5. Create Video: Images of the PDF pages are created and combined with the audio to produce a video.
  6. Save Outputs: The final audio and video files are saved to the specified output paths.
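Steps 2 and 3 imply that the generated script can be split into per-speaker turns before each turn is sent to text-to-speech. A minimal sketch of that split, assuming the script uses one `Speaker: text` line per turn. The line format, the `split_turns` helper, and the voice names in `VOICES` are assumptions for illustration; the README only says that Alice and John get different voices.

```python
import re

# Hypothetical voice mapping. The names are assumed, not taken from the project.
VOICES = {"Alice": "nova", "John": "onyx"}

# Matches lines such as "Alice: Welcome to the show."
TURN_PATTERN = re.compile(r"^(Alice|John):\s*(.+)$")


def split_turns(script: str) -> list[tuple[str, str, str]]:
    """Return (speaker, voice, text) for every 'Speaker: text' line.

    Lines that do not start with a known speaker name are skipped.
    """
    turns = []
    for line in script.splitlines():
        match = TURN_PATTERN.match(line.strip())
        if match:
            speaker, text = match.groups()
            turns.append((speaker, VOICES[speaker], text))
    return turns


if __name__ == "__main__":
    demo = "Alice: Welcome to the show.\nJohn: Glad to be here."
    for speaker, voice, text in split_turns(demo):
        print(f"{speaker} ({voice}): {text}")
```

Each resulting tuple carries everything a text-to-speech call would need for one turn, so the audio segments can be generated in order and concatenated afterwards.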

How to Use open sourced NoteBookLM

To run the project:

  1. Install Poetry and the project dependencies:

   pip install poetry
   poetry install

  2. Fill in the .env file with your OpenAI API key:

   OPENAI_API_KEY=""

  3. Change into the project directory:

   cd podcast_creator

  4. Provide the path to your PDF file in the script's entry point. The script handles the rest: generating the podcast script, converting it to audio, processing the audio, and creating the video.

   if __name__ == "__main__":
       pdf_path = "/path/to/your/pdf/document.pdf"
       create_podcast_from_pdf(pdf_path)

  5. Run the script:

   poetry run python podcast_creator/main.py
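Because the .env template ships with an empty `OPENAI_API_KEY`, it can help to fail early when the key was never filled in. A minimal sketch, assuming the variables from .env have already been loaded into the process environment (for example with `load_dotenv()` from python-dotenv). `require_api_key` is a hypothetical helper, not part of the project.

```python
import os
from typing import Mapping


def require_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the OpenAI API key, or raise if it is missing or blank."""
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is missing or empty; set it in the .env file"
        )
    return key


if __name__ == "__main__":
    try:
        require_api_key()
        print("OPENAI_API_KEY is set")
    except RuntimeError as error:
        print(error)
```

Passing the environment as a parameter keeps the check easy to test without touching the real process environment.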

Dependencies

  • langchain
  • pydantic
  • openai
  • pydub
  • fitz (PyMuPDF)
  • numpy
  • subprocess (Python standard library, no installation needed)
  • tqdm
  • PIL (Pillow)
  • textwrap (Python standard library, no installation needed)
  • dotenv (python-dotenv)

Ensure all dependencies are installed before running the script.
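One way to confirm that the third-party packages are importable before a run is to probe each module name without importing it. This is a minimal sketch. The module names in `REQUIRED_MODULES` are the import names assumed for the packages listed above, and `missing_modules` is a hypothetical helper.

```python
from importlib.util import find_spec

# Import names assumed for the third-party packages in the dependency list.
# subprocess and textwrap are omitted because they ship with Python.
REQUIRED_MODULES = [
    "langchain", "pydantic", "openai", "pydub",
    "fitz", "numpy", "tqdm", "PIL", "dotenv",
]


def missing_modules(names: list[str]) -> list[str]:
    """Return the top-level module names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are available.")
```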

Conclusion

The Podcast Creator (open sourced NoteBookLM) script converts PDF documents into podcast episodes with both audio and video outputs. It automates the entire workflow, from text extraction through script generation, text-to-speech conversion, and video assembly, so a podcast can be produced from a textual document by running a single script.

License

Let's Have a Chat ;)