Drop in a paper. Get summaries, highlights, margin notes, searchable embeddings, and a full audio walkthrough. Fully offline, fully yours.
- Process PDFs from local path or URL.
- High-level summary of the entire paper using Gemini.
- Paragraph-by-paragraph walkthrough:
- Narration of each paragraph.
- Explanation of each paragraph (using Gemini with Ollama fallback).
- Visual Highlighting: The current paragraph is highlighted on the page.
- Full Video Output: Generates a single MP4 video combining audio and visuals.
- Offline-first: Core functionality works offline. Gemini API is used when available for higher quality explanations.
hearyourpaper/
βββ main.py
βββ cli/
β βββ process.py
βββ core/
β βββ config.py
β βββ fetch_pdf.py
β βββ pdf_parser.py
β βββ explainer.py
β βββ llm_gemini.py
β βββ llm_ollama.py
β βββ tts.py
β βββ highlight.py
β βββ video_builder.py
β βββ pipeline.py
β βββ utils.py
βββ storage/
β βββ raw/
β βββ parsed/
β βββ text/
β βββ audio/
β βββ frames/
β βββ video/
βββ config.yaml
βββ LICENSE
βββ README.md
python3 main.py <PATH_TO_PDF_OR_URL>
# example:
python3 main.py https://arxiv.org/pdf/2401.10166