Skip to content

Vector PDF Pro is not just another PDF tool—it is an autonomous processing sandbox. Unlike traditional solutions that require complex system installations (Poppler, Ghostscript, Redis), Vector PDF Pro bundles its entire dependency tree into a single portable executable.

Notifications You must be signed in to change notification settings

CL4YMOR3/Vector-PDF-Pro

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Vector PDF Pro Logo

⚡ Vector PDF Pro: The Autonomous Standalone PDF Engine

Vector PDF Pro is a high-performance, single-executable PDF orchestration engine. It self-manages its internal infrastructure (Redis, Tesseract, Poppler) at runtime, delivering industrial-strength PDF processing with zero system installation.

Architecture Overview FastAPI Svelte Build


🚀 Key Features

PDF Manipulation

Feature Description Library
Merge Combine multiple PDFs with resource deduplication pikepdf
Split/Extract Extract specific pages or page ranges pikepdf
Reorder Rearrange pages in any order pikepdf
Rotate Rotate pages by 90°, 180°, or 270° pikepdf
Remove Pages Delete specific pages from PDF pikepdf

Conversion Engine

Feature Description Library
PDF → Images High-DPI rendering to JPG/PNG (delivered as ZIP) PyMuPDF
Images → PDF Combine multiple images into a single PDF PyMuPDF
PDF → Word Convert to editable .docx with layout preservation pdf2docx
Word → PDF Convert .docx back to PDF docx2pdf
PDF → PPTX Render PDF pages as high-quality slides PyMuPDF + python-pptx
PPTX → PDF Convert PowerPoint presentations to PDF comtypes (Office Req)
PDF → XLSX Intelligent table extraction to spreadsheets pdfplumber + pandas
XLSX → PDF Convert Excel workbooks to PDF reports comtypes (Office Req)
PDF → Text High-fidelity text stream extraction PyMuPDF
Text → PDF Create paginated PDFs from raw text files reportlab

Security & Enhancement

Feature Description Library
Encryption Password-protect with AES-256 pikepdf
Compression 3-tier optimization (structure, images, streams) pikepdf + PyMuPDF
Watermarks Text, image, or PDF overlays PyMuPDF + pikepdf

🏗️ Technical Architecture

1. The Autonomous Sandbox (vector.py)

Vector PDF Pro acts as a Process Manager. When launched, it:

  • Recursive Spawn Guard: Uses multiprocessing.freeze_support() to safely manage child processes on Windows.
  • Infrastructure Boot: Launches a hidden, bundled redis-server.exe and a Solo Pool Celery worker using sys.executable --worker.
  • Dynamic PATH Injection: Maps internal Tesseract and Poppler binaries to the system's PATH for the session duration.

2. Svelte SPA Dashboard

A modern, interactive dashboard with a premium Silicon Valley look:

  • Design System: Dark-first theme with glassmorphism and vivid accents.
  • Hero Dropzone: Interactive zone with real-time feedback.
  • Multi-Step Monitor: Live task tracking with status badges.
  • Embedded Console: Visual log viewer displaying the engine's heartbeat.

3. Core Engine Structure

backend/app/core/
├── merge_service.py       # PDF merging with resource sharing
├── organize_service.py    # Split, reorder, remove pages
├── security_service.py    # Encryption & rotation
├── compression_service.py # 3-tier compression pipeline
├── conversion_service.py  # Hybrid conversion engine
├── watermark_service.py   # Text, image, and PDF overlays
└── document_store.py      # Extension-aware storage with TTL cleanup

🛠️ Usage

Developer Setup

# Navigate to the backend
cd backend

# Install dependencies
pip install -r requirements.txt

# Launch development environment
python vector.py

Standalone Build

Vector PDF Pro is designed to be bundled into a single .exe.

# Inside the backend folder
pyinstaller vector.spec

The final production executable will be located in the dist/ directory.

Important

Anti-Virus Note: Since Vector PDF Pro bundles its own infrastructure (Redis, Workers), some enterprise AVs may flag it as a "False Positive". If the app fails to launch, please add an exclusion for the executable.


📂 Project Anatomy

/backend
  /app          # FastAPI routes & Core "Engine" services
  /resources     # Bundled binary dependencies (Redis, Tesseract, etc.)
  vector.py      # Main Bootstrap & "Sandbox" Orchestrator
  vector.spec    # PyInstaller Production Blueprint
  main.py        # FastAPI Gateway Entry
/frontend       # Svelte SPA project (Dashboard)
/dist           # Target directory for Production Builds

🛡️ Maintenance & Reliability

  • Process Guardian: Uses atexit to surgically terminate background Redis and Worker processes on exit.
  • TTL Garbage Collection: Automatic cleanup of orphaned files every 24 hours.
  • Zero-Copy Uploads: Files are spooled directly to disk for maximum performance.

About

Vector PDF Pro is not just another PDF tool—it is an autonomous processing sandbox. Unlike traditional solutions that require complex system installations (Poppler, Ghostscript, Redis), Vector PDF Pro bundles its entire dependency tree into a single portable executable.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published