PDF Toolkit (offline CLI + optional Obsidian UI)

A small, local PDF utility for repeatable document prep workflows (especially “scan → images → OCR-ready page images”). Built to be CLI-first, safe by default, and auditable via per-command JSON manifests.

This project is intentionally lightweight: once dependencies are installed, it runs fully offline.

Optional thin UI wrapper: pdf-toolkit-obsidian-plugin (keeps the CLI as the contract; the plugin just calls the commands).

Why this exists

This started as a local-first alternative to subscription PDF tooling and untrusted freeware. I wanted a small, offline CLI pipeline for preparing scanned PDFs into OCR-ready page images (rotate, render, crop, split spreads) with deterministic outputs.

This tool will also make workflows:

Predictable (deterministic naming + config precedence)
Safe (explicit overwrite, dry-run, clear output locations)
Hand-off friendly (JSON manifest records inputs/options/outputs/actions)
Easy to integrate (CLI contract usable by scripts or a thin UI wrapper)

Features

Render PDF pages to PNGs (PyMuPDF)
Split a PDF into multiple PDFs
Rotate PDF pages or rotate PNGs (Pillow)
Prepare “page-images”: split spread scans into single pages + crop page bounds (Pillow)
Safe defaults with --dry-run and --overwrite
JSON manifest written for each command (inputs/options/outputs + action log)

Install (Windows)

Create and activate a virtual environment (optional but recommended):

python -m venv .venv
.venv\Scripts\Activate.ps1

Install dependencies:

pip install -r requirements.txt

Install in editable mode so python -m pdf-toolkit works:

pip install -e .

If you prefer not to install it, you can temporarily set PYTHONPATH:

$env:PYTHONPATH = "src"

Quickstart

See all commands:

python -m pdf-toolkit --help

Recommended pipeline (typical scan/OCR prep):

render -> page-images

Example:

python -m pdf-toolkit render --pdf "in.pdf" --out_dir "out\pages" --dpi 300 --format png --prefix "book"
python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --glob "*.png" --mode auto --debug

Commands

Render PDF to PNG

python -m pdf-toolkit render --pdf "in.pdf" --out_dir "out\pages" --dpi 300 --format png --prefix "book1"

Dry-run (no files written):

python -m pdf-toolkit render --pdf "in.pdf" --out_dir "out\pages" --pages "1-10,15" --dry-run

Output naming is predictable: book1_p0001.png, book1_p0002.png, etc.

Split PDF

Explicit ranges:

python -m pdf-toolkit split --pdf "in.pdf" --out_dir "out\splits" --ranges "1-120,121-240" --prefix "book"

Automatic chunking:

python -m pdf-toolkit split --pdf "in.pdf" --out_dir "out\splits" --pages_per_file 120 --prefix "book"

Outputs: book_part01.pdf, book_part02.pdf, etc.

Rotate PDF pages

python -m pdf-toolkit rotate pdf --pdf "in.pdf" --out_pdf "in_rotated.pdf" --degrees 90 --pages "all"

In-place (overwrites input):

python -m pdf-toolkit rotate pdf --pdf "in.pdf" --out_pdf "in.pdf" --degrees 180 --pages "1-5" --inplace --overwrite

Rotate PNGs in a folder

python -m pdf-toolkit rotate images --in_dir "out\pages" --glob "*.png" --degrees 90 --out_dir "out\pages_rot"

In-place (overwrites files):

python -m pdf-toolkit rotate images --in_dir "out\pages" --glob "*.png" --degrees 90 --out_dir "out\pages" --inplace --overwrite

Page-images (split spreads + crop)

Auto mode (split if wide enough, otherwise crop-only):

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --glob "*.png" --mode auto --debug

Always split:

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --mode split --overwrite

Never split (crop-only):

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --mode crop

Useful tuning flags:

--gutter_trim_px: shave pixels from both sides of the detected gutter when splitting spreads
--edge_inset_px: inset the final padded crop box inward to remove faint border noise

Example with both knobs:

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --mode split --gutter_trim_px 20 --edge_inset_px 6 --debug

Page-images YAML config

Dump the default YAML config:

python -m pdf-toolkit page-images --dump-default-config

Use a config file:

python -m pdf-toolkit page-images --in_dir "out\pages" --out_dir "out\pages_single" --config "configs\page_images.default.yaml"

Precedence is deterministic:

built-in defaults < YAML config < explicitly provided CLI flags

This means optional CLI defaults do not overwrite YAML values unless the flag is explicitly passed.

Supported YAML shapes:

Root form:

mode: auto
split_ratio: 1.25
crop_threshold: 180
pad_px: 20

Wrapped form:

page_images:
  mode: auto
  split_ratio: 1.25
  gutter_search_frac: 0.35
  crop_threshold: 180
  min_area_frac: 0.25

Page selection format

Pages are 1-based for user input:

all
1-10
1-10,15,20-25

Manifest output (audit trail)

Each command writes a JSON manifest describing:

Inputs, outputs, options
Actions taken (written, skipped, dry-run)
Timestamps and logs

page-images action outputs list written files, plus split/crop metadata (e.g., gutter_x, bboxes, spread detection notes).

Example output list:

["out/pages_single/book_p0001_L.png", "out/pages_single/book_p0001_R.png"]

By default the manifest is written to:

Render: out_dir\manifest.json
Split: out_dir\manifest.json
Rotate PDF: out_pdf folder \manifest.json
Rotate images: out_dir\manifest.json
Page-images: out_dir\manifest.json

Note: --dry-run skips writing the manifest (it is treated like an output file).

Testing

Run the minimal unit tests:

python -m unittest discover -s tests -p "test_*.py"

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
configs		configs
src/pdf-toolkit		src/pdf-toolkit
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF Toolkit (offline CLI + optional Obsidian UI)

Why this exists

Features

Install (Windows)

Quickstart

Commands

Render PDF to PNG

Split PDF

Rotate PDF pages

Rotate PNGs in a folder

Page-images (split spreads + crop)

Page-images YAML config

Page selection format

Manifest output (audit trail)

Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PDF Toolkit (offline CLI + optional Obsidian UI)

Why this exists

Features

Install (Windows)

Quickstart

Commands

Render PDF to PNG

Split PDF

Rotate PDF pages

Rotate PNGs in a folder

Page-images (split spreads + crop)

Page-images YAML config

Page selection format

Manifest output (audit trail)

Testing

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages