# hoardwick

Storage analysis and cleanup system for cross-platform filesystems.
Scan WSL, Windows, and network drives to identify duplicates, stale files, and cleanup opportunities. Built for large filesystems with resumable scans, aggregate patterns, and LLM-friendly output.
```bash
# Install dependencies
pip3 install -r requirements.txt

# Scan your storage
python3 scripts/scan.py /home/chris --label wsl-home
python3 scripts/scan.py /mnt/c/Users/Chris --label windows-user

# Analyse for quick wins
python3 scripts/analyse.py --report duplicates
python3 scripts/analyse.py --report stale --stale-days 365

# Execute cleanup
python3 scripts/cleanup.py docker-prune
python3 scripts/cleanup.py clear-cache --type npm
```

- Resumable scanning - Checkpoint-based scans handle timeouts on large filesystems
- Aggregate patterns - Treats `node_modules`, `.git`, and caches as single units for performance
- Multi-root support - Scan WSL, Windows, and network drives separately with labels
- Duplicate detection - Find identical files across all scanned locations
- Stale file identification - Track last access times, find archival candidates
- Built-in cleanup - Docker pruning, cache clearing, build artifact removal
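The duplicate detection lives in `scripts/analyse.py` and is not reproduced here; a minimal sketch of the standard approach (group files by size first, then hash only the same-size candidates) could look like this. The function name and return shape are illustrative, not the tool's actual API:

```python
import hashlib
from collections import defaultdict
from pathlib import Path

def find_duplicates(root: str) -> dict[str, list[Path]]:
    """Group files under root by content hash.

    Files are bucketed by size first, so hashing only happens
    when two or more files could possibly be identical.
    """
    by_size: defaultdict[int, list[Path]] = defaultdict(list)
    for path in Path(root).rglob("*"):
        if path.is_file() and not path.is_symlink():
            by_size[path.stat().st_size].append(path)

    duplicates: dict[str, list[Path]] = {}
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # unique size: cannot have a duplicate
        by_hash: defaultdict[str, list[Path]] = defaultdict(list)
        for path in paths:
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            by_hash[digest].append(path)
        duplicates.update({h: ps for h, ps in by_hash.items() if len(ps) > 1})
    return duplicates
```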
- Usage Guide - Commands, workflows, and examples
- Cleanup Guide - Comprehensive cleanup recipes and safety protocols
- Architecture - Specification relationships and system design
- Development Guide - For contributors and Claude Code development
Emergency space recovery:
```bash
python3 scripts/cleanup.py uv-cache            # Clear UV cache (often 50-200GB)
python3 scripts/cleanup.py docker-prune --all  # Remove all unused Docker data
```

Scanning large drives:
```bash
# Windows paths are slow from WSL - use native PowerShell when possible
python3 scripts/scan.py /mnt/c/Users/Chris --label windows

# Scans checkpoint every 1000 files - safe to interrupt and resume
python3 scripts/scan.py /large/path --label big-scan
```

Finding duplicates:
```bash
python3 scripts/analyse.py --report duplicates
python3 scripts/analyse.py --report duplicates --output json > duplicates.json
```

Storage: SQLite database (`generated/hoardwick.db`)
- `files` - Indexed files with metadata, hashes, aggregation flags
- `directory_stats` - Pre-computed directory metrics
- `scans` - Multi-root scan tracking with timestamps
- `scan_progress` - Resumable checkpoints
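The real schema ships with the scanner; the following is a hypothetical reconstruction of those four tables. Column names are assumptions for illustration, not read from `generated/hoardwick.db`:

```python
import sqlite3

# Hypothetical schema matching the table descriptions above;
# the actual columns in generated/hoardwick.db may differ.
SCHEMA = """
CREATE TABLE IF NOT EXISTS scans (
    id INTEGER PRIMARY KEY,
    root TEXT NOT NULL,             -- e.g. /home/chris
    label TEXT NOT NULL,            -- e.g. wsl-home
    started_at TEXT NOT NULL
);
CREATE TABLE IF NOT EXISTS files (
    id INTEGER PRIMARY KEY,
    scan_id INTEGER REFERENCES scans(id),
    path TEXT NOT NULL,
    size INTEGER,
    mtime REAL,
    sha256 TEXT,                    -- NULL when lazy hashing skipped it
    is_aggregate INTEGER DEFAULT 0  -- 1 when rolled up (node_modules etc.)
);
CREATE INDEX IF NOT EXISTS idx_files_hash ON files(sha256);
CREATE TABLE IF NOT EXISTS directory_stats (
    scan_id INTEGER REFERENCES scans(id),
    path TEXT NOT NULL,
    total_size INTEGER,
    file_count INTEGER
);
CREATE TABLE IF NOT EXISTS scan_progress (
    scan_id INTEGER REFERENCES scans(id),
    last_path TEXT,                 -- resume point after interruption
    files_seen INTEGER
);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
```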
Aggregate Patterns (treated as single database entries):
- `node_modules/` - npm packages
- `.git/objects/` - git objects
- `__pycache__/`, `.next/`, `dist/`, `target/` - build outputs
- `venv/`, `.venv/` - Python virtual environments
- `Cache/`, `cache2/` - browser caches
Performance:
- Checkpoints every 1000 files (configurable)
- Symlink loop detection via inode tracking
- Lazy hashing (only files < 100MB)
- Aggregation reduces database size by ~90%
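Checkpointing and inode-based loop detection can share one traversal loop. A simplified sketch of that combination, assuming the documented 1000-file default (the scanner's actual implementation may differ):

```python
import os

CHECKPOINT_EVERY = 1000  # matches the documented default

def walk_with_checkpoints(root, save_checkpoint):
    """Yield file paths under root, skipping symlink loops.

    save_checkpoint(dirpath, count) is invoked every
    CHECKPOINT_EVERY files so an interrupted scan can resume.
    """
    seen_inodes = set()  # (st_dev, st_ino) of visited directories
    count = 0
    for dirpath, dirnames, filenames in os.walk(root, followlinks=True):
        st = os.stat(dirpath)
        key = (st.st_dev, st.st_ino)
        if key in seen_inodes:
            dirnames[:] = []  # already visited: prune to break the loop
            continue
        seen_inodes.add(key)
        for name in filenames:
            yield os.path.join(dirpath, name)
            count += 1
            if count % CHECKPOINT_EVERY == 0:
                save_checkpoint(dirpath, count)  # resume point
```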
Requirements:

- Python 3.10+
- click, rich
```bash
pip3 install -r requirements.txt
```

```
hoardwick/
├── scripts/        # Executable utilities
│   ├── scan.py     # Storage scanner
│   ├── analyse.py  # Duplicate/stale detection
│   └── cleanup.py  # Cleanup operations
├── specs/          # LiveSpec specifications
├── docs/           # Documentation
├── generated/      # SQLite database (gitignored)
└── .livespec/      # LiveSpec framework
```
MIT