Minimal, production-ready HTTP byte-range reader that behaves like a read-only file object. It supports 2‑chunk LRU caching, parallel prefetch, and clean random access into large remote files (think: ZIP archives, tarballs, parquet splits, ISO images) without downloading the whole object.
Requires Python 3.9+. Default transport: `requests` (HTTP/1.1).

- Single file, zero runtime dependencies except `requests`
- 2-chunk LRU cache (current + previous) to reduce re-fetches on back-seeks
- Background prefetch of the next chunk for smooth sequential reads
- `If-Range` with `ETag`/`Last-Modified` to prevent mixing chunks after remote updates (see the sketch after this list)
- Graceful fallback when servers ignore `Range` and return 200 OK
- Works anywhere a file-like object works (`zipfile`, `tarfile`, `PIL.Image.open`, etc.)
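To make the `If-Range`/fallback behaviour concrete, here is a minimal sketch of the underlying HTTP mechanics written against plain `requests`. It is illustrative only (the URL is a placeholder) and is not the library's internal code:

```python
import requests

# Sketch of a single conditional chunk fetch (not the library's internals).
url = "https://example.com/big-file.zip"  # placeholder URL

session = requests.Session()
head = session.head(url, allow_redirects=True)
etag = head.headers.get("ETag")
size = int(head.headers.get("Content-Length", 0))

start, end = 0, 1024 * 1024 - 1  # first 1 MiB chunk
headers = {"Range": f"bytes={start}-{end}"}
if etag:
    # If-Range: only honour the Range when the object is unchanged;
    # otherwise the server sends a fresh 200 with the full body.
    headers["If-Range"] = etag

resp = session.get(url, headers=headers)
resp.raise_for_status()
if resp.status_code == 206:
    chunk = resp.content             # server honoured the Range request
else:
    chunk = resp.content[: end + 1]  # Range ignored or object changed: slice the full body
    size = len(resp.content)
print(f"fetched {len(chunk)} bytes (object size: {size})")
```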
Install with `pip install http-range-reader`, or use it directly by copying `src/http_range_reader/reader.py` into your project.
Quick start:

```python
from zipfile import ZipFile

from http_range_reader import HTTPRangeReader

url = "https://github.com/psf/requests/archive/refs/heads/main.zip"
rdr = HTTPRangeReader(url, chunk_size=1024 * 1024, prefetch=True)

with rdr:
    with ZipFile(rdr) as zf:
        print(len(rdr), "bytes over HTTP")
        print("first 5 entries:")
        for info in zf.infolist()[:5]:
            print("-", info.filename, info.file_size)
        data = zf.read(zf.infolist()[0].filename)
        print("read", len(data), "bytes from first member")
```

Use it when:

- You need random access into large objects over HTTP (see the sketch after this list)
- You want to avoid full downloads and keep RAM small
- You can rely on standard HTTP servers/CDNs that support Range requests
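Outside `zipfile`, the reader can also be driven directly. This is a small sketch assuming the standard `read`/`seek`/`tell` file protocol that `zipfile` already requires of it; the cache behaviour noted in the comments is the expected one, not a guarantee:

```python
import io

from http_range_reader import HTTPRangeReader

url = "https://github.com/psf/requests/archive/refs/heads/main.zip"
with HTTPRangeReader(url, chunk_size=256 * 1024) as rdr:
    magic = rdr.read(4)             # ZIP local-file-header signature: b"PK\x03\x04"
    rdr.seek(-22, io.SEEK_END)      # jump near the end-of-central-directory record
    eocd = rdr.read(22)
    rdr.seek(0)                     # back-seek should hit the 2-chunk cache, not the network
    print(magic, len(eocd), rdr.tell())
```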
Does it cache the whole file? No. It caches at most two chunks at a time.
HTTP/2 or HTTP/3? The default transport is `requests` (HTTP/1.1). You can swap in your own transport if needed (see the sketch below).
Thread safety? Intended for single-reader usage. The internal executor is only for prefetching.
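The transport-swap hook itself isn't documented here, but as a wire-level illustration of what an HTTP/2 transport would do, here is a hedged sketch using `httpx` (placeholder URL; this is not the reader's transport API):

```python
import httpx  # requires: pip install "httpx[http2]"

# Illustrative only: the same byte-range request issued over HTTP/2.
url = "https://example.com/big-file.zip"  # placeholder URL
with httpx.Client(http2=True) as client:
    resp = client.get(url, headers={"Range": "bytes=0-65535"})
    print(resp.http_version, resp.status_code, len(resp.content))
```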
A bundled demo lists entries from a remote ZIP:

```bash
python -m examples.http_zip_demo --url https://github.com/psf/requests/archive/refs/heads/main.zip --list 10
```

Roadmap:

- Optional `httpx` transport (HTTP/2)
- Adaptive prefetch sizing
- Multi-range coalescing (`multipart/byteranges`) when beneficial (see the sketch after this list)
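For context on the last roadmap item, multi-range coalescing means asking for several byte ranges in one request; a supporting server answers 206 with a `multipart/byteranges` body. A minimal probe (placeholder URL, plain `requests`) might look like this:

```python
import requests

# Probe whether a server supports multiple ranges in a single request.
url = "https://example.com/big-file.zip"  # placeholder URL
resp = requests.get(url, headers={"Range": "bytes=0-1023,1048576-1049599"})
ctype = resp.headers.get("Content-Type", "")
if resp.status_code == 206 and "multipart/byteranges" in ctype:
    print("server coalesced both ranges into", len(resp.content), "bytes")
else:
    print("no multi-range support; issue one request per range instead")
```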