Skip to content

Comments

Add EOPF Zarr tiling notebook#91

Open
emmanuelmathot wants to merge 10 commits intomainfrom
tiling_benchmark
Open

Add EOPF Zarr tiling notebook#91
emmanuelmathot wants to merge 10 commits intomainfrom
tiling_benchmark

Conversation

@emmanuelmathot
Copy link
Contributor

Introduce a comprehensive notebook to understand the fundamentals for tiling workflows with EOPF Zarr data using rio-tiler.

- Created TILING_BENCHMARK_README.md detailing the implementation of efficient tiling workflows with EOPF Zarr data using rio-tiler.
- Developed Notebook 1: EOPF Zarr + Rio-tiler Fundamentals with Sentinel-2, covering integration, tile generation, and performance analysis.
- Introduced reusable utilities in zarr_tiling_utils.py for rechunking, performance benchmarking, and overview generation.
- Updated pyproject.toml to include necessary dependencies for the new functionalities.
- Established a testing checklist and outlined next steps for future notebooks focusing on chunking strategy optimization and projections.
- Removed the `compare_chunking_strategies` function and replaced it with a new implementation that visualizes how a single tile request maps to chunks across different strategies.
- Introduced `visualize_chunks_and_tiles` function to display the spatial relationship between Zarr chunks and tile requests, including chunk boundaries and example tile requests.
- Enhanced performance summary printing for better clarity on chunk access efficiency.
- Deleted the comprehensive README for the Rio-tiler EOPF Zarr use case, which included detailed notebook summaries, key features, and technical decisions.
- Updated `compare_chunking_strategies` function in `zarr_tiling_utils.py` to calculate and display data transfer volumes and overhead ratios for different chunking strategies.
- Improved output formatting for chunk access comparison, providing clearer metrics on tile data, transferred data, and efficiency.
- Expanded the efficiency labels to include overhead considerations.
- Adjusted the printed table header for better readability.
- Updated efficiency definitions to clarify the impact of chunk count and overhead on performance.
@emmanuelmathot emmanuelmathot marked this pull request as ready for review November 24, 2025 08:54
@sabrinaszeto sabrinaszeto requested review from jwagemann and removed request for gisromerocandanedo December 2, 2025 08:14
@ciaransweet
Copy link
Contributor

@emmanuelmathot is this still relevant? If not can we close?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants