SedonaDB

SedonaDB is an open-source, single-node analytical database engine that treats geospatial data as a first-class citizen. It aims to deliver the fastest spatial analytics queries and the most comprehensive spatial function coverage available.

SedonaDB is well suited to processing small and medium-sized datasets on local machines or cloud instances. For distributed workloads, use SedonaSpark, SedonaFlink, or SedonaSnow instead.

Architecture

Columnar in-memory datasets
- Spatial indexing
- Spatial statistics
- CRS tracking
- Arrow format and zero serialization overhead

Spatial query optimization
- Spatial-aware heuristic-based optimization
- Spatial-aware cost-based optimization

Spatial query processing
- Spatial range query, KNN query, spatial join query, KNN join query
- Map algebra, NDVI, mask, zonal statistics
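
To make the query types listed above concrete, here is a minimal pure-Python sketch of what a spatial range query and a KNN query compute, using brute force over a handful of points. It is not how SedonaDB evaluates these queries (the engine relies on spatial indexing and query optimization, as the list above notes); the function names `range_query` and `knn_query` and the sample points are hypothetical.

```python
import math

# Hypothetical sample data: (name, x, y) points in an arbitrary planar CRS.
POINTS = [
    ("a", 0.0, 0.0),
    ("b", 1.0, 1.0),
    ("c", 2.0, 2.0),
    ("d", 5.0, 5.0),
]


def range_query(points, xmin, ymin, xmax, ymax):
    """Return names of points that fall inside the query rectangle (inclusive)."""
    return [
        name
        for name, x, y in points
        if xmin <= x <= xmax and ymin <= y <= ymax
    ]


def knn_query(points, qx, qy, k):
    """Return names of the k points nearest to (qx, qy) by Euclidean distance."""
    by_distance = sorted(points, key=lambda p: math.hypot(p[1] - qx, p[2] - qy))
    return [name for name, _, _ in by_distance[:k]]


print(range_query(POINTS, 0.5, 0.5, 2.5, 2.5))  # ['b', 'c']
print(knn_query(POINTS, 0.9, 0.9, 2))  # ['b', 'a']
```

A spatial join or KNN join applies the same kind of test between rows of two tables rather than between one query shape and one table, which is where indexing pays off.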

Raster functions are coming soon. We expect SedonaDB Raster will match all raster functions provided in SedonaSpark.
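
For readers unfamiliar with the raster operations named in the architecture list, NDVI is a per-pixel map-algebra formula: (NIR - Red) / (NIR + Red). The sketch below computes it over two tiny, made-up bands in plain Python. It only illustrates the arithmetic and is not SedonaDB's raster API, which has not been released yet.

```python
def ndvi(nir, red):
    """Compute NDVI = (NIR - Red) / (NIR + Red) for one pixel.

    Returns None where both bands are zero, since the ratio is undefined there.
    """
    total = nir + red
    if total == 0:
        return None
    return (nir - red) / total


# Hypothetical 2x2 reflectance bands (lists of rows).
nir_band = [[0.5, 0.75], [0.25, 0.0]]
red_band = [[0.5, 0.25], [0.75, 0.0]]

# Apply the formula pixel by pixel, pairing up the two bands.
ndvi_band = [
    [ndvi(n, r) for n, r in zip(nir_row, red_row)]
    for nir_row, red_row in zip(nir_band, red_band)
]
print(ndvi_band)  # [[0.0, 0.5], [-0.5, None]]
```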

Features of SedonaDB

SedonaDB has several advantages:

High Performance: Built in Rust for exceptional speed and memory efficiency
Comprehensive Spatial Toolkit: Provides vector functions today, with raster functions planned, in a single library
CRS Propagation: Always maintains coordinate reference system information
Format Flexibility: Supports legacy and modern file formats, including GeoParquet, Shapefile, and GeoJSON
Dual APIs: Python and SQL interfaces for seamless workflow integration
Extensible: Easily customizable and extensible architecture
Ecosystem Integration: Interoperable with PyArrow-compatible libraries like GeoPandas, DuckDB, and Polars
Active Community: Great maintainers and contributors who encourage external contributions

Performance Benchmarks

This is a performance benchmark comparing SedonaDB 0.1.0, DuckDB 1.4.0, and GeoPandas 1.1.1 using SpatialBench Queries 1-12 at Scale Factors 1 and 10. Details can be found at Apache Sedona SpatialBench.

Install

You can install the SedonaDB Python package from PyPI with pip:

pip install "apache-sedona[db]"

Quick Start

Get started with SedonaDB in just a few lines:

import sedona.db

# Connect to SedonaDB
sd = sedona.db.connect()

# Run a simple spatial query
result = sd.sql("SELECT ST_Point(0, 1) as geom")
result.show()
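
Building on the quick start, the sketch below runs a point-in-polygon check with `ST_Intersects` and `ST_GeomFromText`, two functions that also appear in the Overture example in this README. The square polygon, the `build_query` helper, and the `is_inside` alias are made up for illustration, and the import is guarded so the script still runs (and only prints the SQL) when SedonaDB is not installed.

```python
# Guard the import so this sketch degrades gracefully without SedonaDB.
try:
    import sedona.db
    HAVE_SEDONADB = True
except ImportError:
    HAVE_SEDONADB = False

# Hypothetical square polygon used only for this example.
SQUARE_WKT = "POLYGON((0 0, 0 2, 2 2, 2 0, 0 0))"


def build_query(x, y, polygon_wkt=SQUARE_WKT):
    """Build SQL that tests whether the point (x, y) intersects the polygon."""
    return (
        "SELECT ST_Intersects("
        f"ST_Point({x}, {y}), ST_GeomFromText('{polygon_wkt}')"
        ") AS is_inside"
    )


query = build_query(1, 1)
print(query)

# Only execute the query when SedonaDB is available.
if HAVE_SEDONADB:
    sd = sedona.db.connect()
    sd.sql(query).show()
```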

Supported File Formats

SedonaDB supports a wide range of geospatial file formats:

Vector: GeoParquet, WKT, WKB, all formats supported by GeoPandas
Raster: Coming soon with full SedonaSpark compatibility

Overture buildings example

This section shows how to query the Overture buildings data.

Start by establishing a connection:

import sedona.db

sd = sedona.db.connect()

Set some AWS environment variables to access the data:

import os
os.environ["AWS_SKIP_SIGNATURE"] = "true"
os.environ["AWS_DEFAULT_REGION"] = "us-west-2"

Read the dataset into a Python SedonaDB DataFrame. This is lazy: even though the Overture buildings table contains millions of rows, SedonaDB will only fetch the data required for the query.

df = sd.read_parquet(
    "s3://overturemaps-us-west-2/release/2025-11-19.0/theme=buildings/type=building/"
)
df.to_view("buildings")

Now run a query to compute the centroids of tall buildings (above 20 meters) in New York City:

nyc_bbox_wkt = (
    "POLYGON((-74.2591 40.4774, -74.2591 40.9176, -73.7004 40.9176, -73.7004 40.4774, -74.2591 40.4774))"
)

sd.sql(f"""
SELECT
    id,
    height,
    num_floors,
    roof_shape,
    ST_Centroid(geometry) as centroid
FROM
    buildings
WHERE
    is_underground = FALSE
    AND height IS NOT NULL
    AND height > 20
    AND ST_Intersects(geometry, ST_SetSRID(ST_GeomFromText('{nyc_bbox_wkt}'), 4326))
LIMIT 5;
""").show()

Here's the query output:

┌─────────────────────────┬────────────────────┬────────────┬────────────┬─────────────────────────┐
│            id           ┆       height       ┆ num_floors ┆ roof_shape ┆         centroid        │
│           utf8          ┆       float64      ┆    int32   ┆    utf8    ┆         geometry        │
╞═════════════════════════╪════════════════════╪════════════╪════════════╪═════════════════════════╡
│ 1b9040c2-2e79-4f56-aba… ┆               22.4 ┆            ┆            ┆ POINT(-74.230407502993… │
├╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
│ 1b5e1cd2-d697-489e-892… ┆               21.5 ┆            ┆            ┆ POINT(-74.231451103592… │
├╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
│ c1afdf78-bf84-4b8f-ae1… ┆               20.9 ┆            ┆            ┆ POINT(-74.232593032240… │
├╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
│ 88f36399-b09f-491b-bb6… ┆               24.5 ┆            ┆            ┆ POINT(-74.231878209597… │
├╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┤
│ df37a283-f5bd-4822-a05… ┆ 24.154542922973633 ┆            ┆            ┆ POINT(-74.241910239840… │
└─────────────────────────┴────────────────────┴────────────┴────────────┴─────────────────────────┘
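
The bounding-box polygon used in the query above can be generated from its corner coordinates instead of being typed by hand. The helper below is a small sketch in plain Python; `bbox_to_wkt` is a hypothetical name, not part of SedonaDB.

```python
def bbox_to_wkt(min_lon, min_lat, max_lon, max_lat):
    """Return a closed WKT polygon for a longitude/latitude bounding box."""
    corners = [
        (min_lon, min_lat),
        (min_lon, max_lat),
        (max_lon, max_lat),
        (max_lon, min_lat),
        (min_lon, min_lat),  # repeat the first corner to close the ring
    ]
    ring = ", ".join(f"{lon} {lat}" for lon, lat in corners)
    return f"POLYGON(({ring}))"


# Same New York City bounds as in the example query.
nyc_bbox_wkt = bbox_to_wkt(-74.2591, 40.4774, -73.7004, 40.9176)
print(nyc_bbox_wkt)
```

The resulting string matches the `nyc_bbox_wkt` literal from the example, so it can be dropped into the same f-string query.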

Community & Support

Get Help

Discord: Join our Discord community for real-time chat and support
GitHub Discussions: Start a GitHub Discussion with questions or ideas
Documentation: Check out our comprehensive docs

Contributing

We welcome contributions! Here's how you can get involved:

Report Issues: Found a bug? Open an issue on GitHub
Suggest Features: Have an idea? Start a GitHub Discussion
Fix Issues: Comment "take" on any open issue to claim it
Submit PRs: Brainstorm features with contributors and submit pull requests
Join Meetings: Monthly contributor meetings - we'd love to have you!

About SedonaDB

SedonaDB is a subproject of Apache Sedona, an Apache Software Foundation project. It is governed by the Apache Software Foundation and subject to all of the foundation's rules and oversight requirements. SedonaDB is built on top of Apache Arrow and Apache DataFusion for fast query processing.

Related Projects

Apache Sedona - The main Apache Sedona project for distributed spatial analytics
Sedona SpatialBench - Comprehensive benchmarking suite for spatial analytics performance testing
