pg_zerialize

PostgreSQL extension for converting rows to efficient binary formats using the zerialize library.

21% smaller than JSON with MessagePack/CBOR formats.

Status: All Formats Complete! 🎉

✅ FlexBuffers - Fully implemented ✅ MessagePack - Fully implemented ✅ CBOR - Fully implemented ✅ ZERA - Fully implemented

Requirements

PostgreSQL 12+ (with development headers)
C++20 compatible compiler (GCC 10+, Clang 10+)
FlatBuffers library (libflatbuffers-dev)
MessagePack C library (libmsgpack-c-dev)
jsoncons library (libjsoncons-dev)
zerialize library (header-only, included in vendor/)

Building

make
sudo make install

Usage

Single Record Serialization

CREATE EXTENSION pg_zerialize;

-- Convert a single row to any of the four binary formats
SELECT row_to_msgpack(ROW('John', 25, true));
SELECT row_to_cbor(ROW('John', 25, true));
SELECT row_to_zera(ROW('John', 25, true));
SELECT row_to_flexbuffers(ROW('John', 25, true));

-- Serialize table rows individually
SELECT row_to_msgpack(users.*) FROM users;

Batch Processing (Faster for Multiple Rows)

-- Serialize multiple rows in a single call (2-3x faster!)
SELECT rows_to_msgpack(array_agg(users.*)) FROM users;
SELECT rows_to_cbor(array_agg(users.*)) FROM users;
SELECT rows_to_zera(array_agg(users.*)) FROM users;
SELECT rows_to_flexbuffers(array_agg(users.*)) FROM users;

-- Compare sizes across all formats
SELECT
    octet_length(rows_to_msgpack(array_agg(users.*))) as msgpack_bytes,
    octet_length(rows_to_cbor(array_agg(users.*))) as cbor_bytes,
    octet_length(rows_to_zera(array_agg(users.*))) as zera_bytes,
    octet_length(rows_to_flexbuffers(array_agg(users.*))) as flexbuffers_bytes
FROM users;

Performance

Based on real-world testing with user records (5 rows average):

Format	Avg Size	vs JSON	Best For
MessagePack	71 bytes	-21% 🥇	Max compression, APIs, caching
CBOR	71 bytes	-21% 🥈	IoT, IETF standard (RFC 8949)
JSON	90 bytes	baseline	Human-readable, debugging
FlexBuffers	142 bytes	+58%	Zero-copy reads, lazy access
ZERA	209 bytes	+132%	Zerialize ecosystem, advanced features

MessagePack and CBOR are the most compact, both saving ~21% vs JSON. FlexBuffers trades size for zero-copy deserialization capability. ZERA includes additional structure for advanced features but is larger.

Performance Optimizations

All major performance optimizations complete! Combined speedup: ~3-5x faster than original!

✅ Schema Caching - TupleDesc lookups cached, 20-30% faster bulk operations

✅ Batch Processing - Multiple rows in single call, 2-3x faster for bulk operations

✅ Buffer Pre-allocation - Map/array capacity reserved upfront, 5-10% faster with reduced memory fragmentation

Next Steps

✅ ~~Implement FlexBuffers support~~
✅ ~~Implement MessagePack support~~
✅ ~~Implement CBOR support~~
✅ ~~Implement ZERA support~~
✅ ~~Add array support for PostgreSQL arrays~~
✅ ~~Add proper NUMERIC/DECIMAL handling~~
✅ ~~Schema caching optimization~~
✅ ~~Batch processing for multiple rows~~
✅ ~~Buffer pre-allocation optimization~~
Add nested composite type support
Add date/timestamp types
Add deserialization functions

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
vendor/zerialize		vendor/zerialize
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
ARRAYS_NUMERIC_SUCCESS.md		ARRAYS_NUMERIC_SUCCESS.md
BATCH_PROCESSING_SUCCESS.md		BATCH_PROCESSING_SUCCESS.md
BUFFER_PREALLOC_SUCCESS.md		BUFFER_PREALLOC_SUCCESS.md
CBOR_SUCCESS.md		CBOR_SUCCESS.md
COMPLETE.md		COMPLETE.md
DEPLOYMENT_SUMMARY.md		DEPLOYMENT_SUMMARY.md
ENHANCEMENTS.md		ENHANCEMENTS.md
LICENSE		LICENSE
MSGPACK_SUCCESS.md		MSGPACK_SUCCESS.md
Makefile		Makefile
QUICKSTART.md		QUICKSTART.md
README.md		README.md
SCHEMA_CACHE_SUCCESS.md		SCHEMA_CACHE_SUCCESS.md
SUCCESS.md		SUCCESS.md
build.sh		build.sh
demo.sql		demo.sql
demo_all_formats.sql		demo_all_formats.sql
demo_formats.sql		demo_formats.sql
pg_zerialize--1.0.sql		pg_zerialize--1.0.sql
pg_zerialize.control		pg_zerialize.control
pg_zerialize.cpp		pg_zerialize.cpp
test.sql		test.sql
test_batch_processing.sql		test_batch_processing.sql
test_buffer_preallocation.sql		test_buffer_preallocation.sql
test_cbor.sql		test_cbor.sql
test_deployment.sql		test_deployment.sql
test_enhancements.sql		test_enhancements.sql
test_final.sql		test_final.sql
test_msgpack.sql		test_msgpack.sql
test_pg_zerialize.sql		test_pg_zerialize.sql
test_schema_cache.sql		test_schema_cache.sql
verify_flexbuffers.py		verify_flexbuffers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pg_zerialize

Status: All Formats Complete! 🎉

Requirements

Building

Usage

Single Record Serialization

Batch Processing (Faster for Multiple Rows)

Performance

Performance Optimizations

Next Steps

About

Uh oh!

Releases

Packages

Languages

License

mrayva/pg_zerialize

Folders and files

Latest commit

History

Repository files navigation

pg_zerialize

Status: All Formats Complete! 🎉

Requirements

Building

Usage

Single Record Serialization

Batch Processing (Faster for Multiple Rows)

Performance

Performance Optimizations

Next Steps

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages