CHUCC Server

Versioned SPARQL server implementing SPARQL 1.2 Protocol with Version Control Extension

CHUCC-server is a versioned, in-memory SPARQL server with CQRS + event sourcing, using RDF Patch and Kafka to replay, branch, and time-travel queries—fast like Redis, precise like Git.

Overview

This server implements:

SPARQL 1.2 Protocol - Complete query and update operations
Version Control Extension - Git-like branching, commits, merges, and time-travel
RDF Patch - Efficient changeset format for versioned updates
Backward Compatibility - Non-versioned SPARQL 1.1/1.2 clients work seamlessly

Features

Core SPARQL Protocol

✅ SPARQL Query (SELECT, CONSTRUCT, ASK, DESCRIBE)
✅ SPARQL Update (INSERT, DELETE, MODIFY)
✅ Content negotiation (JSON, XML, CSV, TSV)
✅ Graph protocol support (default-graph-uri, named-graph-uri)
✅ Both GET and POST methods
✅ Service Description - SPARQL 1.1 Service Description with version control vocabulary extension

Graph Store Protocol

✅ GET, PUT, POST, DELETE, PATCH, HEAD operations
✅ Named graph support (?graph=<uri> parameter)
✅ Default graph operations (?default=true parameter)
✅ Quad-based RDF Patch handling
✅ Full version control integration

Version Control

✅ Branches - Create, list, get info, delete with Git-like metadata (timestamps, commit count, protection) and RFC 5988 pagination
✅ Commits - Create commits, query metadata (id, message, author, timestamp, parents, patchSize)
✅ History - Browse commit history with filtering (branch, date range, author) and RFC 5988 pagination
✅ Time-travel - Query dataset state at any point in time (asOf)
✅ Merging - Fast-forward and three-way merge with conflict resolution strategies (ours, theirs)
✅ Tags - Create, list, get, and delete tags with immutability enforcement and RFC 5988 pagination
✅ Diff - Compare any two commits with RDFPatch output (configurable endpoint)
✅ Blame - Last-writer attribution per quad with graph-scoped analysis and pagination
✅ Batch operations - Apply multiple write operations (SPARQL updates or RDF patches) in single commit
✅ Prefix Management - Version-controlled RDF namespace prefixes with time-travel queries and automatic suggestions

Advanced Features

✅ Optimistic Concurrency - ETags and If-Match headers
✅ Fast-forward merges - Automatic when possible
✅ Conflict detection - Structured representation of merge conflicts (including prefix conflicts)
✅ Conflict resolution - Automatic strategies (ours, theirs) with configurable scope (graph-level, dataset-level)
✅ RFC 7807 Problem Details - Standardized error responses
✅ RESTful API - Dataset-in-path routing (/{dataset}/version/{endpoint}) following Apache Jena Fuseki pattern

Performance & Scalability

✅ Snapshot Optimization - Fast recovery and query materialization from snapshots
✅ LRU Cache Eviction - Bounded memory usage with Caffeine-based cache
✅ Cache Metrics - Monitor cache hit rates, evictions, and memory usage
✅ On-demand Snapshot Loading - Snapshots loaded from Kafka when needed (not stored in memory)
✅ Metadata Caching - Fast snapshot lookups with minimal memory footprint
✅ Event Deduplication - Exactly-once processing semantics with UUIDv7-based deduplication cache
✅ Distributed Tracing - Correlation IDs track requests across HTTP → Kafka → Projector
✅ Parallel Event Replay - Configurable consumer concurrency for faster startup (6x with 6 datasets)
✅ Event Processing Metrics - Timing metrics for identifying performance bottlenecks

Operations

✅ Dataset Creation - Dynamic dataset creation with automatic Kafka topic provisioning
✅ Dataset Deletion - Delete entire datasets with optional Kafka topic cleanup
✅ Branch Deletion - Delete branches with protection for main branch
✅ Robust Error Handling - Kafka errors mapped to RFC 7807 with retry logic
✅ Monitoring & Metrics - Dataset and Kafka metrics via Micrometer
✅ Health Monitoring - Kafka topic health checks and manual healing endpoints
✅ Confirmation Requirements - Safeguards against accidental deletion
✅ Audit Trail - All operations recorded as events in Kafka

Example: Basic Query

# Query the default branch
curl -X GET "http://localhost:3030/sparql?query=SELECT+*+WHERE+{+?s+?p+?o+}+LIMIT+10" \
  -H "Accept: application/sparql-results+json"

Example: Versioned Update

# Create a commit on main branch
curl -X POST http://localhost:3030/sparql \
  -H "Content-Type: application/sparql-update" \
  -H "SPARQL-VC-Message: Add new person" \
  -H "SPARQL-VC-Author: alice@example.org" \
  --data-binary @- <<'EOF'
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
INSERT DATA {
  <http://example.org/alice> foaf:name "Alice" .
  <http://example.org/alice> foaf:age 30 .
}
EOF

Example: Time-Travel Query

# Query state as of yesterday
curl -X GET "http://localhost:3030/sparql?query=SELECT+*+WHERE+{+?s+?p+?o+}&asOf=2025-10-03T12:00:00Z" \
  -H "Accept: application/sparql-results+json"

Example: Branch and Merge

# Create a new branch
curl -X POST http://localhost:3030/version/branches \
  -H "Content-Type: application/json" \
  -d '{"name": "feature-x", "from": "main"}'

# Update on feature branch
curl -X POST http://localhost:3030/sparql?branch=feature-x \
  -H "Content-Type: application/sparql-update" \
  -H "SPARQL-VC-Message: Experimental change" \
  -H "SPARQL-VC-Author: bob@example.org" \
  --data-binary "INSERT DATA { ... }"

# Merge back to main
curl -X POST http://localhost:3030/version/merge \
  -H "Content-Type: application/json" \
  -d '{"from": "feature-x", "into": "main", "strategy": "three-way"}'

API Documentation

See the complete OpenAPI 3.1 specification: api/openapi.yaml

Interactive API documentation (when server is running):

Swagger UI: http://localhost:3030/api-docs
ReDoc: http://localhost:3030/redoc

Protocol Documentation

SPARQL 1.2 Protocol with Version Control Extension

Project Structure

api/                    # OpenAPI specification and JSON schemas
protocol/               # Protocol documentation
src/                    # Source code
docs/                   # Comprehensive documentation (see docs/README.md)
  architecture/         # System architecture (C4 model, CQRS guide)
  api/                  # API documentation (OpenAPI, error codes)
  development/          # Development guides (contributing, quality tools)
  operations/           # Operations guides (performance, deployment)
  conformance/          # Protocol conformance documentation
tests/                  # Test suites

Conformance

This implementation aims for:

SPARQL 1.2 Protocol conformance (Level 1: Basic, Level 2: Advanced)
RDF Patch format support (text/rdf-patch)
RFC 7807 Problem Details for errors
UUIDv7 for commit identifiers

Technology Stack

Language: Java
Runtime: Java 21
Framework: Spring Boot 3.5
RDF + SPARQL: Apache Jena 5.5
Persistent Event Sourcing: Kafka integration

Building

Quick Build

mvn clean install

The project uses batch mode by default for cleaner console output (configured in .mvn/maven.config).

Build Options

Verbose output (when debugging):

mvn clean install -X

Extra quiet (minimal output):

mvn clean install -q

Debug with full stack traces:

mvn clean install -X -e

Test Logging

Test execution logging is configured in src/test/resources/logback-test.xml:

Application logs: INFO level
Spring Boot/Kafka/Testcontainers: WARN level (reduces noise)

To temporarily increase test logging, override in your test:

@SpringBootTest(properties = {"logging.level.org.springframework=DEBUG"})

Development Status

⏳ In Active Development - Core features complete, API endpoints in progress

Completed:

✅ Protocol specification (SPARQL 1.2 + Version Control Extension)
✅ OpenAPI specification
✅ JSON schemas
✅ Core SPARQL endpoint (Query + Update)
✅ Graph Store Protocol (GET, PUT, POST, DELETE, PATCH, HEAD)
✅ Named graph support (quad-based RDF Patch handling)
✅ Version control core (commits, advanced operations)
✅ Storage backend (Apache Jena + Kafka event sourcing)
✅ Full CQRS + Event Sourcing architecture (command handlers → Kafka → projectors)
✅ Comprehensive test suite (711 tests, including async event flow validation)
✅ Performance optimizations (snapshots, LRU cache)
✅ Deletion operations (branches, datasets)
✅ Time-travel query validation tests (5 comprehensive integration tests)
✅ Performance refactoring (Model API → Graph API migration complete)
✅ Consistent dataset parameter support (removed all hardcoded "default" values)
✅ Branch Management API (GET/POST/GET/{name}/DELETE /version/branches) - Completed 2025-10-24
✅ Commit Metadata API (GET /version/commits/{id}) - Completed 2025-01-25
✅ Tag Management API (GET/POST /version/tags) - Completed 2025-10-25
✅ Merge Operations API (POST /version/merge) - Completed 2025-10-28
✅ History & Diff API (GET /version/history, GET /version/diff) - Completed 2025-11-01
✅ Blame API (GET /version/blame) - Completed 2025-11-02
✅ Batch Operations API (POST /version/batch) - Completed 2025-11-03
✅ Prefix Management Protocol (6 endpoints for RDF namespace prefix management) - Completed 2025-11-12

See .tasks/README.md for detailed task roadmap.

✅ Kafka best practices: Aggregate-ID partition key strategy
✅ Kafka best practices: UUIDv7-based event deduplication
✅ Kafka best practices: Correlation ID for distributed tracing
✅ Kafka best practices: Comprehensive event serialization tests
✅ Production-ready CQRS/Event Sourcing implementation
✅ Specification cleanup (removed redundant features via Occam's Razor)

Quality Gates:

✅ Zero Checkstyle violations
✅ Zero SpotBugs warnings
✅ Zero PMD violations
✅ Zero compiler warnings (enforced by -Werror)
✅ All 1444 tests passing (869 unit + 575 integration)

See Task Roadmap for complete implementation history.

Documentation

Comprehensive documentation is available in the docs/ directory:

Documentation Index - Start here for navigation
Architecture Guide - For AI agents and developers
C4 Model - System architecture diagrams
API Documentation - OpenAPI, error codes, extensions
Development Guides - Contributing, quality tools
Operations Guides - Performance, deployment

Contributing

Contributions welcome! Please see Contributing Guide for guidelines.

License

Apache 2.0 - see LICENSE for details.

Acknowledgments

Based on the SPARQL 1.2 Protocol (W3C)
RDF Patch format from Apache Jena
Inspired by Git version control model

Name		Name	Last commit message	Last commit date
Latest commit History 316 Commits
.claude		.claude
.frontend-concept		.frontend-concept
.github/workflows		.github/workflows
.mvn		.mvn
.tasks		.tasks
api		api
docs		docs
protocol		protocol
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dependency-check-suppressions.xml		dependency-check-suppressions.xml
google_checks_chucc.xml		google_checks_chucc.xml
pmd-custom-ruleset.xml		pmd-custom-ruleset.xml
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CHUCC Server

Overview

Features

Core SPARQL Protocol

Graph Store Protocol

Version Control

Advanced Features

Performance & Scalability

Operations

Example: Basic Query

Example: Versioned Update

Example: Time-Travel Query

Example: Branch and Merge

API Documentation

Protocol Documentation

Project Structure

Conformance

Technology Stack

Building

Quick Build

Build Options

Test Logging

Development Status

Documentation

Contributing

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

arne-bdt/CHUCC-server

Folders and files

Latest commit

History

Repository files navigation

CHUCC Server

Overview

Features

Core SPARQL Protocol

Graph Store Protocol

Version Control

Advanced Features

Performance & Scalability

Operations

Example: Basic Query

Example: Versioned Update

Example: Time-Travel Query

Example: Branch and Merge

API Documentation

Protocol Documentation

Project Structure

Conformance

Technology Stack

Building

Quick Build

Build Options

Test Logging

Development Status

Documentation

Contributing

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages