
An on-premise semantic radar for your log stream that automatically detects novel incidents and learns from feedback.


🛰️ VeriStamp Incident Atlas (VIA)

A Qdrant "Think Outside the Bot" Hackathon Submission

VeriStamp Incident Atlas (VIA) is a real-time, on-premise log intelligence platform designed to showcase the advanced, non-AI capabilities of Qdrant for observability. Instead of a typical RAG chatbot, VIA treats the entire log stream as a living system, using vector search to detect behavioral anomalies in its "rhythm".

Innovative Use of Qdrant Features

This project moves beyond simple semantic search to highlight Qdrant's power as a core operational intelligence engine:

  • Two-Tiered Detection: A high-throughput Tier 1 monitor uses "Rhythm Hashing" for real-time anomaly detection, promoting significant events to a permanent Tier 2 knowledge graph.
  • Advanced Triage Engine: The "Atlas" UI is powered by Qdrant's Recommendation API, allowing operators to provide positive and negative examples to surgically isolate the root cause of an incident.
  • Automated Incident Clustering: We use Qdrant's Grouping API to dynamically cluster related log events into unique incidents directly within the database, without external ML models.
  • Scalable by Design: The architecture uses time-partitioned collections and a federated query layer to handle massive data volumes, with Tier 2 collections using on-disk storage and scalar quantization for a minimal memory footprint.
  • Adaptive Control Loop: A complete feedback system allows operators to "Snooze" alerts or "Mark as Normal" to permanently patch the detection engine, with every patch generating a new regression test case.
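To make the triage idea concrete: Qdrant's Recommendation API takes positive and negative point IDs in the request body of `POST /collections/{collection}/points/recommend`. A minimal sketch of building such a payload (the IDs and limit here are illustrative, not taken from VIA's code):

```python
import json

def build_recommend_payload(positive_ids, negative_ids, limit=10):
    """Build the JSON body for Qdrant's recommend endpoint:
    POST /collections/{collection}/points/recommend
    Positive examples pull results toward an incident's root cause;
    negative examples push unrelated noise away."""
    return {
        "positive": list(positive_ids),  # point IDs the operator marked relevant
        "negative": list(negative_ids),  # point IDs the operator marked irrelevant
        "limit": limit,
        "with_payload": True,
    }

payload = build_recommend_payload([101, 102], [205], limit=5)
print(json.dumps(payload, indent=2))
```

This is the request shape the "Atlas" UI would assemble as the operator clicks examples; Qdrant does the vector arithmetic server-side.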

For a detailed breakdown of how VIA leverages these and other Qdrant features, see QDRANT-FEATURES.md.

Core Features

Two-Tiered Anomaly Detection:

  • Tier 1 (Rhythm Monitor): A high-throughput, in-memory monitor that analyzes the behavioral patterns of all logs (including INFO/DEBUG) using "Rhythm Hashing" to detect novel anomalies.
  • Tier 2 (Forensic Index): A permanent, time-partitioned knowledge graph where high-signal events from Tier 1 are promoted for deep, federated analysis and historical correlation.
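To make "Rhythm Hashing" concrete, here is a toy sketch of the idea (an illustration, not VIA's actual implementation): variable tokens in a log line are masked, and the remaining template is hashed so that behaviorally identical lines collapse to one signature:

```python
import hashlib
import re

def rhythm_hash(line: str) -> str:
    """Collapse a log line to a behavioral signature.
    IPs, hex IDs, and numbers are masked so that lines differing
    only in variable fields hash to the same value."""
    template = re.sub(r"\b\d+\.\d+\.\d+\.\d+\b", "<IP>", line)   # IPv4 addresses
    template = re.sub(r"\b0x[0-9a-fA-F]+\b", "<HEX>", template)  # hex literals
    template = re.sub(r"\d+", "<NUM>", template)                 # remaining digits
    return hashlib.sha1(template.encode()).hexdigest()[:16]

# Two lines with the same "rhythm" but different variable fields:
a = rhythm_hash("GET /api/v1/users/1234 returned 200 in 35ms")
b = rhythm_hash("GET /api/v1/users/9876 returned 200 in 41ms")
print(a == b)  # True: identical template, identical hash
```

A monitor counting hashes per time window can then flag a hash it has never seen before as a novel anomaly, without any model inference on the hot path.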

Adaptive Control Loop:

A complete feedback system that allows operators to "Snooze" alerts for temporary relief or "Mark as Normal" to permanently patch the detection engine, creating a robust evaluation harness.
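The distinction between the two actions can be sketched as follows (field names here are assumptions for illustration, not VIA's actual rule schema): a "Snooze" rule carries an expiry, while a "Mark as Normal" patch is permanent and emits an eval case for the regression harness:

```python
import time

def make_rule(rhythm_hash: str, action: str, snooze_sec: int = 3600) -> dict:
    """Illustrative control-loop rule.
    'suppress' rules expire after snooze_sec; 'patch' rules are
    permanent and carry an eval case so the decision can be
    regression-tested against future engine changes."""
    rule = {"rhythm_hash": rhythm_hash, "action": action}
    if action == "suppress":
        rule["expires_at"] = time.time() + snooze_sec  # temporary relief
    elif action == "patch":
        rule["eval_case"] = {"hash": rhythm_hash, "expected": "normal"}
    return rule
```

On each Tier 1 analysis pass, the detector would drop any anomaly whose hash matches an unexpired suppress rule or an active patch rule.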

Streaming-First Architecture:

Designed to ingest data from real-time sources like OpenTelemetry (OTel) streams, with a modular, multi-service backend built on FastAPI.

Scalable by Design:

Leverages time-partitioned collections in Qdrant, managed by a federated query layer in the API, allowing the system to scale to terabytes of daily log volume.
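Time partitioning of this kind is typically implemented by naming one collection per day and fanning a query out across every collection that overlaps the requested window. A minimal sketch of the planning step (the `tier2_YYYY-MM-DD` naming scheme is an assumption for illustration):

```python
from datetime import datetime, timedelta, timezone

def collections_for_range(start_ts: int, end_ts: int, prefix: str = "tier2") -> list[str]:
    """Return the daily collection names overlapping [start_ts, end_ts].
    A federated query layer runs the same query against each collection
    and merges the results by score."""
    day = datetime.fromtimestamp(start_ts, tz=timezone.utc).date()
    last = datetime.fromtimestamp(end_ts, tz=timezone.utc).date()
    names = []
    while day <= last:
        names.append(f"{prefix}_{day.isoformat()}")
        day += timedelta(days=1)
    return names

# A 1-hour window straddling midnight UTC touches two daily collections:
print(collections_for_range(1717459200 - 1800, 1717459200 + 1800))
```

Old collections can then be dropped wholesale to enforce retention, which is far cheaper than deleting individual points.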

Dynamic Schema Engine:

A complete data onboarding system that makes VIA adaptable to new log formats. Instead of requiring manual configuration, users can upload a sample log file through the "Data Sources" UI tab. The backend analyzes the structure—supporting JSON, OTel, and BGL formats—and suggests a parsing schema, which can then be saved and used for future ingestion.
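As an illustration of what such detection might look like (a toy sketch, not VIA's actual engine): inspect a sample line, and if it parses as a JSON object suggest its keys as fields, otherwise fall back to a whitespace-delimited layout like BGL's:

```python
import json

def suggest_schema(sample: str) -> dict:
    """Suggest a parsing schema from one raw log line.
    JSON lines yield their keys as fields; anything else is treated
    as whitespace-delimited (as in BGL supercomputer logs)."""
    try:
        record = json.loads(sample)
        if isinstance(record, dict):
            return {"format": "json", "fields": sorted(record)}
    except ValueError:
        pass
    tokens = sample.split()
    return {"format": "whitespace", "fields": [f"col_{i}" for i in range(len(tokens))]}

print(suggest_schema('{"level": "INFO", "msg": "ok", "ts": 1717459200}'))
print(suggest_schema("1117838570 2005.06.03 R02-M1-N0-C:J12-U11 RAS KERNEL INFO"))
```

The suggested schema is then shown to the user for confirmation before being saved, so a wrong guess costs one click rather than a misconfigured pipeline.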

Architecture Overview

VIA is composed of three main services designed to run locally or in a containerized environment.

Diagram-in-Words: Data Flow

+----------------+   +-------------------+   +----------------------------+
| OTel Mock API  |-->|    VIA Backend    |-->| Tier 1: Rhythm Monitor     |
| (Streaming)    |   | (FastAPI Service) |   | (Qdrant, Ephemeral)        |
+----------------+   +-------------------+   +----------------------------+
                               |                      | (Anomalous Events)
   (API Calls) <---------------+                      v
                               |          +----------------------------+
                               +--------->| Tier 2: Forensic Index     |
                                          | (Qdrant, Daily Collections)|
                                          +----------------------------+
  • OTel Mock Service: Simulates a real-time stream of enterprise logs in a structured OTel format.
  • VIA Backend: The core FastAPI application, now refactored into a modular, multi-service architecture. It handles ingestion, analysis, and the control loop.
  • Qdrant: The vector database, used to power both the ephemeral Tier 1 monitor and the permanent, time-partitioned Tier 2 knowledge graph.

Getting Started

1. Prerequisites

  • Python 3.12+
  • Docker and Docker Compose

2. Configuration

Create a .env file in the root of the project. You can copy the contents from .env.example if it exists, or use the following:

# .env

# --- Backend & Qdrant Configuration ---
# These variables configure the main FastAPI application (app/main.py)
# See app/core/config.py for all options 
QDRANT_HOST="localhost"
QDRANT_PORT=6333
QDRANT_REPLICATION_FACTOR=2 # Must be <= number of qdrant nodes in docker-compose.yml 
QDRANT_SHARD_NUMBER=2
REGISTRY_DB_PATH="registry.db"

# --- OTel Mock Streamer Configuration ---
# These variables configure the log generator (otel_mock/main.py)
INGESTOR_URL="http://localhost:8000/api/v1/ingest/stream"
LOGS_PER_SECOND=100
MAX_BATCH_SIZE=100
MAX_BATCH_INTERVAL_SEC=0.5

3. Installation

This project uses uv for fast dependency management. If you don't have it, install it first:

# Install uv (if you haven't already)
pip install uv

Then, install the project dependencies:

uv pip install -r requirements.txt
or
pip install -r requirements.txt

4. Running the System Locally

You will need four separate terminal windows to run the full system: Qdrant, the mock streamer, the API backend, and the UI.

Terminal 1: Start Qdrant

docker-compose up

This starts Qdrant and makes it available at http://localhost:6333.

Terminal 2: Start the OTel Mock Streamer

This service will begin streaming log data to the main API.

uvicorn otel_mock.main:app --host 127.0.0.1 --port 8002 --reload

Terminal 3: Start the Main VIA API Backend

This runs the core application. On startup, it will initialize the necessary databases and Qdrant collections.

uvicorn app.main:app --host 127.0.0.1 --port 8000 --reload

The API is now live and available at http://localhost:8000.

Terminal 4: Start the UI

Run the UI with either command:

uv run ui.py
or
python ui.py

5. Testing the End-to-End Flow

With all services running, you can use curl to interact with the API.

Step A: Check the Health

curl http://localhost:8000/health

Step B: Analyze Tier 1 for Novel Anomalies

After letting the streamer run for about 30-60 seconds, check for rhythm anomalies. This call will detect novel patterns and automatically promote them to Tier 2.

curl -X POST http://localhost:8000/api/v1/analysis/tier1/rhythm_anomalies \
-H "Content-Type: application/json" \
-d '{"window_sec": 300}'

Step C: Query Tier 2 for Promoted Events

Check the permanent forensic index for the events that were just promoted from Tier 1. Note: the best way to explore Tier 2 is through the interactive UI!

# Get the current Unix timestamp
# (On Linux/macOS: `date +%s`, on Windows you may need to get it manually)
END_TS=$(date +%s)
START_TS=$((END_TS - 3600)) # Look back 1 hour

curl -X POST http://localhost:8000/api/v1/analysis/tier2/clusters \
-H "Content-Type: application/json" \
-d "{\"start_ts\": $START_TS, \"end_ts\": $END_TS}"

Step D: Test the Control Loop

If Step B found an anomaly, take its rhythm_hash and use the control endpoint to suppress it.

# Replace 'your_hash_here' with an actual hash from the Tier 1 response
RHYTHM_HASH="your_hash_here"

curl -X POST http://localhost:8000/api/v1/control/suppress \
-H "Content-Type: application/json" \
-d "{\"rhythm_hash\": \"$RHYTHM_HASH\"}"

If you run the Tier 1 analysis again (Step B), this anomaly should no longer appear.

Step E: Test the Dynamic Schema Engine

You can test the schema detection endpoint with a sample log line to see the suggested structure.

curl -X POST http://localhost:8000/api/v1/schemas/detect \
-H "Content-Type: application/json" \
-d '{
  "source_name": "BGL_Test",
  "sample_logs": [
    "1117838570 2005.06.03 R02-M1-N0-C:J12-U11 2005-06-03-15.42.50.675872 R02-M1-N0-C:J12-U11 RAS KERNEL INFO instruction cache parity error corrected"
  ]
}'

API Endpoints Overview

All endpoints are prefixed with /api/v1.

Ingestion & Streaming

  • POST /ingest/stream: Endpoint for the OTel streamer to send log batches.
  • GET /stream/tail: Tails the live log file for the UI, with support for text filtering.

Analysis

  • POST /analysis/tier1/rhythm_anomalies: Detects novel and frequency-based patterns in Tier 1 and promotes them.
  • POST /analysis/tier2/clusters: Retrieves and groups promoted event clusters from the Tier 2 forensic index.
  • POST /analysis/tier2/triage: Finds similar past events using positive/negative examples from the Tier 2 knowledge graph.

Adaptive Control Loop

  • POST /control/suppress: Temporarily snoozes a specific rhythm_hash.
  • POST /control/patch: Permanently marks a rhythm_hash as normal and generates an eval case.
  • GET /control/rules: Fetches all active patch and suppression rules.
  • DELETE /control/patch/{hash}: Deactivates a permanent patch rule.
  • DELETE /control/suppress/{hash}: Removes a temporary suppression rule.

Schema Management

  • POST /schemas/detect: Suggests a schema from a sample of raw logs.
  • POST /schemas: Saves or updates a schema configuration.
  • GET /schemas: Lists the names of all saved schemas.
  • GET /schemas/{source_name}: Retrieves a specific saved schema by name.

Technology Stack

  • Backend: FastAPI, Python 3.12+
  • Vector Database: Qdrant
  • Embeddings: fastembed with ONNX models
  • Data Schemas: Pydantic
  • Registries: SQLite
  • Local Services: Docker Compose, Uvicorn

Roadmap

  • Build a production-grade Vite/React frontend for visualization and interaction.
  • Expand the "Rhythm Hashing" engine to include frequency-based anomalies.
  • Add real-world ingestion sources (e.g., Kafka consumer, direct OTel collector integration).
  • Integrate with ChatOps tools (Slack) for proactive alerting.

License


This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.
