feat: ops warehouse, incident tools, monitoring bridge #5
Open
masonjames wants to merge 5 commits into agno-agi:main from
… base

Phase 3.1-3.2 of the Unified Platform Capsule Roadmap:
- Add ops_warehouse.sql migration with 8 tables: desired_services, actual_services, drift_observations, deploy_events, docker_events, incident_markers, update_status, state_snapshots
- Add agents_ops.py, an Ops-flavored Dash variant with separate knowledge base, ops-specific SQLTools connection, and operational instructions
- Add 8 semantic table JSONs for the ops warehouse knowledge layer
- Add ops_metrics.json with drift debt, deploy success rate, incident frequency, exposure multiplier business rules and gotchas
- Add ops_queries.sql with 10 seed validated queries (drift ledger, version triangulation, crash loop detection, platform health score, etc.)
- Register Ops Dash + Reasoning Ops Dash in AgentOS app

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
The ON CONFLICT (service_name, category) WHERE resolved_at IS NULL clause in the ETL requires a matching partial unique index. Without it, PostgreSQL rejects the UPSERT at runtime.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
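To illustrate why the index matters, here is a minimal sketch of the same pattern. It runs against SQLite through Python's standard `sqlite3` module rather than PostgreSQL, because SQLite accepts the same `ON CONFLICT (...) WHERE ...` conflict target and also rejects it when no matching unique index exists. The table name `markers`, the `detail` column, and the index name `ux_markers_open` are hypothetical; only `service_name`, `category`, and `resolved_at` come from the commit message.

```python
import sqlite3


def upsert_open_marker(conn, service_name, category, detail):
    """Insert an open (unresolved) row, or update the existing open row."""
    conn.execute(
        """
        INSERT INTO markers (service_name, category, detail, resolved_at)
        VALUES (?, ?, ?, NULL)
        ON CONFLICT (service_name, category) WHERE resolved_at IS NULL
        DO UPDATE SET detail = excluded.detail
        """,
        (service_name, category, detail),
    )


conn = sqlite3.connect(":memory:")
# Hypothetical table; the real schema lives in the PR's migration file.
conn.execute(
    "CREATE TABLE markers ("
    " service_name TEXT NOT NULL,"
    " category TEXT NOT NULL,"
    " detail TEXT,"
    " resolved_at TEXT)"
)

# Without a matching unique index, the conflict target cannot be resolved
# and the statement is rejected.
try:
    upsert_open_marker(conn, "web", "crash_loop", "first")
    failed_without_index = False
except sqlite3.OperationalError:
    failed_without_index = True

# Partial unique index: at most one open row per (service_name, category).
conn.execute(
    "CREATE UNIQUE INDEX ux_markers_open"
    " ON markers (service_name, category)"
    " WHERE resolved_at IS NULL"
)

upsert_open_marker(conn, "web", "crash_loop", "first")
upsert_open_marker(conn, "web", "crash_loop", "second")  # updates, no duplicate
rows = conn.execute("SELECT detail FROM markers").fetchall()
```

The index predicate has to match the `WHERE` in the conflict target. The partial index also lets resolved rows for the same service and category accumulate as history while keeping a single open row.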
… pack

- incidents.py: search/create/timeline tools for ops warehouse incidents
- infra_agent.py: bridge tool to query dockhand infra-agent endpoints
- knowledge_pack.py: knowledge document listing and retrieval tool
- ops_unified_timeline.json: unified timeline knowledge table schema
- Extended ops_queries.sql with incident and timeline queries
- Extended ops_warehouse.sql with incidents and timeline tables
- Registered new tools in agents_ops.py and tools/__init__.py

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
… bridge

- prometheus_query: PromQL queries for metrics (CPU, memory, request rates)
- loki_query: LogQL queries for log analysis
- grafana_alerts: active/pending/resolved alert statuses
- docker_state: container/service state for managed hosts
- Updated agent instructions to document the 4 new tools
- Updated tests: expected tool count 7 → 11, 23 tests passing

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Summary
Includes idempotent migration at db/migrations/ops_warehouse.sql.

Test plan
psql -U ai -d ai < db/migrations/ops_warehouse.sql
\dt shows desired_services, drift_observations, etl_runs

🤖 Generated with Claude Code