# AI Agent Cost Overruns: Why They Happen and How to Prevent Them
AI agents are moving from demos to production. With that shift comes a problem nobody talks about until it hits their invoice: cost overruns averaging 340% on autonomous tasks.
A research agent tasked with a $2 job returns a $9 bill. A code review agent loops 47 times on a single file. A customer support agent escalates to GPT-4 for every message, burning through $200 in an afternoon. These are not edge cases. They are the default behavior of autonomous agents without runtime budget enforcement.
## The Scale of the Problem
LLM API costs are deceptively linear in documentation and exponential in practice. Here is why:
+ +-
+
- Agents decide their own workload. Unlike a batch job with a fixed input set, an autonomous agent generates its own next steps. A "summarize this document" task can become "read 50 pages, cross-reference 12 sources, draft 3 versions" if the agent decides that is necessary. +
- Context windows grow with every step. Each tool call result gets appended to the conversation. By step 20, the agent is sending 100K tokens per request -- mostly prior context that the model charges you to re-read. +
- Failures are invisible. An agent stuck in a loop looks like an agent that is "thinking." There is no error. There is no timeout. There is just a growing bill. +
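The second point compounds quietly: because each request re-sends all prior context, total spend grows quadratically with step count even when each step adds a constant amount of new text. A minimal sketch, using illustrative token counts and prices rather than real API rates:

```python
# Illustrative sketch: cost of an agent whose context grows every step.
# Token counts and prices are assumptions for demonstration, not real rates.
PRICE_PER_1K_INPUT_TOKENS = 0.01   # hypothetical $/1K input tokens
TOKENS_PER_STEP = 5_000            # new tokens appended at each step

def cumulative_cost(steps: int) -> float:
    """Each request re-sends all prior context, so input size grows
    linearly per step and total spend grows quadratically overall."""
    total_tokens = sum(step * TOKENS_PER_STEP for step in range(1, steps + 1))
    return total_tokens * PRICE_PER_1K_INPUT_TOKENS / 1_000

# 4x the steps is roughly 14x the bill under these assumptions.
print(round(cumulative_cost(5), 2), round(cumulative_cost(20), 2))
```

Linear thinking ("20 steps at step-one prices") badly underestimates the bill, which is exactly why the overrun surprises people.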
**Real scenario:** A LangChain ReAct agent with access to a web search tool was asked to "find the best restaurant in Austin." It called the search tool 83 times, each time refining its query, spending $47 on a task that should have cost $0.50.
## Three Failure Modes That Drain Budgets
**1. The identical-call loop.** The agent calls the same tool with the same arguments, gets the same result, and tries again. This is common with ReAct agents that misinterpret tool output as an error. Each loop iteration costs $0.03-0.30 depending on context size.
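Detecting this mode only requires a short history of (tool, arguments) pairs. The sketch below is a generic illustration of that idea with hypothetical names; it is not AgentGuard's internal implementation:

```python
from collections import deque

class LoopDetected(Exception):
    """Raised when the same tool call repeats too often in a short window."""

class SimpleLoopDetector:
    # Hypothetical illustration of loop detection; all names are our own.
    def __init__(self, max_repeats: int = 3, window: int = 6):
        self.max_repeats = max_repeats
        self.history = deque(maxlen=window)  # only the last `window` calls

    def record(self, tool: str, args: dict) -> None:
        # Normalize the call into a hashable key so identical calls compare equal.
        key = (tool, tuple(sorted(args.items())))
        self.history.append(key)
        if self.history.count(key) >= self.max_repeats:
            raise LoopDetected(
                f"{tool} called {self.max_repeats}x with identical args"
            )

detector = SimpleLoopDetector()
detector.record("search", {"q": "best restaurant austin"})
detector.record("search", {"q": "best restaurant austin"})
# A third identical call would raise LoopDetected instead of burning budget.
```

The sliding window matters: an agent that legitimately re-queries a tool hours apart should not trip the detector, only tight repetition should.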
**2. The retry spiral.** The agent encounters an error and retries with progressively longer prompts. "Add more context" is the default recovery strategy for most LLMs, so each retry is more expensive than the last because the context window keeps growing.
**3. The model cascade.** The agent decides its current model is not capable enough and routes to a more expensive one: GPT-3.5 to GPT-4, Claude Haiku to Opus. A single cascade can 10x the cost of a step, and the agent may cascade on every step.
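A blunt but effective defense against cascades is a model allowlist checked before every request, so an escalation fails fast instead of silently multiplying cost. This is a hypothetical sketch of the pattern, not any particular library's API:

```python
# Hypothetical model allowlist: block silent escalation to pricier models.
ALLOWED_MODELS = {"gpt-3.5-turbo", "claude-haiku"}

class ModelEscalationBlocked(Exception):
    """Raised when the agent tries to route to a model outside the allowlist."""

def check_model(requested: str) -> str:
    # Refuse any model outside the allowlist rather than letting the agent
    # cascade to a far more expensive option mid-run.
    if requested not in ALLOWED_MODELS:
        raise ModelEscalationBlocked(f"{requested} is not in the allowlist")
    return requested

check_model("gpt-3.5-turbo")   # passes
# check_model("gpt-4")         # would raise ModelEscalationBlocked
```

If some tasks genuinely need the larger model, the allowlist can be widened per task by a human, which is the point: escalation becomes a deliberate decision rather than an agent's whim.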
## Why Existing Tools Do Not Fix This
The AI observability market is full of tools that track costs. LangSmith, Langfuse, Portkey -- they all show you beautiful dashboards of what your agents spent. The problem is timing. These tools are post-hoc. They tell you what happened after the damage is done.
| Capability | Monitoring Tools | Runtime Enforcement |
|---|---|---|
| Track cost per call | Yes | Yes |
| Dashboard visualization | Yes | Optional |
| Stop agent at dollar limit | No | Yes |
| Detect infinite loops | No | Yes |
| Warn before budget hit | No | Yes |
| Enforce in CI/CD | No | Yes |
Monitoring tells you the house burned down. Enforcement prevents the fire. You need both, but enforcement is the one that saves money.
## The Solution: Runtime Budget Enforcement
Runtime enforcement means the guard runs inside your agent process. Every LLM call checks the budget before returning. When the limit is hit, the guard raises an exception and the agent stops immediately.

Here is what it looks like with AgentGuard:
```python
from agentguard import Tracer, BudgetGuard, LoopGuard, patch_openai

# Two guards: budget cap + loop detection
budget = BudgetGuard(max_cost_usd=5.00, warn_at_pct=0.8)
loops = LoopGuard(max_repeats=3, window=6)

tracer = Tracer(
    service="research-agent",
    guards=[budget, loops],  # auto-check on every event
)
patch_openai(tracer, budget_guard=budget)

# Agent runs normally. Guards enforce limits automatically.
# - BudgetExceeded raised at $5.00
# - LoopDetected raised if same tool called 3x in 6 events
```
Three things happen automatically:
+ +-
+
- Every OpenAI call is intercepted. Token usage and cost are extracted from the response and fed to the BudgetGuard. +
- Every tool call is checked for loops. The LoopGuard tracks the last N events and detects repeated patterns. +
- Exceptions propagate up.
BudgetExceededandLoopDetectedare standard Python exceptions. They stop the agent cleanly, no matter what framework you use.
+
**Key insight:** Guards that raise exceptions are fundamentally different from alerts. An alert requires a human to notice and act. An exception requires no human -- it stops the agent immediately, even at 3 AM.
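Concretely, the stop condition is ordinary exception handling. The snippet below sketches the pattern with stand-in definitions (the `run_agent` function and the `BudgetExceeded` class here are illustrative stubs, not a real SDK):

```python
class BudgetExceeded(Exception):
    """Stand-in for a guard exception raised when the dollar cap is hit."""

def run_agent(task: str) -> str:
    # Stand-in agent loop: pretend the third step blows the budget.
    raise BudgetExceeded("hit $5.00 cap on step 3")

# No human in the loop: the exception halts the run the moment it fires.
try:
    result = run_agent("summarize quarterly reports")
except BudgetExceeded as exc:
    result = f"stopped: {exc}"

print(result)
```

Because the guard surfaces as a plain exception, any framework's existing error handling (retries excluded, ideally) can log it, alert on it, or fall back to a cheaper plan.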
## The Cost of Inaction
Every day you run agents without budget enforcement is a day you are betting on best-case behavior from a system designed to be unpredictable. The math is simple:
+ +-
+
- 10 agents running tasks at $2 expected cost each +
- 340% average overrun = $6.80 per task actual +
- 50 tasks per day = $340/day instead of $100/day +
- $7,200 per month in overspend +
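The bullet arithmetic above checks out step by step:

```python
# Reproduce the overspend arithmetic from the bullets above.
tasks_per_day = 50
expected_cost_per_task = 2.00   # dollars
overrun_multiplier = 3.40       # actual spend is 340% of expected

actual_per_task = expected_cost_per_task * overrun_multiplier      # $6.80
daily_actual = tasks_per_day * actual_per_task                     # $340/day
daily_expected = tasks_per_day * expected_cost_per_task            # $100/day
monthly_overspend = (daily_actual - daily_expected) * 30           # $7,200/month

print(f"${actual_per_task:.2f}/task, ${daily_actual:.0f}/day, "
      f"${monthly_overspend:.0f}/month overspend")
```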
Adding a budget guard takes three lines of code and costs nothing. The SDK is free, MIT-licensed, and has zero dependencies.
## Stop overspending on AI agents
Three lines of Python. Zero dependencies. Hard budget limits that actually stop the agent.
```
pip install agentguard
```