ghx

GitHub execution router for AI agents. One typed capability interface over gh CLI + GraphQL.

%%{init: {'theme': 'base', 'themeVariables': {'primaryColor': '#4A90D9', 'primaryTextColor': '#fff', 'primaryBorderColor': '#2E6BA4', 'lineColor': '#666', 'fontSize': '13px'}}}%%
sequenceDiagram
    participant Agent
    participant ghx as ghx
    participant GH as GitHub API

    Agent->>ghx: executeTask("pr.view", { owner, name, prNumber })

    Note over ghx: 1. Validate input schema<br/>2. Select optimal route (GraphQL/CLI)<br/>3. Run preflight checks
    ghx->>GH: typed GraphQL request
    GH-->>ghx: response
    Note over ghx: normalize errors & structure

    ghx-->>Agent: ResultEnvelope { ok: true, data: { ... } }

ghx helps agents execute GitHub tasks without re-discovering API surfaces on every run. Agents call stable capabilities like repo.view or pr.merge; ghx handles route choice, retries, fallbacks, and normalized output.

30-Second Quick Start

Requirements: Node.js 22+, gh CLI authenticated, GITHUB_TOKEN or GH_TOKEN in env.

npm i -g @ghx-dev/core
ghx capabilities list
ghx capabilities explain repo.view
ghx run repo.view --input '{"owner":"aryeko","name":"ghx"}'

Then wire ghx into your agent:

# Claude Code — install from the plugin marketplace
/plugin marketplace add aryeko/ghx
/plugin install ghx@ghx-dev

# Cursor, Windsurf, Codex, other agents — install the skill
ghx setup --scope user --yes

Try without installing (npx)

npx @ghx-dev/core capabilities list
npx @ghx-dev/core run repo.view --input '{"owner":"aryeko","name":"ghx"}'
npx @ghx-dev/core setup --scope user --yes

Who is this for?

Claude Code users -- install from the plugin marketplace for automatic skill loading
Cursor / Windsurf / Codex users -- install globally and run ghx setup --scope user to get the agent skill
Custom agent builders -- import createExecuteTool() for typed GitHub access in your own agent framework

The Problem

Agents instructed to "use gh CLI" for GitHub operations waste significant tokens on research, trial-and-error, and output parsing:

Array parameter syntax is fragile. Submitting a PR review with inline comments via gh api requires comments[0][path], comments[][body], or heredoc piping. Agents try 3-15 syntaxes before one works.
API surface re-discovery every session. Each new session, the agent figures out which gh subcommands exist, what --json fields are available, and how to format GraphQL queries from scratch.
Output shapes vary by endpoint. REST, GraphQL, and gh CLI each return different structures. The agent spends tokens parsing and normalizing before it can reason about results.

Before / After

WITHOUT ghx -- agent submitting a PR review with inline comments (15 tool calls, 126s):

gh pr view 42 --repo acme/repo                          # read PR
gh pr diff 42 --repo acme/repo                          # read diff
gh api POST reviews -f event=REQUEST_CHANGES \           # attempt 1: 422 error
  -f 'comments[0][path]=src/stats.ts' ...
noglob gh api POST reviews ...                           # attempt 2: 422 error
python3 -c "import json; ..." | gh api --input -         # attempt 3: no inline comments
gh api POST reviews/comments -f path=src/stats.ts ...    # attempt 4-6: individual comments
gh api POST reviews -f event=REQUEST_CHANGES             # attempt 7: submit event
gh pr view 42 --json reviews                             # verify

WITH ghx -- same task (2 tool calls, 26s):

ghx chain --steps - <<'EOF'
[
  {"task":"pr.diff.view","input":{"owner":"acme","name":"repo","prNumber":42}},
  {"task":"pr.view","input":{"owner":"acme","name":"repo","prNumber":42}}
]
EOF
ghx run pr.reviews.submit --input - <<'EOF'
{
  "owner": "acme", "name": "repo", "prNumber": 42,
  "event": "REQUEST_CHANGES",
  "body": "Found blocking issues.",
  "comments": [
    {"path": "src/stats.ts", "line": 4, "body": "Empty array guard missing."},
    {"path": "src/stats.ts", "line": 8, "body": "Missing await on fetch."},
    {"path": "src/stats.ts", "line": 12, "body": "Hardcoded credential."}
  ]
}
EOF

Benchmarked Performance

Three-mode comparison (baseline vs MCP vs ghx) across 30 runs (2 scenarios, 5 iterations each, 3 modes) with Codex 5.3. ghx achieved 100% success rate.

Scenario	Tool calls	Active tokens	Latency
Reply to unresolved review threads	-73%	-18%	-54%
Review and comment on PR	-71%	-18%	-54%

Full methodology, per-iteration data, and statistical analysis: Evaluation Report

Chain: Batch Operations

ghx chain batches multiple operations into a single tool call. One command, batched execution, three operations:

ghx chain --steps - <<'EOF'
[
  {"task":"issue.labels.remove","input":{"owner":"acme","name":"repo","issueNumber":42,"labels":["triage","feature-request"]}},
  {"task":"issue.labels.add","input":{"owner":"acme","name":"repo","issueNumber":42,"labels":["enhancement"]}},
  {"task":"issue.comments.create","input":{"owner":"acme","name":"repo","issueNumber":42,"body":"Triaged -- tracking as enhancement."}}
]
EOF

Agents use chain to collapse multi-step workflows (label swap + comment, bulk thread resolve + reply, etc.) into a single tool call instead of sequential shell commands.

Example Output

{
  "ok": true,
  "data": {
    "id": "...",
    "name": "ghx",
    "nameWithOwner": "aryeko/ghx"
  },
  "error": null,
  "meta": {
    "capability_id": "repo.view",
    "route_used": "cli",
    "reason": "CARD_PREFERRED"
  }
}

Golden Workflow: CI Diagnosis

Diagnose a failing CI run, read logs, rerun, and merge:

ghx run workflow.run.view --input '{"owner":"acme","name":"repo","runId":123456}'
ghx run workflow.job.logs.view --input '{"owner":"acme","name":"repo","jobId":789012}'
ghx run workflow.run.rerun.failed --input '{"owner":"acme","name":"repo","runId":123456}'
ghx run pr.checks.list --input '{"owner":"acme","name":"repo","prNumber":14}'
ghx run pr.merge --input '{"owner":"acme","name":"repo","prNumber":14,"method":"squash"}'

Capabilities

70+ capabilities across 6 domains (full list).

Security and Permissions

Use least-privilege tokens and only grant scopes needed for the capabilities you execute.
For fast local evaluation, a classic PAT with repo scope is the simplest path.
For production agents, prefer fine-grained tokens with read permissions first (Metadata, Contents, Pull requests, Issues, Actions, Projects) and add write permissions only where required.
ghx reads GITHUB_TOKEN or GH_TOKEN from environment.

Packages

@ghx-dev/core (packages/core) -- public npm package; CLI + execution engine
@ghx-dev/agent-profiler (packages/agent-profiler) -- private; generic AI agent session profiler
@ghx-dev/eval (packages/eval) -- private; evaluation harness for ghx benchmarking

Documentation

Full documentation lives in docs/:

Core Documentation -- Getting started, architecture, capabilities, guides
Agent Profiler -- Profiler architecture, guides, API reference
Eval Harness -- Evaluation methodology, scenarios, fixtures
Contributing -- Development setup, testing, CI, publishing
Repository Structure -- Monorepo layout and module organization
Branding assets: assets/branding/README.md

Background

Read the full motivation and benchmark methodology: AI Agents Shouldn't Relearn GitHub on Every Run

Roadmap

Current roadmap priorities and capability batches are tracked in ROADMAP.md.

Contributing

See CONTRIBUTING.md for local setup, test commands, and PR expectations.

Tooling notes for local development:

gh CLI is required for CLI-backed execution paths (gh auth status).
opencode CLI is only required if you run E2E suites locally (pnpm run test:e2e); CI installs it via curl -fsSL https://opencode.ai/install | bash.

git clone https://github.com/aryeko/ghx.git && cd ghx
./scripts/setup-dev-env.sh
pnpm install
pnpm run build
pnpm run ci

Questions? Open a Discussion.

Name		Name	Last commit message	Last commit date
Latest commit History 127 Commits
.changeset		.changeset
.claude-plugin		.claude-plugin
.codex		.codex
.github		.github
assets/branding		assets/branding
docs		docs
packages		packages
scripts		scripts
.gitignore		.gitignore
.npmrc		.npmrc
.nvmrc		.nvmrc
.nxignore		.nxignore
CLAUDE.md		CLAUDE.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
biome.json		biome.json
codecov.yml		codecov.yml
eslint.config.mjs		eslint.config.mjs
lefthook.yml		lefthook.yml
nx.json		nx.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.base.json		tsconfig.base.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ghx

30-Second Quick Start

Who is this for?

The Problem

Before / After

Benchmarked Performance

Chain: Batch Operations

Example Output

Golden Workflow: CI Diagnosis

Capabilities

Security and Permissions

Packages

Documentation

Background

Roadmap

Contributing

License

About

Uh oh!

Releases 13

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ghx

30-Second Quick Start

Who is this for?

The Problem

Before / After

Benchmarked Performance

Chain: Batch Operations

Example Output

Golden Workflow: CI Diagnosis

Capabilities

Security and Permissions

Packages

Documentation

Background

Roadmap

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 13

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages