Zero-Trust Gatekeeper

The Zero-Trust Gatekeeper is SecureShell's core security feature. It uses an LLM to evaluate commands before execution, treating every command as untrusted until validated.

How It Works

For YELLOW or RED risk commands, the gatekeeper:

Receives command + reasoning from agent
Analyzes command in context of risk, OS, and security policy
Makes decision: ALLOW, DENY, or CHALLENGE
Returns reasoning explaining the decision

Decision Types

ALLOW - Command is safe to execute

DENY - Command is too dangerous or inappropriate

CHALLENGE - Needs clarification (treated as DENY, agent can provide better reasoning and retry)

Platform-Aware Evaluation

The gatekeeper understands platform differences:

// On Windows
await shell.execute('ls -la', 'List files');
// Decision: DENY
// Reason: "ls is Unix-only. Use 'dir' on Windows."

This helps agents learn and self-correct automatically.

Configuration

Enable/Disable:

const shell = new SecureShell({
    config: { gatekeeperEnabled: true } // default
});

Risk Threshold - Control which commands trigger gatekeeper:

const shell = new SecureShell({
    config: { riskThreshold: 'YELLOW' } // YELLOW and RED (default)
});

Options: 'GREEN' (all), 'YELLOW' (default), 'RED' (only high-risk)

LLM Provider:

// Fast: OpenAI
new SecureShell({ provider: new OpenAIProvider({ apiKey: '...', model: 'gpt-4o-mini' }) });

// Strong reasoning: Anthropic
new SecureShell({ provider: new AnthropicProvider({ model: 'claude-3-5-sonnet-20241022' }) });

// Private: Local model
new SecureShell({ provider: new OllamaProvider({ model: 'llama3' }) });

Fail-Safe Behavior

const shell = new SecureShell({
    config: { failClosed: true } // default: deny on error
});

failClosed: true - Deny if gatekeeper fails (safe default)
failClosed: false - Allow if gatekeeper fails (risky)

Agent Learning

Agents learn from gatekeeper feedback:

Turn 1: Agent tries "ls -la" → DENY ("Use 'dir' on Windows")
Turn 2: Agent tries "dir" → ALLOW → Success!

Example Evaluations

Safe development:

Command: git commit -m "Fix bug"
Decision: ALLOW
Reason: "Standard version control, no destructive effects"

Dangerous with poor reasoning:

Command: rm -rf /var
Decision: DENY
Reason: "Would break system. Never acceptable."

Scoped deletion:

Command: rm -rf build
Decision: ALLOW
Reason: "Removing ./build is standard for cleaning artifacts"

Security Templates

Templates configure gatekeeper strictness:

Paranoid - Denies almost everything, extreme caution
Production - Balanced, allows common operations
Development - Permissive for dev workflows
CI/CD - Optimized for automation

See Security Templates.

Best Practices

Good reasoning - Help gatekeeper understand intent
Choose appropriate template - Match your use case
Monitor logs - Review decisions in audit trail
Let agents learn - Don't bypass, let them self-correct

Next Steps

Risk Classification - How commands are categorized
Security Templates - Pre-configured policies
Audit Logging - Track decisions

How It Works

When a YELLOW or RED risk command is submitted, the gatekeeper:

Receives command + reasoning from the agent
Analyzes the command in context of:
- Risk classification
- Operating system
- Agent's stated reasoning
- Security policy
Makes decision: ALLOW, DENY, or CHALLENGE
Returns reasoning explaining the decision

Decision Types

ALLOW

Command is safe to execute.

Example:

Command: mkdir ./build
Reasoning: Creating build output directory

Gatekeeper Decision: ALLOW
Gatekeeper Reasoning: "Creating a build directory is a standard development practice and poses no risk"

DENY

Command is too dangerous or inappropriate.

Example:

Command: rm -rf /
Reasoning: Clean up files

Gatekeeper Decision: DENY
Gatekeeper Reasoning: "This command would delete the entire filesystem. Never acceptable regardless of reasoning."

CHALLENGE

Command needs clarification or confirmation.

Example:

Command: rm -rf node_modules
Reasoning: Clean up

Gatekeeper Decision: CHALLENGE
Gatekeeper Reasoning: "Please confirm: Are you sure you want to delete node_modules? This will require reinstalling all dependencies."

Note: CHALLENGE is treated as DENY in current implementation. Agent can provide better reasoning and retry.

Gatekeeper Prompt

The gatekeeper receives a structured prompt:

You are a security gatekeeper evaluating shell commands.

Command: {command}
Risk Tier: {tier}
Agent Reasoning: {reasoning}
OS: {os_info}

Evaluate this command and respond with:
- ALLOW: Safe to execute
- DENY: Too dangerous
- CHALLENGE: Needs clarification

Provide clear reasoning for your decision.

Platform-Aware Evaluation

The gatekeeper understands platform differences:

On Windows:

await shell.execute('ls -la', 'List files');
// Decision: DENY
// Reasoning: "ls is a Unix command. On Windows, use 'dir' instead."

On Linux:

await shell.execute('dir', 'List files');
// Decision: DENY  
// Reasoning: "dir is a Windows command. On Linux, use 'ls' instead."

This helps agents learn and self-correct.

Configuration

Enable/Disable

// TypeScript
const shell = new SecureShell({
    config: {
        gatekeeperEnabled: true // default
    }
});

# Python
shell = SecureShell(
    config=SecureShellConfig(gatekeeper_enabled=True)
)

Risk Threshold

Control which commands trigger gatekeeper:

const shell = new SecureShell({
    config: {
        riskThreshold: 'YELLOW' // Evaluate YELLOW and RED
    }
});

Options:

'GREEN' - Evaluate everything
'YELLOW' - Evaluate YELLOW and RED (default)
'RED' - Only evaluate RED

LLM Provider

Choose which LLM powers the gatekeeper:

// OpenAI (fast, reliable)
new SecureShell({
    provider: new OpenAIProvider({ 
        apiKey: '...', 
        model: 'gpt-4o-mini' 
    })
});

// Anthropic (strong reasoning)
new SecureShell({
    provider: new AnthropicProvider({ 
        apiKey: '...', 
        model: 'claude-3-5-sonnet-20241022' 
    })
});

// Local (private, offline)
new SecureShell({
    provider: new OllamaProvider({ 
        model: 'llama3',
        endpoint: 'http://localhost:11434'
    })
});

Fail-Safe Behavior

If the gatekeeper fails (API error, timeout, etc.):

const shell = new SecureShell({
    config: {
        failClosed: true // default: deny on error
    }
});

failClosed: true (default) - Deny command if gatekeeper fails
failClosed: false - Allow command if gatekeeper fails (risky!)

Agent Learning

The gatekeeper helps agents learn by providing:

Clear error messages - "ls is not available on Windows"
Alternatives - "Use 'dir' instead"
Context - "This command would delete your entire filesystem"

Agents can use this feedback to self-correct:

Iteration 1:
Agent: "Run ls -la"
Gatekeeper: DENY - "Use 'dir' on Windows"

Iteration 2:
Agent: "Run dir"
Gatekeeper: ALLOW
Success!

Example Evaluations

Safe Development Command

Command: git commit -m "Fix bug"
Reasoning: Committing code changes
Risk: YELLOW

Decision: ALLOW
Reasoning: "git commit is a standard version control operation with no destructive effects"

Dangerous Command with Poor Reasoning

Command: rm -rf /var
Reasoning: Cleaning up
Risk: RED

Decision: DENY
Reasoning: "Deleting /var would break system functionality. This is never acceptable."

Ambiguous Command

Command: rm -rf build
Reasoning: Clean build artifacts
Risk: RED

Decision: ALLOW
Reasoning: "Removing ./build directory is a common development practice for cleaning build artifacts. The path is relative and scoped."

Platform Mismatch

Command: powershell -Command "Remove-Item file.txt"
Reasoning: Delete file
Risk: YELLOW
OS: Linux

Decision: DENY
Reasoning: "PowerShell is not typically available on Linux. Use 'rm file.txt' instead."

Security Templates & Gatekeeper

Templates configure gatekeeper strictness:

Paranoid:

Denies almost everything
Requires extremely strong reasoning
Even GREEN commands evaluated

Production:

Balanced evaluation
Allows common operations
Strict on destructive commands

Development:

Permissive for dev workflows
Allows most YELLOW commands
Still blocks obvious dangers

CI/CD:

Optimized for automation
Allows build/deploy commands
Fast gatekeeper responses

See Security Templates.

Troubleshooting

Gatekeeper always denies

Cause: Too strict template or threshold.

Solution:

const shell = new SecureShell({
    template: 'development', // More permissive
    config: { riskThreshold: 'RED' } // Only evaluate RED
});

Gatekeeper disabled warning

Cause: No valid LLM provider configured.

Solution: Set API key and provider:

const shell = new SecureShell({
    provider: new OpenAIProvider({ apiKey: process.env.OPENAI_API_KEY })
});

Slow gatekeeper responses

Cause: LLM API latency.

Solutions:

Use faster model (gpt-4o-mini vs gpt-4)
Use local model (Ollama)
Increase timeout
Lower risk threshold

Best Practices

Good reasoning - Help gatekeeper understand intent
Specific commands - Avoid vague or overly complex commands
Template selection - Choose appropriate template for use case
Monitor logs - Review gatekeeper decisions in audit logs
Iterate - Let agents learn from denials

Next Steps

Risk Classification - How commands are categorized
Security Templates - Pre-configured policies
Audit Logging - Track gatekeeper decisions
Platform Awareness - OS-specific evaluation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zero-Trust Gatekeeper

How It Works

Decision Types

Platform-Aware Evaluation

Configuration

Fail-Safe Behavior

Agent Learning

Example Evaluations

Security Templates

Best Practices

Next Steps

How It Works

Decision Types

ALLOW

DENY

CHALLENGE

Gatekeeper Prompt

Platform-Aware Evaluation

Configuration

Enable/Disable

Risk Threshold

LLM Provider

Fail-Safe Behavior

Agent Learning

Example Evaluations

Safe Development Command

Dangerous Command with Poor Reasoning

Ambiguous Command

Platform Mismatch

Security Templates & Gatekeeper

Troubleshooting

Gatekeeper always denies

Gatekeeper disabled warning

Slow gatekeeper responses

Best Practices

Next Steps

FilesExpand file tree

gatekeeper.md

Latest commit

History

gatekeeper.md

File metadata and controls

Zero-Trust Gatekeeper

How It Works

Decision Types

Platform-Aware Evaluation

Configuration

Fail-Safe Behavior

Agent Learning

Example Evaluations

Security Templates

Best Practices

Next Steps

How It Works

Decision Types

ALLOW

DENY

CHALLENGE

Gatekeeper Prompt

Platform-Aware Evaluation

Configuration

Enable/Disable

Risk Threshold

LLM Provider

Fail-Safe Behavior

Agent Learning

Example Evaluations

Safe Development Command

Dangerous Command with Poor Reasoning

Ambiguous Command

Platform Mismatch

Security Templates & Gatekeeper

Troubleshooting

Gatekeeper always denies

Gatekeeper disabled warning

Slow gatekeeper responses

Best Practices

Next Steps