The execution firewall
for agents
Shell commands and MCP tool calls evaluated by deterministic policy before execution. Native adapters can emit the same action envelope for file, network, and process operations. Stop covered agent side effects from going rogue.
Features
Shell Interception
Protected commands, evaluated.
PATH-prepended shims catch common agent-spawned commands before execution. Git, curl, npm, python, cloud CLIs, package managers, and more. The real binary only runs after policy approval.
MCP Tool Proxy
Policy for tool calls.
Stdio man-in-the-middle on JSON-RPC tools/call. Read tools can pass, side effects can pause, and dangerous payloads can be denied before reaching the server.
Policy Engine
Deterministic decisions.
Three built-in profiles: default, strict, and permissive. Or write your own in YAML. Sub-millisecond evaluation per action.
Human Escalation
Humans stay in the loop.
When the gateway is uncertain, it pauses and prompts the human inline. Allow, deny, or end the session entirely.
Anti-Pattern Detection
Behavioral threat modeling.
Sliding window detection catches credential harvesting, sandbox probing, self-modification, data exfiltration, and history tampering.
Circuit Breaker
Session-level kill switch.
Critical pattern matches trip a circuit breaker, halting all further actions. No recovery without human restart. Rate limiting built in.
Full Audit Trail
Every decision, recorded.
Append-only JSONL logs for every action evaluation. Session ID, timestamps, decisions, reasons, sub-millisecond latency. Tail live or query after.
Configuration
Declarative policy as YAML
Define what's allowed, denied, and what requires confirmation. Patterns, risk thresholds, and temporal rules. All in a single file.
Default Policy
Blocked by default
Pipe-to-shell patterns are always blocked, regardless of policy.
Integrations
Works with your stack
Cursor, Claude Code, or any agent that runs shell commands or MCP tools. Three lines of config.
Cursor
Shell commands intercepted automatically. Wrap MCP servers in your config.
Claude Code
Wrap the process so protected commands Claude spawns are intercepted.
Codex
Layer Blackwall around Codex while keeping the native sandbox and approvals enabled.
Any Agent
Wrap any agent process that launches shell commands or stdio MCP servers.
Temporal Detection
Catches what permissions can't
Some attacks look harmless as individual actions. Blackwall detects multi-step sequences over sliding windows, derived from documented frontier model behavior.
Reading /proc then environment data
Process info → network config → external request
Reading then writing MCP/tool configuration
Workspace reads then unknown network requests
Git operations then writing .git internals
Architecture
Four layers of evaluation
Each action passes through four deterministic stages. Sub-millisecond latency. No inference.
Circuit Breakers
Session halted? Rate limit exceeded?
Permissions
Deny → Confirm → Allow → Default deny
Pattern Matching
Sliding window anti-pattern detection
Risk Scoring
Cumulative score with pause/halt thresholds
.-#+-+#--#. +-. ..-##.. .+.##-+##-..-.-. .##-. -## .### .##. ####. .##. +#+ .+#+--++##-..##. -+###+-. +#++-..-+...++#+--##-. ##+ ####- ##- ###-##. .##. ### .+#+.---+#-..##. -##+--### #+. . .+######+ .##.-## ## ### ### -##. .##. ### -###-+#+###--###++-....-##########..###+.. -#---####++###.. .####. +#### ##########..#######-######## .-......-. .----.... ... .. ...--..--. .... ... .... ... .. ... ....... ........
Stop your AI going rogue
Open source. MIT licensed. Install in under a minute.