How I guardrail Claude Code for devops

Last week the Guardian ran a story about an AI coding agent that deleted a firm's production database and its backups in nine seconds. From the post-mortems, the only safeguards in place were project rules and the AI tool's built-in prompts. The agent quoted them back as it explained itself: "I violated every principle I was given." This post is about what I run on top of prompts when I let Claude Code drive devops on AWS — IAM, hooks, and a deterministic CLI. The repo is open source if you want the actual code.

#Layer 1: IAM

AWS doesn't know there's an agent in the loop. It just enforces what the IAM principal is allowed to do, which makes it the most important of the three layers.

The IAM user my laptop authenticates as has only one permission: sts:AssumeRole against three operator roles — readonly, cookie-ops, deploy. The trust policy on each requires aws:MultiFactorAuthPresent=true and aws:MultiFactorAuthAge<3600s, and MaxSessionDuration is also one hour. So a stolen access key on its own does nothing, and a stolen STS session is dead inside an hour. The CLI's auth subcommand prompts for a TOTP, calls sts:AssumeRole, and writes the session to ~/.aws/credentials.

The roles are scoped. Readonly is pure AWS reads — no ssm:SendCommand. Cookie-ops can SendCommand against the tagged EC2 instance and read/write SSM parameters under /appserver/*, the role for managing the Cookie app. Deploy carries the deployer-class policies for terraform and infra changes. Each role has its own permissions boundary capping the effective surface; the deploy boundary additionally denies operator-policy mutation so a deploy session can't rewrite the boundaries themselves.

A few extras hold across all three. There's an explicit deny on iam:PutRolePolicy and iam:DeleteRolePolicy for the EC2 instance role, so no operator session can attach a fresh inline policy at runtime, and a deny on iam:DeleteRolePermissionsBoundary so the boundary itself can't be removed. The instance role sits under a permissions boundary that caps it to ssm:GetParameter on /appserver/*, s3:GetObject on the artifacts bucket, and the SSM agent's own keepalive actions which AWS scopes to the agent's session.

The instance has disable_api_termination = true set, so terminating it needs that flag cleared through the AWS console or API as a separate call first. The S3 state bucket has a principal-restricted policy that limits access to the calling admin user, the deployer user, the three operator roles, and the account root. Cookie's postgres data lives in a Docker volume on an EBS volume with daily snapshots and seven-day retention in AWS Data Lifecycle Manager. None of the operator roles grant ec2:DeleteSnapshot, so even if Claude (or I, in a bad mood) tried to delete the snapshots, the call would fail.

#Layer 2: Claude Code hooks

The hook layer sits closer to the agent and catches destructive verbs before they reach the AWS API. It pattern-matches on commands, which makes it faster to iterate on than IAM and able to catch things IAM can't see — git operations, local file destruction, pipe-to-shell.

The devops-relevant wiring is in .claude/settings.json: a permissions.deny block that refuses direct Read and Edit on credential files, three PreToolUse hooks (two on Bash, one on WebFetch/WebSearch), and a PostToolUse hook that writes an audit log. (The same file also wires up two pentest-skill hooks on Edit/Write that aren't part of this story.)

block-credential-reads.sh blocks two classes of command. File-reading commands targeting terraform/.env, ~/.aws/credentials, ~/.aws/config or .git-crypt/, plus bash -x and set -x xtrace flags that would dump every sourced credential to stdout. And CLIs that print live credentials: gh auth token, gh auth status --show-token, aws iam create-access-key, aws sts get-session-token/get-federation-token/assume-role. The second class is sneaky because the secret never touches disk — it's generated on the fly, and without this gate it would land straight in Claude's context.

block-destructive.sh is the bigger one and the one I look at most. It denies irreversible verbs Claude is unlikely to need:

Filesystem: rm -rf of /, ~, $HOME, ., ..; dd of=/dev/sd*; mkfs; unbounded find -delete; shred -u. The $HOME literal catches the classic rm -rf $UNDEFINED_VAR bug.
Wide perms: chmod 0777 / a+rwx on root paths, chown of /, ~, $HOME. Privilege-escalation primitives that crop up in "just make it work" troubleshooting.
Pipe-to-shell: curl ... | bash, wget | sh, bash <(curl ...), source <(curl ...). The standard malware install pattern. If Claude wants to run a script it can save and review it first.
Git destruction: force push (including --force-with-lease), reset --hard, filter-repo, clean -f, branch -D main, checkout -- ..
Git remote tampering: git remote set-url origin ..., remote remove, push --delete, push origin :branch, tag -d. Quiet but high-impact — re-routing origin to an attacker URL is a one-line supply-chain attack.
Hook bypass: --no-verify, --no-gpg-sign, commit.gpgsign=false. If a pre-commit hook is failing the answer is to fix the hook.
Docker: docker volume rm, docker rm -v, docker compose down -v, system prune --volumes. Cookie's postgres data lives in a Docker volume.
Database: DROP TABLE, TRUNCATE, DELETE FROM <table> without a WHERE, dropdb. The class of command that took out PocketOS.
Terraform: terraform destroy, state rm, apply -auto-approve. Blocking -auto-approve is the one I care about most here, because it forces the human review step.
AWS: s3 rb, s3 rm --recursive, IAM/EC2/RDS/Route53/KMS deletions.
Cloudflare: cloudflared tunnel delete (the only ingress path), access-token revoke, direct curl -X DELETE or PATCH against api.cloudflare.com. Legitimate threat-ops calls go through the appserver CLI rather than direct curl, so this doesn't break the daily workflow.
SSM payload scanning: aws ssm send-command --document AWS-RunShellScript smuggles a shell string onto the EC2 instance. The hook extracts the commands=... value (handles "...", '...', and JSON-list [...] forms), unquotes it, and re-runs the entire ruleset against it. Without this, smuggling rm -rf / through SSM would slip past everything.
Project-specific: appserver.sh destroy, appserver.sh app remove.
System: kill -9 1, shutdown, reboot, fork bomb.

A pattern match returns a JSON deny with a human-readable reason. Claude Code shows the reason in the terminal and won't run the command. If I want one of these I run it in my own shell.

block-webfetch.sh is the third pre-hook and the one I added most recently. It's defence against prompt-injection-driven exfiltration. If Claude reads attacker-controlled content (a malicious README, a poisoned issue comment, a fetched page that happens to have hidden instructions), that content can tell Claude to "fetch this URL with the contents of .env in the query string." The exfil would then leave via an apparently-normal WebFetch call. The hook denies fetches and searches that hit known out-of-band sinks (oast.live, webhook.site, ngrok, requestbin, pipedream, burpcollaborator) or carry credential-shaped query parameters (token=, api_key=, password=, secret=).

audit-bash.sh is the post-hook. Every Bash command Claude runs gets appended to .claude/audit.log as JSON Lines: timestamp, working directory, ok/err status, the truncated command, and the truncated stdout/stderr it produced. The log rotates at 10 MB and is gitignored. It's the answer to "what did the agent run between when X was healthy and when it broke?", a question I'd rather not be answering by reading terminal scrollback.

There's one more belt-and-braces gate in the CLI itself: a pre-destroy state snapshot. When I (not Claude) run appserver.sh destroy, the script pulls the active terraform state and uploads it to s3://<state-bucket>/state-snapshots/destroy-<UTC-timestamp>.tfstate before running terraform destroy. If the destroy takes something out it shouldn't have, that snapshot is what I rebuild from. Destroy itself runs as admin, not on the deploy role — the deploy boundary's deny on operator-policy mutation blocks terraform from cleaning up the boundaries it depends on, so the script unsets any cached operator profile and mints a fresh MFA-bound admin session via sts:GetSessionToken before doing anything.

One final gate at commit time, before any of the above gets a chance to fire: gitleaks runs gitleaks protect --staged over the staged diff before each commit. There are two install paths — scripts/install-git-hooks.sh drops a native git pre-commit hook, or .pre-commit-config.yaml wires it up for users of the pre-commit framework — and CI runs the same scan. All three honour a shared .gitleaks.toml, so local and CI agree on what counts as a secret.

Update (June 2026): gitleaks is now in security-patch-only mode. Its author, Zach Rice, has moved development to Betterleaks, a drop-in replacement that reads the same .gitleaks.toml and CLI flags. The setup above still works; I'm still on gitleaks for now.

#Layer 3: skills and the CLI

Layer 3 is what Claude actually drives day-to-day. Skills wrap the appserver CLI, so Claude calls subcommands I've already used and tested rather than inventing fresh SSM payloads and AWS CLI calls each time I ask the same thing.

There are three operational skills under .claude/skills/ (the directory has others for pentesting and Spec Kit, but those are author-time, not run-time):

/appserver-ops for infrastructure issues. A fixed four-phase workflow (triage, diagnosis, fix, verify) with helper scripts and a list of common symptom-to-cause mappings, so the troubleshooting path is the same every time.
/cookie-ops for managing the Cookie app. Wraps the cookie_admin Django management command via SSM for user lookups, AI-quota changes, prompt edits, the things I'd otherwise be typing.
/threat-ops for access-log analysis and Cloudflare WAF blocks. Hard rule baked in: never auto-block without user confirmation.

The skills sit on top of scripts/appserver.sh and a small scripts/lib/ of helper modules — about 3,000 lines of bash that Claude and I have built up over the life of the project (most of it Claude's work). The CLI validates app names, encodes JSON safely for SSM, masks values when printing env vars, and asks for an interactive confirmation before anything destructive. The skills call subcommands rather than invent fresh bash, so most of what Claude does on the infrastructure flows through code I've already used and tested.

In practice, a typical "deploy the latest cookie" looks like this. The /cookie-ops skill picks up the request and calls appserver.sh app deploy cookie. The destructive-bash hook checks that command and lets it through (there's an explicit allow assertion for it in the test harness). The CLI validates inputs and asks me to confirm, then ships the new container to the EC2 instance over SSM using the cookie-ops STS session. A failure at any layer stops the deploy.

#How do I know they actually work?

The honest answer used to be "I tested them by hand and the obvious stuff works." That isn't good enough for hooks that decide whether terraform destroy runs. So the hooks now have a self-test harness: test-hooks.sh, 347 lines of assertions that feed crafted JSON inputs into each hook and check the deny/allow decision is the right one.

It covers every catalogued pattern (the deny cases), a long list of legitimate commands that look superficially similar (the allow cases: chmod 755, git remote -v, aws iam list-access-keys, psql -c "DELETE FROM users WHERE id = 1" and so on), and the day-to-day devops workflow: the whole appserver CLI, terraform plan/init/validate, cookie_admin via SSM, routine git, docker ps/logs/exec. The hooks have to let all of that through. It also catches the regex bug class I worry about most: greedy patterns that span shell separators and trigger on the wrong half of a compound command. git push origin main && rm -f /tmp/msg.txt must NOT be flagged as a force push, but git push origin main -f must be. There are explicit assertions for both.

There are 182 cases. They all pass. CI runs them on every push, alongside bash -n, shellcheck of the hook files, and a separate auth-flow harness that exercises the CLI's role mapping. A typo that silently disabled a hook would otherwise be invisible until something destructive went unblocked.

#What this looks like in practice

This is what runs my production server. When I hand devops work to Claude — deploying Cookie, looking into something slow — the IAM, hooks and CLI underneath all hold up. The setup works for me, and it gives me extra confidence to keep using Claude this way.

MFA on the operator roles is the foundation. A leaked access key without a TOTP gets nowhere, and every other layer is built on top of that.

If you've built something similar and there are guardrails I haven't thought of, I'd like to hear them.

The full repo is at github.com/matthewdeaves/appserver. The hooks, the test harness, the skills and the IAM setup are all in there. MIT-licensed — happy for anyone to copy, fork, or lift bits for their own setup.