ClawAudit verdict

portkey-guardrails

88
🟢 Trusted
Low risk — reviewed by ClawAudit, behavior matches stated purpose

A workspace guardrail hook that implements prompt injection detection, PII redaction, and budget guards, running entirely offline with no external services. Detection patterns such as 'Ignore all previous instructions' appear in the skill as interception targets, not as injections.
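To illustrate what "interception target" means here, the following is a minimal sketch of an offline, pattern-based injection check. It is not the skill's actual code: the pattern list, `detect_injection`, and `guard` are hypothetical names chosen for this example, and the real hook's patterns and blocking behavior are not shown in this report.

```python
import re

# Hypothetical pattern list. The phrases are the ones quoted in this report;
# the skill's real pattern set is not shown here.
INJECTION_PATTERNS = [
    re.compile(r"ignore\s+all\s+previous\s+instructions", re.IGNORECASE),
    re.compile(r"\byou\s+are\s+now\b", re.IGNORECASE),
]


def detect_injection(text: str) -> list:
    """Return the regex source of every pattern that matches the text."""
    return [p.pattern for p in INJECTION_PATTERNS if p.search(text)]


def guard(text: str) -> str:
    """Block the message if any pattern matches; otherwise pass it through.

    Everything runs locally on the string; no network call is made.
    """
    if detect_injection(text):
        return "BLOCKED"
    return text
```

Note that the phrase has to appear literally in the guardrail's own source or config for it to be matched against incoming text, which is why a scanner that looks for the same phrase will find it in the skill's files.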

⚠ Flagged for review — coarse, uncorroborated signal, not a confirmed exploit. Review the config yourself before installing.

Automated static analysis, not a human review. ClawAudit flags capabilities, not confirmed intent, and can produce false positives.

Security: 55
Transparency: 100
Maintenance: 90

Findings (4)

Pattern match · critical
Prompt injection: tries to override agent instructions
SKILL.md · frontmatter · Ignore all previous instructions

Pattern match · medium
Possible prompt injection: attempts to redefine agent identity
tests/cases.yaml · prose · downgraded · You are now

Pattern match · low
Base64 encoding/decoding
SKILL.md · prose · downgraded · base64-encode

Pattern match · low
References agent configuration files
SKILL.md · prose · downgraded · agent-config
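
The findings above come from pattern matching, which is why a guardrail that lists attack phrases can be flagged by its own detection strings. The sketch below shows one way such a scanner might work. It is an assumption for illustration only: ClawAudit's real rules, base severities, and downgrade logic are not documented in this report, and `RULES`, `SEVERITIES`, `Finding`, and `scan` are hypothetical names.

```python
import re
from dataclasses import dataclass

# Hypothetical severity ladder, lowest to highest.
SEVERITIES = ["low", "medium", "critical"]

# Hypothetical rules: (pattern, assumed base severity).
RULES = [
    (re.compile(r"ignore all previous instructions", re.IGNORECASE), "critical"),
    (re.compile(r"you are now", re.IGNORECASE), "critical"),
]


@dataclass
class Finding:
    path: str
    context: str      # e.g. "frontmatter" or "prose"
    match: str
    severity: str
    downgraded: bool


def scan(path: str, context: str, text: str) -> list:
    """Flag every rule that matches; lower severity one step for prose matches."""
    findings = []
    for pattern, base in RULES:
        m = pattern.search(text)
        if m is None:
            continue
        # Assumption: a match in prose is treated as weaker evidence.
        downgraded = context == "prose"
        severity = base
        if downgraded:
            severity = SEVERITIES[max(SEVERITIES.index(base) - 1, 0)]
        findings.append(Finding(path, context, m.group(0), severity, downgraded))
    return findings
```

A scanner of this kind sees only that the string is present. It cannot tell whether the string is used as an instruction or as a detection target, which is the source of the false positives the disclaimer mentions.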

Permissions & capabilities

Requires 2 system binaries.
